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CERTIFICATE OF EXPRESS MAILING 

I hereby certify that this paper and the documents 
referred to as being attached or enclosed herewith are 
being deposited with the United States Postal Service 
on May 2, 2001 as Express Mail Label No. 
EL828141 124US in an envelope addressed to: BOX 
PCT, COMMISSIONER FOR PATENTS, 
WASHINGTON, D C. 20231 



William D. Noonan, M.D. 
Attorney for Applicant 



BOX PCT 

COMMISSIONER FOR PATENTS 
WASHINGTON, D.C. 20231 



PRELIMINARY AMENDMENT 



Prior to examination of the above-identified application, please amend the claims as 
follows: 



Please cancel claims 1-34, and insert the following new claims: 



35. (New) An isolated virus (RRV) as deposited with ATCC as deposit accession 
number VR-2601. 



36. (New) A purified virus, having a nucleic acid sequence 

(a) shown in SEQ ID NO: 1 or 

(b) a conservative variant thereof. 

37. (New) The purified virus of claim 36, wherein the nucleic acid sequence has at 
least 95% sequence identity to the nucleic acid sequence shown in SEQ ID NO:l. 
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38. (New) A purified protein encoded by an open reading frame of the virus of claim 



39. (New) A purified protein of claim 38 ? wherein the protein comprises an amino 



(a) an amino acid sequence shown in odd numbered sequences of SEQ ID 
NOS:3-165;and 

(b) amino acid sequences that differ from those specified in (a) by one or 
more conservative amino acid substitutions wherein the function of the protein is 
preserved. 



40. (New) A purified protein with an amino acid sequence that is at least 95% 
sequence identity to the sequences specified in claim 39^ 

41 . (New) The purified protein of claim 39, wherein the amino acid sequence is 
selected from odd numbered sequences within the grou^ consisting of SEQ ID NOS:3-19 and 
23-165. 

42. (New) An isolated nucleic acid molecule encoding a protein according to claim 



43. (New) An isolated nucleic acid molecule according to claim 42, wherein the 
molecule comprises a sequence selected from the group consisting of even numbered sequences 
of SEQ ID NOS:2-164. 



36. 




acid sequence selected from the group consisting of: 



39. 
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44. (New) The isolated nucleic acid molecule according to claim '43, wherein the 
molecule comprises a sequence selected from the group consisting of even numbered sequences 



45. (New) A recombinant nucleic acid molecule comprising a promoter sequence 
operably linked to a nucleic acid molecule according to claim 42/ 

46. (New) A cell transformed with a recombinant nucleic acid molecule according to 
claim 42. 

47. (New) A non-human mammal purposefully infected with the virus of claim 36. 



48. (New) The mammal of claim 47, wherein the mammal is a primate. 

49. (New) An oligonucleotide comprising a sequence selected from the group 
consisting of: 

(a) at least 20 contiguous nucleotides of the nucleic acid sequence of the virus 
of claim 36; 

(b) at least 30 contiguous nucleotides of the nucleic acid sequence of the virus 
of claim 36; and 



of SEQ ID NOS:2-18 and 22-164. 



(c) 



at least 50 contiguous nucleotides of the nucleic acid sequence of the virus 



of claim 36. 



50. (New) An isolated nucleic acid molecule that encodes the protein of claim 40. 



51. 



(New) An isolated nucleic acid molecule encoding a protein of claim 40. 
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52. (New) An isolated nucleic acid molecule encoding all proteins encoded by the 
virus of claim 36, and having a biological activity of an RRV virus. 

53. (New) A method for testing the efficacy of a drug in the treatment of a condition 
associated with the virus of claim 36, the method comprising: 

(a) administering the drug to a non-human primate infected with the virus of 

claim 36; and 

(b) observing the primate to determine if the drug prevents or reduces the 
presentation of one or more symptoms associated with viral infection. 

54. (New) The method of claim 53, wherein the primate is immunocompromised. 

55. (New) The method of claim 54, wherein the drug is for the treatment of Kaposi's 
sarcoma and lymphoproliferative disorders. 

56. (New) The method of claim 54, wherein the primate is immuno-compromised as a 
result of infection by Simian Immunodeficiency Virus (SIV). 

57. (New) The method of claim 53, wherein the condition associated with infection 
with the virus is one or more of B-cell hyperplasia, lymphadenopathy, splenomegaly, 
hypergammaglobulinemia or autoimmune hemolytic anemia. 

58. (New) The method of claim 53, wherein the non-human primate is a Rhesus 
macaque monkey. 
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59. (New) A method for producing a non-human primate model for testing potential 
treatments for a condition associated an infection with the virus of claim 36, comprising 

(a) administering a treatment to the primate to render the primate 
immunocompromised; and 

(b) infecting the primate with the virus of claim 36. 

60. (New) The method of claim 59, wherein the condition is Kaposi's sarcoma and 
lymphoproliferative disorders. 

61 . (New) The method of claim 59, wherein the treatment used to render the primate 
immuno-compromised is infection with SIV. 

62. (New) The method of claim 59, wherein the non-human primate is a Rhesus 
macaque monkey. 

63. (New) A method for testing the efficacy of a candidate vaccine against the virus 
of claim 36, or conditions associated infection with the virus of claim 36, the method 
comprising: 

(a) administering the vaccine to a subject capable of infection with the virus of 

claim 36; 

(b) inoculating the subject with the virus; and 

(c) observing the subject to determine if the vaccine prevents or reduces an 
incidence of viral infection or presentation of one or more conditions associated with the viral 
infection. 

64. (New) The method of claim 63, wherein the subject is a primate. 

65. (New) The method of claim 64, wherein the primate is a non-human primate. 
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66. (New) The method of claim 63, wherein the primate is immunocompromised. 



67. (New) The method of claim 63, wherein the conditions associated with infection 
include B-cell hyperplasia, lymphadenopathy, splenomegaly, hypergammaglobulinemia or 
autoimmune hemolytic anemia. 

68. (New) The method of claim 65, wherein the non-human primate is a Rhesus 
macaque monkey. 



No new matter is added. Entry of this amendment is respectfully requested prior to 
examination. If any minor matters remain to be addressed prior to examination, the Examiner is 
invited to call the undersigned at the telephone number listed below. 



One World Trade Center, Suite 1600 
121 S.W. Salmon Street 
Portland, Oregon 97204 
Telephone: (503)226-7391 
Facsimile: (503) 228-9446 



CONCLUSION 



Respectfully submitted, 



KLARQUIST SPARKMAN CAMPBELL 
LEIGH & WHINSTON, LLP 




William D. Noonan, M.D. 
Registration No. 30,878 
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CLONING OF RHADINO VIRUS GENOME AND METHODS FOR ITS USE 

FIELD OF THE INVENTION 

The invention relates to the genome of a rhesus macaque rhadinovirus and provides 
5 compositions and methods for the production of animal models useful in assessing the efficacy of 
drugs and vaccines in the treatment and prevention of conditions associated with infection by the 
virus, such as Kaposi's sarcoma and lymphoproliferative disorders. 

BACKGROUND 

10 Converging lines of evidence indicate that Kaposi's sarcoma-associated herpesvirus 

(KSHV) is the etiological agent responsible for Kaposi's sarcoma (KS) in individuals with and 
without HIV infection (Chang et al. t 1994, Science 266:1865-9; Schalling et al., 1995, Nature 
Med. 7:707-8; Moore & Chang, 1995, N. Engl J. Med. 332:1181-5; Whitby et al., 1995, Lancet 
346:799-802; Ambroziak et al., 1995, Science 268:582-3.; Dupin et al., 1995, Lancet 345:761-2.; 

15 Chuck et al, 1996, J. Infect. Dis. 173:248-51; O'Neill et al., 1996, 7. Clin. Pathol. 49:306-8; Gao 
et al., 1996, Nature Med. 2:925-8; Kedes et al., 1996, Nature Med. 2:918-24; Gao et al., 1996, 
N. Engl. J, Med. 335:233-41). In addition to KS, KSHV is also responsible for other acquired 
immunodeficiency syndrome (AIDS)-related and non-AIDS-related malignancies, such as primary 
effusion lymphomas. (Cesarman et al., 1995, N. EngL J. Med. 332:1186-91; Nador et al., 1996, 

20 Blood 88:645-56; Otsuki et al, 1996, Leukemia 10:1358-62), and multicentric Castleman's disease 
(MCD), a B cell proliferation disorder associated with overexpression of IL-6 activity (Soulier et 
al., 1995, Blood 86:1276-80; Yoshizaki et al., 1989, Blood 74:1360-7). 

More recently, KSHV has been proposed to be involved in multiple myeloma, a B cell 
abnormality of monoclonal origin (Rettig et al., 1997, Science 276: 1851-4; Said et al., 1997, 

25 Blood 90:4278-82; Parravicini et al., 1997, Science 278: 1969-70; Masood et al., 1997, Science 
278:1970-1; Whitby et al., 1997, Science 278:1971-2; Cottoni et al., 1997, Science 278:1972; 
Brousset et al., 1997, Science 278:290-4). Understanding how KSHV is involved in these 
malignancies is important for the generation of therapies against the spectrum of KSHV-associated 
diseases. 

30 Testing the efficacy of therapeutics and vaccines against any disease, such as KHSV, is 

greatly facilitated by the availability of an animal model, such as a non-human primate model, 
because non-human primates are physiologically very similar to humans. Although such models 
have been developed for the study of HIV infection (for example, U.S. Patent Nos. 5,212,084 and 
5,543,131) none has yet been developed for KSHV infection. 

35 Infection of animals with some herpesviruses, namely Herpesvirus saimiri and murine 

herpesvirus type 68, can cause a lymphoproliferative disorder (LPD). However, these animals are 
not adequate models of KSHV pathogenesis because they lack certain KSHV genes that may 



WO 00/28040 



PCT/US99/26260 



- 2 - 

contribute to viral pathogenesis or influence HIV infection, such as Interleukin 6 (IL-6) and 
macrophage inflammatory protein 1 (MIP-1) (Albrecht et al., 1992, J. ViroL 66:5047-58; Virgin 
et al., 199^&7. ViroL 71:5894-904). Thus, so far the establishment of a non-human primate 
model for KSHV infection has remained elusive. 
5 The present invention addresses this problem, and others, in the development of animal 

models for a variety of pathological conditions and diseases. 



SUMMARY OF THE DISCLOSURE 



10 Rhesus macaques naturally harbor a virus related to KSHV, referred to as RRV, for 

rhesus rhadinovirus. Genetic analysis of RRV reveals the presence of an IL-6-like gene in a 
position analogous to that of the KSHV IL-6. The present disclosure also includes information 
about pathological conditions associated with RRV infection. 

The present invention provides the genomic sequence (nucleotide and amino acid) for the 

15 RRV genome and its use for developing a non-human primate model for KSHV infection. 

The invention includes the genome of the newly isolated Rhesus macaque rhadino virus, RRV 
isolate 17577 (referred to herein as RRV), but the invention includes variant RRV viruses and 
related viruses that infect other species. RRV shows some similarity to human Kaposi's sarcoma- 
associated herpes virus (KSHV, also called HHV8) and possesses genes for both IL-6 and MIP. 

20 The invention encompasses the isolated polynucleotide genome of RRV as shown in SEQ 

ID NO 1, and the identified ORFs (open reading frames) of this genome (even-numbered SEQ ID 
NOS 2-164). Also included within the invention are oligonucleotides comprising at least 15, 20, 
30, 40, 50, 70, 100 and 150 consecutive nucleotides of the genome sequence as shown in SEQ ID 
NO 1. Additionally, the invention encompasses various segments of the RRV genome as shown in 

25 SEQ ID NO 1, for instance, segments consisting of 999 nucleotides, for example, from nucleotide 
1-999, 1000-1999, 2000-2999, 3000-3999, 4000-4999 and so on until the end of the nucleotide 
sequence. Proteins and parts of proteins encoded within such segments are also covered by the 
invention. 

The invention also includes purified proteins encoded by the RRV genome, the amino acid 
30 sequences of which are shown in odd-numbered SEQ ID NOS 3-165. Proteins that have defined 
degrees of sequence identity with the proteins of SEQ ID NO 1 are also within the scope of the 
invention. Such proteins may display, for example, at least 50%, 55%, 60%, 70%, 80%, 90%, 
95% or even 98% or greater amino acid sequence identity with the native proteins. 

The invention further includes nucleic acids encoding the RRV proteins as well as 
35 recombinant nucleic acids that include a promoter operably linked to a nucleic acid that encodes an 
RRV protein. 
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Additionally included are isolated nucleic acid molecules of various defined lengths that 
show at least 50%, 60%, 70%, 80%, 90%, 95%, 98% or 100% sequence identity with an RRV 
ORF sequence, such as the sequence shown in SEQ ID NO 1, or in one of the other sequence 
listings. The invention also includes isolated nucleic acid molecules of various defined lengths that 
5 hybridize with an ORF as shown in SEQ ID NO 1 under wash hybridization conditions of about 
70°C and 0.2 x SSC for 1 hour, or alternatively under less stringent conditions of 65°C, 60°C, or 
55 °C with about 0.2 to 2 x SSC (with, for instance, about 0.1% SDS) for 1 hour. 

Also within the invention are cells and virions that contain the nucleic acid molecules as 
described above. 

10 Additionally the scope of the invention includes the nucleic acid sequences defined by 

nucleotides 1 to 11031 of SEQ ID NO 1 and nucleotides 21625 to 133719 of SEQ ID NO 1, and 
ORFs selected from these nucleic acid sequences. The invention also includes isolated nucleic acid 
molecules of various defined lengths that show at least 50%, 60%, 70%, 80%, 90%, 95% or 98% 
sequence identity with, an ORF contained within nucleotides 1-11031 or 21625-133719 of the 

15 nucleotide sequence as shown in SEQ ID NO 1. Alternatively, the invention includes at least 15, 
20, 30, 40, 50, 70, 100 or 150 consecutive nucleotides within nucleotides 1 to 11031 of SEQ ID 
NO 1 and nucleotides 21625 to 133719 of SEQ ID NO 1, or within ORFs selected from these 
nucleic acid sequences. 

Also included are isolated nucleic acid molecules of various lengths that hybridize under 

20 wash conditions of 70°C and about 0.2 x SSC for 1 hour, or alternatively under less stringent 
conditions of 65°C, 60°C, or 55°C with from about 0.2 to 2 x SSC (with, for instance, about 
0.1% SDS) for 1 hour, with an ORF of nucleotides 1-11031 or 21625-133719 of the nucleotide 
sequence as shown in SEQ ID NO 1 . 

Recombinant molecules are also encompassed within the bounds of the invention, and 

25 include, for instance, a nucleic acid molecule encoding an RRV protein (or fragments or variants 
thereof) linked to a non-native nucleic acid sequence such as a promoter. The nucleic acid 
molecule linked to the promoter may be all or part of an ORF encoding an RRV protein, such as 
any ORF of SEQ ID NO 1 , may be one or more fragments of a DNA sequence selected from the 
DNA sequence defined by nucleotides 1 to 11031 and nucleotides 21625 to 133719 as shown in 

30 SEQ ID NO 1, or DNA sequences encoding variants or fragments of proteins encoded by those 
sequences. 

The present invention also relates to the isolation of a virus (RRV) from a rhesus macaque 
monkey which, when experimentally introduced into immuno-compromised macaques, produces 
pathological conditions, such as disease signs and symptoms, that parallel those seen in human 
35 subjects infected with KSHV, including lymphoproliferative disorders (LPD), lymphadenopathy, 
splenomegaly, B cell hyperplasia, autoimmune hemolytic anemia, retroperitoneal fibromatosis (a 
Kaposi's sarcoma-like mesenchymal proliferative disease of body cavities), and 
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hypergammaglobulinemia, wherein the virus encodes homologues of IL-6 and MIP-1 which are 
similar to KSHV. 

One aspect of the present invention is the isolated virus, RRV, and related species and 
other variants thereof. In another aspect of the invention, the virus is used to produce a non- 
5 human primate model for KSHV infection, or diseases associated with RRV infection; such a 
model may be produced, for example, by infecting a non-human primate (such as an 
immunocompromised non-human primate) with RRV. This model may thus be used to evaluate 
the efficacy of candidate therapeutics and vaccines for KSHV infection treatment and prophylaxis, 
or other pathological conditions associated with RRV infection. Although it is not required that 
10 the primate be first immuno-compromised and then infected with RRV, particular embodiments of 
the animal model include both infecting the primate with the virus and rendering it immuno- 
compromised (or equivalently obtaining an already immunocompromised primate). 

In another embodiment, the invention provides a method for testing the efficacy of a drug 
in the treatment of Kaposi's sarcoma and lymphoproliferative disorders or other pathological 
15 conditions associated with RRV infection, by administering the drug to an immuno-compromised 
non-human primate infected with RRV, and then observing the primate to determine if the drug 
prevents or reduces the presentation of one or more signs, symptoms, laboratory abnormalities, or 
other pathological conditions associated with infection with the virus. Such conditions include B- 
cell hyperplasia, lymphadenopathy, splenomegaly, hypergammaglobinulinemia, retroperitoneal 
20 fibromatosis (a Kaposi's sarcoma-like mesenchymal proliferative disease of body cavities), and 
autoimmune hemolytic anemia. The efficacy of a vaccine to prevent KSHV infection, or 
pathological conditions associated with RRV infection, may similarly be assessed by administering 
the candidate vaccine to the animal and then attempting to infect the animal with RRV. In 
particular embodiments, the animal to which the candidate vaccine is administered may be an 
25 immunocompromised animal. Failure to infect the animal, when control animals not given the 
candidate vaccine do become infected, indicates that the vaccine conferred protection. 

The foregoing and other objects, features, and advantages of the invention will become 
more apparent from the following detailed description of several examples which proceeds with 
reference to the accompanying figures. 

30 

BRIEF DESCRIPTION OF THE FIGURES 
FIG. 1 shows a phylogenetic comparison of the gammaherpesviruses Epstein-Barr virus 
(EBV), Alcelaphine herpesvirus (AHV), Murine herpesvirus (MHV), Herpesvirus saimiri (HVS), 
Kaposi's sarcoma-associated herpesvirus (KSHV), and Rheusus rhadinovirus 17577 (RRV). It 
35 shows that among the known sequenced viruses, RRV is the closest relative to KSHV, using an 
accepted maximum parsimony method of evaluating evolutionary relationships. 

FIG. 2 is a table showing the BamHI, EcoRl and Hindlll restriction fragments of the RRV 
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genome. 

FIG. 3 is a schematic map of the ORFs of RRV. Arrow direction represents direction of 
transcription. 

FIG. 4 is a table showing the size, location and description (similarity to other proteins) of 
5 the proteins encoded by the ORFs of RRV. 

FIG. 5 is a table showing a comparison of corresponding repeats in RRV and KSHV. 
FIG. 6 is a table showing the comparison of interferon regulatory elements encoded by 
RRV and KSHV. 

FIG. 7 is a table comparing the ORFs of RRV, KSHV and HVS. The table shows the 
10 start and stop points, the strand (+ or -) from which the ORF is transcribed, the size of the ORFs 
and the percentage similarity of KSHV and HVS when compared with RRV. 

FIGS. 8A-8D are graphs showing CD20+ lymphocytes, antibody response and RhKSHV 
isolation/detection in macaques infected with SIVmac239 and RRV (A) 18483 and (B) 18570 and 
macaques infected with SIVmac239 only (C) 18503 and (D) 18540. A " + w indicates positive for 
15 virus culture or viral DNA, as defined by PCR and Southern blot analysis; negative for virus 
culture or viral DN A. 

FIG. 9 shows the result of the PCR analysis of PBLs and LNMCs from each of the 
macaques (18483, 18503, 18540 and 18570) for RRV DNA and P-globin in (A) graphical form 
and (B) digital form. 

20 FIG. 10 shows the DNA sequence of the RRV ORF that encodes the IL-6 protein. The 

corresponding translated polypeptide sequence is shown in standard three letter code below the 
DNA sequence. 

FIG. 1 1 shows the DNA sequence of the RRV ORF that encodes the MIP protein. The 
corresponding translated polypeptide sequence is shown in standard three letter code below the 
25 DNA sequence. 



SEQUENCE LISTING 
The nucleic and amino acid sequences listed in the accompanying sequence listing are 
shown using standard letter abbreviations for nucleotide bases, and the code for amino acids. Only 
30 one strand of each nucleic acid sequence is shown, but the complementary strand is understood as 
included by any reference to the displayed strand. 

SEQ ID NO 1 shows the nucleotide sequence of the RRV genome. 
SEQ ID NO 2 shows the cDNA nucleotide sequence of RRV Rl, corresponding to 
nucleotides 1353-2624 of SEQ ID NO 1. 
35 SEQ ID NO 3 shows the amino acid sequence of the RRV Rl protein. 

SEQ ID NO 4 shows the cDNA nucleotide sequence of RRV ORF 2, corresponding to the 
complement of nucleotides 2692-3258 of SEQ ID NO 1 , which encodes dihydrofolate reductase, 
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and which has some similarity to Kaposi's sarcoma-associated herpesvirus (KSHV) ORF 2. 

SEQ ID NO 5 shows the amino acid sequence of the ORF2 protein, dihydrofolate 
reductase protein, which has some similarity to KSHV ORF 2 protein. 

SEQ ID NO 6 shows the cDNA nucleotide sequence of RRV ORF 4, corresponding to 
5 nucleotides 3676-5613 of SEQ ID NO 1, which encodes complement binding protein, and which 
has some similarity to KSHV ORF 4. 

SEQ ID NO 7 shows the amino acid sequence of the RRV ORF 4 protein, complement 
binding protein, corresponding to nucleotides 6045-9443 of SEQ ID NO 1 , and which has some 
similarity to KSHV ORF 4 protein, 
10 SEQ ID NO 8 shows the cDNA nucleotide sequence of RRV ORF 6, corresponding to 

nucleotides 6045-9443 of SEQ ID NO 1, which encodes ssDNA binding protein, and which has 
some similarity to KSHV ORF 6. 

SEQ ID NO 9 shows the amino acid sequence of the RRV ORF 6 protein, ssDNA binding 
protein, which has some similarity to KSHV ORF 6 protein. 
15 SEQ ID NO 10 shows the cDNA nucleotide sequence of RRV ORF 7, corresponding to 

nucleotides 9468-11528 of SEQ ID NO 1, which encodes a transport protein, and which has some 
similarity to KSHV ORF 7, 

SEQ ID NO 1 1 shows the amino acid sequence of the RRV ORF 7 protein, transport 
protein, which has some similarity to KSHV ORF 7 protein. 
20 SEQ ID NO 12 shows the cDNA nucleotide sequence of RRV ORF 8, corresponding to 

nucleotides 11515-14004 of SEQ ID NO 1, which encodes glycoprotein B, and which has some 
similarity to KSHV ORF 8. 

SEQ ID NO 13 shows the amino acid sequence of the RRV ORF 8 protein, glycoprotein 
B protein, which has some similarity to KSHV ORF 8 protein. 
25 SEQ ID NO 14 shows the cDNA nucleotide sequence of RRV ORF 9, DNA polymerase 

protein, corresponding to nucleotides 14122-17166 of SEQ ID NO 1, which has some similarity to 
KSHV ORF 9. 

SEQ ID NO 15 shows the amino acid sequence of the RRV ORF 9 protein, DNA 
polymerase protein, which has some similarity to KSHV ORF 9. 
30 SEQ ID NO 16 shows the cDNA nucleotide sequence of RRV ORF 10, corresponding to 

nucleotides 17261-18511 of SEQ ID NO 1, which has some similarity to KSHV ORF 10. 

SEQ ID NO 17 shows the amino acid sequence of the RRV ORF 10 protein, which has 
some similarity to KSHV ORF 10. 

SEQ ID NO 18 shows the cDNA nucleotide sequence of RRV ORF 11, corresponding to 
35 nucleotides 18520-19749 of SEQ ID NO 1, which has some similarity to KSHV ORF 11. 

SEQ ID NO 19 shows the amino acid sequence of the RRV ORF 1 1 protein, which has 
some similarity to KSHV ORF 1 1 . 
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SEQ ID NO 20 shows the cDNA nucleotide sequence of RRV R2, corresponding to the 
complement of nucleotides 19921-20544 of SEQ ID NO 1, which has some similarity to the 
Kaposi's sarcoma-associated IL-6 gene. 

SEQ ID NO 21 shows the amino acid sequence of the RRV R2 protein which has some 

5 similarity to IL-6. 

SEQ ID NO 22 shows the cDNA nucleotide sequence of RRV ORF 70, thymidylate 
synthase, corresponding to the complement of nucleotides 20777-21778 of SEQ ID NO 1, and 
which has some similarity to KSHV ORF 70. 

SEQ ID NO 23 shows the amino acid sequence of the RRV ORF 70 protein, thymidylate 
10 synthase, which has some similarity to KSHV ORF 70 protein. 

SEQ ID NO 24 shows the cDNA nucleotide sequence of RRV R3, corresponding to the 
complement of nucleotides 22245-22592 of SEQ ID NO 1 , which has some similarity to the KSHV 
K4 viral MIP gene. 

SEQ ID NO 25 shows the amino acid sequence of the RRV R3 protein, which has some 
15 similarity to KSHV K4 viral MIP protein. 

SEQ ID NO 26 shows the cDNA nucleotide sequence of RRV ORF 16, a Bcl-2 homolog, 
corresponding to nucleotides 26846-27409 of SEQ ID NO 1, which has some similarity to KSHV 
ORF 16. 

SEQ ID NO 27 shows the amino acid sequence of the RRV ORF 16 protein, a Bcl-2 
20 protein homolog, which has some similarity to KSHV ORF 16 protein. 

SEQ ID NO 28 shows the cDNA nucleotide sequence of RRV ORF 17, corresponding to 
the complement of nucleotides 27515-29125 of SEQ ID NO 1, encoding a capsid protein, which 
has some similarity to KSHV ORF 17. 

SEQ ID NO 29 shows the amino acid sequence of the RRV ORF 17 protein, a capsid 
25 protein, which has some similarity to KSHV ORF 17 protein. 

SEQ ID NO 30 shows the cDNA nucleotide sequence of RRV ORF 18, corresponding to 
nucleotides 28998-29897 of SEQ ID NO 1, which has some similarity to KSHV ORF 18. 

SEQ ID NO 31 shows the amino acid sequence of the RRV ORF 18 protein, which has 
some similarity to KSHV ORF 18 protein. 
30 SEQ ID NO 32 shows the cDNA nucleotide sequence of RRV ORF 19, corresponding to 

the complement of nucleotides 29905-31548 of SEQ ID NO 1, which encodes a tegument protein, 
and which has some similarity to KSHV ORF 19. 

SEQ ID NO 33 shows the amino acid sequence of the RRV ORF 19 protein, a tegument 
protein, which has some similarity to KSHV ORF 19 protein. 
35 SEQ ID NO 34 shows the cDNA nucleotide sequence of RRV ORF 20, corresponding to 

the complement of nucleotides 31043-32095 of SEQ ID NO 1, and which has some similarity to 
KSHV ORF 20. 
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SEQ ID NO 35 shows the amino acid sequence of the RRV ORF 19 protein, which has 
some similarity to KSHV ORF 19 protein. 

SEQ ID NO 36 shows the cDNA nucleotide sequence of RRV ORF 21 , corresponding to 
nucleotides 32094-33767 of SEQ ID NO 1 , which encodes a thymidine kinase protein, and which 
5 has some similarity to KSHV ORF 21 . 

SEQ ID NO 37 shows the amino acid sequence of the RRV ORF 21 protein, a thymidine 
kinase protein, which has some similarity to KSHV ORF 21 protein. 

SEQ ID NO 38 shows the cDNA nucleotide sequence of RRV ORF 22, corresponding to 
nucleotides 33754-35868 of SEQ ID NO 1, and which encodes a glycoprotein H protein, and 
10 which has some similarity to KSHV ORF 22. 

SEQ ID NO 39 shows the amino acid sequence of the RRV ORF 22 protein, a 
glycoprotein H protein, which has some similarity to KSHV ORF 22 protein. 

SEQ ID NO 40 shows the cDNA nucleotide sequence of RRV ORF 23, corresponding to 
the complement of nucleotides 35865-37073 of SEQ ID NO 1, which has some similarity to KSHV 
15 ORF 23. 

SEQ ID NO 41 shows the amino acid sequence of the RRV ORF 23 protein, which has 
some similarity to KSHV ORF 23 protein. 

SEQ ID NO 42 shows the cDNA nucleotide sequence of RRV ORF 24, corresponding to 
the complement of nucleotides 37123-39321 of SEQ ID NO 1, and which has some similarity to 
20 KSHV ORF 24. 

SEQ ID NO 43 shows the amino acid sequence of the RRV ORF 24 protein, which has 
some similarity to KSHV ORF 24 protein. 

SEQ ID NO 44 shows the cDNA nucleotide sequence of RRV ORF 25, corresponding to 
nucleotides 39323-43459 of SEQ ID NO 1, which encodes a major capsid protein, and which has 
25 some similarity to KSHV ORF 25. 

SEQ ID NO 45 shows the amino acid sequence of the RRV ORF 25 protein, a major 
capsid protein, which has some similarity to KSHV ORF 25 protein. 

SEQ ID NO 46 shows the cDNA nucleotide sequence of RRV ORF 26, corresponding to 
nucleotides 43491-44408 of SEQ ID NO 1, which encodes a capsid protein, and which has some 
30 similarity to KSHV ORF 26. 

SEQ ID NO 47 shows the amino acid sequence of the RRV ORF 26 protein, a capsid 
protein, which has some similarity to KSHV ORF 26 protein. 

SEQ ID NO 48 shows the cDNA nucleotide sequence of RRV ORF 27, corresponding to 
nucleotides 44433-45242 of SEQ ID NO 1, and which has some similarity to KSHV ORF 27. 
35 SEQ ID NO 49 shows the amino acid sequence of the RRV ORF 27 protein, which has 

some similarity to KSHV ORF 27 protein. 

SEQ ID NO 50 shows the cDNA nucleotide sequence of RRV ORF 28, corresponding to 
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nucleotides 45408-45683 of SEQ ID NO 1, and which has some similarity to KSHV ORF 28. 

SEQ ID NO 51 shows the amino acid sequence of the RRV ORF 28 protein, which has 
some simil^ty to KSHV ORF 28 protein. 

SEQ ID NO 52 shows the cDNA nucleotide sequence of RRV ORF 29b, corresponding to 
5 the complement of nucleotides 45733-46779 of SEQ ID NO 1, and which has some similarity to 
KSHV ORF 29b. 

SEQ ID NO 53 shows the amino acid sequence of the RRV ORF 29b, which has some 
similarity to KSHV ORF 29b protein. 

SEQ ID NO 54 shows the cDNA nucleotide sequence of RRV ORF 30, corresponding to 
10 nucleotides 46905-47135 of SEQ ID NO 1, and which has some similarity to KSHV ORF 30. 

SEQ ID NO 55 shows the amino acid sequence of the RRV ORF 30 protein, which has 
some similarity to KSHV ORF 30 protein. 

SEQ ID NO 56 shows the cDNA nucleotide sequence of RRV ORF 31, corresponding to 
nucleotides 47093-47746 of SEQ ID NO 1, and which has some similarity to KSHV ORF 31. 
15 SEQ ID NO 57 shows the amino acid sequence of the RRV ORF 31, protein which has 

some similarity to KSHV ORF 31 protein. 

SEQ ID NO 58 shows the cDNA nucleotide sequence of RRV ORF 32, corresponding to 
nucleotides 47683-49077 of SEQ ID NO 1, and which has some similarity to KSHV ORF 32. 

SEQ ID NO 59 shows the amino acid sequence of the RRV ORF 32 protein, which has 
20 some similarity to KSHV ORF 32 protein. 

SEQ ID NO 60 shows the cDNA nucleotide sequence of RRV ORF 33, corresponding to 
nucleotides 49049-50059 of SEQ ID NO 1, and which has some similarity to KSHV ORF 33. 

SEQ ID NO 61 shows the amino acid sequence of the RRV ORF 33 protein, which has 
some similarity to KSHV ORF 33 protein. 
25 SEQ ID NO 62 shows the cDNA nucleotide sequence of RRV ORF 29a, corresponding to 

the complement of nucleotides 49977-50960 of SEQ ID NO 1, and which has some similarity to 
KSHV ORF 29a. 

SEQ ID NO 63 shows the amino acid sequence of the RRV ORF 29a protein, which has 
some similarity to KSHV ORF 29a protein. 
30 SEQ ID NO 64 shows the cDNA nucleotide sequence of RRV ORF 34, corresponding to 

nucleotides 50959-51942 of SEQ ID NO 1, and which has some similarity to KSHV ORF 34. 

SEQ ID NO 65 shows the amino acid sequence of the RRV ORF 34 protein, which has 
some similarity to KSHV ORF 34 protein. 

SEQ ID NO 66 shows the cDNA nucleotide sequence of RRV ORF 35, corresponding to 
35 nucleotides 51923-52372 of SEQ ID NO 1, and which has some similarity to KSHV ORF 35. 

SEQ ID NO 67 shows the amino acid sequence of the RRV ORF 35 protein, which has 
some similarity to KSHV ORF 35 protein. 
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SEQ ID NO 68 shows the cDNA nucleotide sequence of RRV ORF 36, corresponding to 
nucleotides 52278-53585 of SEQ ID NO 1 , which encodes a kinase, and which has some similarity 
to KSHV ORF 36. 

SEQ ID NO 69 shows the amino acid sequence of the RRV ORF 36 protein, a kinase, 
5 which has some similarity to KSHV ORF 36 protein. 

SEQ ID NO 70 shows the cDNA nucleotide sequence of RRV ORF 37, corresponding to 
nucleotides 53566-55008 of SEQ ID NO 1, which encodes an alkaline exonuclease, and which has 
some similarity to KSHV ORF 37. 

SEQ ID NO 71 shows the amino acid sequence of the RRV ORF 37 protein, an alkaline 
10 exonuclease protein, which has some similarity to KSHV ORF 37 protein. 

SEQ ID NO 72 shows the cDNA nucleotide sequence of RRV ORF 38, corresponding to 
nucleotides 54963-55172 of SEQ ID NO 1, and which has some similarity to KSHV ORF 38. 

SEQ ID NO 73 shows the amino acid sequence of the RRV ORF 38 protein, which has 
some similarity to KSHV ORF 38 protein. 
15 SEQ ID NO 74 shows the cDNA nucleotide sequence of RRV ORF 39, corresponding to 

the complement of nucleotides 55255-56391 of SEQ ID NO 1, which encodes glycoprotein M, and 
which has some similarity to KSHV ORF 39. 

SEQ ID NO 75 shows the amino acid sequence of the RRV ORF 39 protein, glycoprotein 
M, which has some similarity to KSHV ORF 39 protein. 
20 SEQ ID NO 76 shows the cDNA nucleotide sequence of RRV ORF 40, corresponding to 

nucleotides 56526-57932 of SEQ ID NO 1, which encodes helicase/primase, and which has some 
similarity to KSHV ORF 40. 

SEQ ID NO 77 shows the amino acid sequence of the RRV ORF 40 protein, 
helicase/primase, which has some similarity to KSHV ORF 40 protein. 
25 SEQ ID NO 78 shows the cDNA nucleotide sequence of RRV ORF 41, corresponding to 

nucleotides 57917-58528 of SEQ ID NO 1, which encodes helicase/primase, and which has some 
similarity to KSHV ORF 41. 

SEQ ID NO 79 shows the amino acid sequence of the RRV ORF 41 protein, 
helicase/primase, which has some similarity to KSHV ORF 41 protein. 
30 SEQ ID NO 80 shows the cDNA nucleotide sequence of RRV ORF 42, corresponding to 

the complement of nucleotides 58525-59343 of SEQ ID NO 1, which has some similarity to KSHV 
ORF 42. 

SEQ ID NO 81 shows the amino acid sequence of the RRV ORF 42 protein, which has 
some similarity to KSHV ORF 42 protein. 
35 SEQ ID NO 82 shows the cDNA nucleotide sequence of RRV ORF 43, corresponding to 

the complement of nucleotides 59297-61027 of SEQ ID NO 1 , which encodes a capsid protein, 
and which has some similarity to KSHV ORF 43. 
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SEQ ID NO 83 shows the amino acid sequence of the RRV ORF 43 protein, a capsid 
protein, which has some similarity to KSHV ORF 43 protein. 

SEQ ID NO 84 shows the cDNA nucleotide sequence of RRV ORF 44, corresponding to 
nucleotides 60966-63338 of SEQ ID NO 1 , which encodes helicase/primase, and which has some 
5 similarity to KSHV ORF 44. 

SEQ ID NO 85 shows the amino acid sequence of the RRV ORF 44 protein, 
helicase/primase, which has some similarity to KSHV ORF 44 protein. 

SEQ ID NO 86 shows the cDNA nucleotide sequence of RRV ORF 45, corresponding to 
the complement of nucleotides 63379-64437 of SEQ ID NO 1 , and which has some similarity to 

10 KSHV ORF 45. 

SEQ ID NO 87 shows the amino acid sequence of the RRV ORF 45 protein, which has 
some similarity to KSHV ORF 45 protein. 

SEQ ID NO 88 shows the cDNA nucleotide sequence of RRV ORF 46, corresponding to 
the complement of nucleotides 64479-65246 of SEQ ID NO 1 , which encodes uracil DNA 
15 glucosidase, and which has some similarity to KSHV ORF 46. 

SEQ ID NO 89 shows the amino acid sequence of the RRV ORF 46 protein, uracil DNA 
glucosidase protein, which has some similarity to KSHV ORF 46 protein. 

SEQ ID NO 90 shows the cDNA nucleotide sequence of RRV ORF 47, corresponding to 
the complement of nucleotides 65222-65731 of SEQ ID NO 1, which encodes glycoprotein L, 
20 which has some similarity to KSHV ORF 47. 

SEQ ID NO 91 shows the amino acid sequence of the RRV ORF 47 protein, glycoprotein 
L, which has some similarity to KSHV ORF 47 protein. 

SEQ ID NO 92 shows the cDNA nucleotide sequence of RRV ORF 48, corresponding to 
the complement of nucleotides 65999-67168 of SEQ ID NO 1, and which has some similarity to 
25 KSHV ORF 48. 

SEQ ID NO 93 shows the amino acid sequence of the RRV ORF 48 protein, which has 
some similarity to KSHV ORF 48 protein. 

SEQ ID NO 94 shows the cDNA nucleotide sequence of RRV ORF 49, corresponding to 
the complement of nucleotides 67398-68303 of SEQ ID NO 1, and which has some similarity to 

30 KSHV ORF 49. 

SEQ ID NO 95 shows the amino acid sequence of the RRV ORF 49 protein, which has 
some similarity to KSHV ORF 49 protein. 

SEQ ID NO 96 shows the cDNA nucleotide sequence of RRV ORF 50, corresponding to 
nucleotides 68494-70038 of SEQ ID NO 1, which encodes a transactivator, and which has some 
35 similarity to KSHV ORF 50. 

SEQ ID NO 97 shows the amino acid sequence of the RRV ORF 50 protein, a 
transactivator protein, which has some similarity to KSHV ORF 50 protein. 
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SEQ ID NO 98 shows the cDNA nucleotide sequence of RRV R4, corresponding to 
nucleotides 70355-70888 of SEQ ID NO 1. 

SEQ ID NO 99 shows the amino acid sequence of the RRV R4 protein. 

SEQ ID NO 100 shows the cDNA nucleotide sequence of RRV R5, corresponding to 
5 nucleotides 71468-72160 of SEQ ID NO 1. 

SEQ ID NO 101 shows the amino acid sequence of the RRV R5 protein. 

SEQ ID NO 102 shows the cDNA nucleotide sequence of RRV ORF 52, corresponding to 
the complement of nucleotides 72401-72820 of SEQ ID NO 1, and which has some similarity to 
KSHV ORF 52. 

10 SEQ ID NO 103 shows the amino acid sequence of the RRV ORF 52 protein, which has 

some similarity to KSHV ORF 52 protein. 

SEQ ID NO 104 shows the cDNA nucleotide sequence of RRV ORF 53, corresponding to 
the complement of nucleotides 72884-73198 of SEQ ID NO 1, and which has some similarity to 
KSHV ORF 53. 

15 SEQ ID NO 105 shows the amino acid sequence of the RRV ORF 53 protein, which has 

some similarity to KSHV ORF 53 protein. 

SEQ ID NO 106 shows the cDNA nucleotide sequence of RRV ORF 54, corresponding to 

nucleotides 73274-74146 of SEQ ID NO 1, which encodes a dUTPase, and which has some 

similarity to KSHV ORF 54. 
20 SEQ ID NO 107 shows the amino acid sequence of the RRV ORF 54 protein, a dUTPase 

protein, which has some similarity to KSHV ORF 54 protein. 

SEQ ID NO 108 shows the cDNA nucleotide sequence of RRV ORF 55, corresponding to 

the complement of nucleotides 74207-74839 of SEQ ID NO 1 , and which has some similarity to 

KSHV ORF 55. 

25 SEQ ID NO 109 shows the amino acid sequence of the RRV ORF 55 protein, which has 

some similarity to KSHV ORF 55 protein. 

SEQ ID NO 1 10 shows the cDNA nucleotide sequence of RRV ORF 56, corresponding to 
nucleotides 74851-77337 of SEQ ID NO 1, which encodes a DNA replication protein, and which 
has some similarity to KSHV ORF 56. 
30 SEQ ID NO 111 shows the amino acid sequence of the RRV ORF 56 protein, a DNA 

replication protein, which has some similarity to KSHV ORF 56 protein. 

SEQ ID NO 112 shows the cDNA nucleotide sequence of RRV ORF 57, corresponding to 
nucleotides 77578-78906 of SEQ ID NO 1, which encodes an immediate-early gene product, and 
which has some similarity to KSHV ORF 57. 
35 SEQ ID NO 113 shows the amino acid sequence of the RRV ORF 57 protein, a 

immediate-early gene product protein, which has some similarity to KSHV ORF 57 protein. 

SEQ ID NO 114 shows the cDNA nucleotide sequence of RRV R6, corresponding to the 
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complement of nucleotides 79266-80513 of SEQ ID NO 1 , which has some similarity to KSHV 
vIRF K9 gene. 

SEQ ID NO 115 shows the amino acid sequence of the RRV R6 protein, which has some 
similarity to KSHV vIRF K9 protein. 
5 SEQ ID NO 1 16 shows the cDNA nucleotide sequence of RRV R7, corresponding to the 

complement of nucleotides 80686-81933 of SEQ ID NO 1, which has some similarity to KSHV 
vIRF K9 gene. 

SEQ ID NO 1 17 shows the amino acid sequence of the RRV R7 protein, which has some 
similarity to KSHV vIRF K9 protein. 
10 SEQ ID NO 118 shows the cDNA nucleotide sequence of RRV R8, corresponding to the 

complement of nucleotides 82262-83317 of SEQ ID NO 1, which has some similarity to KSHV 
vIRF K9 gene. 

SEQ ID NO 1 19 shows the amino acid sequence of the RRV R8 protein, which has some 
similarity to KSHV vIRF K9 protein. 
15 SEQ ID NO 120 shows the cDNA nucleotide sequence of RRV R9, corresponding to the 

complement of nucleotides 83491-84252 of SEQ ID NO 1 , which has some similarity to KSHV 
vIRF K9 gene. 

SEQ ID NO 121 shows the amino acid sequence of the RRV R9 protein, which has some 
similarity to KSHV vIRF K9 protein. 
20 SEQ ID NO 122 shows the cDNA nucleotide sequence of RRV R10, corresponding to the 

complement of nucleotides 85052-86209 of SEQ ID NO 1, which has some similarity to KSHV 
vIRF K9 gene. 

SEQ ID NO 123 shows the amino acid sequence of the RRV R10 protein, which has some 
similarity to KSHV vIRF K9 protein. 
25 SEQ ID NO 124 shows the cDNA nucleotide sequence of RRV Rl 1 , corresponding to the 

complement of nucleotides 86355-87527 of SEQ ID NO 1 .which has some similarity to KSHV 
vIRF K9 gene. 

SEQ ID NO 125 shows the amino acid sequence of the RRV Rl 1 protein, which has some 
similarity to KSHV vIRF K9 protein. 
30 SEQ ID NO 126 shows the cDNA nucleotide sequence of RRV R12, corresponding to the 

complement of nucleotides 87894-88961 of SEQ ID NO 1, which has some similarity to KSHV 
vIRF K9 gene. 

SEQ ID NO 127 shows the amino acid sequence of the RRV R12 protein, which has some 
similarity to KSHV vIRF K9 protein. 
35 SEQ ID NO 128 shows the cDNA nucleotide sequence of RRV R13, corresponding to the 

complement of nucleotides 89122-90216 of SEQ ID NO 1, which has some similarity to KSHV 
vIRF K9 gene. 
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SEQ ID NO 129 shows the amino acid sequence of the RRV R13 protein, which has some 
similarity to KSHV vIRF K9 protein. 

SEQ ID NO 130 shows the cDNA nucleotide sequence of RRV ORF 58, corresponding to 
the complement of nucleotides 90462-91544 of SEQ ID NO 1, which has some similarity to KSHV 
5 ORF 58. 

SEQ ID NO 131 shows the amino acid sequence of the RRV ORF 58 protein, which has 
some similarity to KSHV ORF 58 protein. 

SEQ ID NO 132 shows the cDNA nucleotide sequence of RRV ORF 59, corresponding to 
the complement of nucleotides 91555-92739 of SEQ ID NO 1, which encodes a DNA replication 
10 protein, and which has some similarity to KSHV ORF 59. 

SEQ ID NO 133 shows the amino acid sequence of the RRV ORF 59 protein, a DNA 
replication protein, which has some similarity to KSHV ORF 59 protein. 

SEQ ID NO 134 shows the cDNA nucleotide sequence of RRV ORF 60, corresponding to 
the complement of nucleotides 92868-93812 of SEQ ID NO 1, which encodes a small 
15 ribonucleotide reductase protein, and which has some similarity to KSHV ORF 60. 

SEQ ID NO 135 shows the amino acid sequence of the RRV ORF 60 protein, a small 
ribonucleotide reductase protein, which has some similarity to KSHV ORF 60 protein. 

SEQ ID NO 136 shows the cDNA nucleotide sequence of RRV ORF 61, corresponding to 
the complement of nucleotides 93794-96160 of SEQ ID NO 1, which encodes a large 
20 ribonucleotide reductase protein, and which has some similarity to KSHV ORF 61. 

SEQ ID NO 137 shows the amino acid sequence of the RRV ORF 61 protein, a large 
ribonucleotide reductase protein, which has some similarity to KSHV ORF 61 protein. 

SEQ ID NO 138 shows the cDNA nucleotide sequence of RRV ORF 62, corresponding to 
the complement of nucleotides 96163-97158 of SEQ ID NO 1, which encodes a assembly /DNA 
25 maturation protein, and which has some similarity to KSHV ORF 62. 

SEQ ID NO 139 shows the amino acid sequence of the RRV ORF 62 protein, a 
assembly /DNA maturation protein, which has some similarity to KSHV ORF 62 protein. 

SEQ ID NO 140 shows the cDNA nucleotide sequence of RRV ORF 63, corresponding to 
nucleotides 97157-99976 of SEQ ID NO 1, which encodes a tegument protein, and which has 
30 some similarity to KSHV ORF 63. 

SEQ ID NO 141 shows the amino acid sequence of the RRV ORF 63 protein, a tegument 
protein, which has some similarity to KSHV ORF 63 protein. 

SEQ ID NO 142 shows the cDNA nucleotide sequence of RRV ORF 64, corresponding to 
nucleotides 99980-107626 of SEQ ID NO 1, which encodes a tegument protein, and which has 
35 some similarity to KSHV ORF 64. 

SEQ ID NO 143 shows the amino acid sequence of the RRV ORF 64 protein, a tegument 
protein, which has some similarity to KSHV ORF 64 protein. 
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SEQ ID NO 144 shows the cDNA nucleotide sequence of RRV ORF 65, corresponding to 
the complement of nucleotides 107637-108146 of SEQ ID NO 1, which encodes a capsid protein, 
and which has some similarity to KSHV ORF 65. 

SEQ ID NO 145 shows the amino acid sequence of the RRV ORF 65 protein, a capsid 
5 protein, which has some similarity to KSHV ORF 65 protein. 

SEQ ID NO 146 shows the cDNA nucleotide sequence of RRV ORF 66, corresponding to 
the complement of nucleotides 108152-109498 of SEQ ID NO 1, which has some similarity to 
KSHV ORF 66. 

SEQ ID NO 147 shows the amino acid sequence of the RRV ORF 66 protein, which has 
10 some similarity to KSHV ORF 66 protein. 

SEQ ID NO 148 shows the cDNA nucleotide sequence of RRV ORF 67, corresponding to 
the complement of nucleotides 109524-110198 of SEQ ID NO 1, which encodes a tegument 
protein, and which has some similarity to KSHV ORF 67. 

SEQ ID NO 149 shows the amino acid sequence of the RRV ORF 67 protein, a tegument 
15 protein, which has some similarity to KSHV ORF 67 protein. 

SEQ ID NO 150 shows the cDNA nucleotide sequence of RRV ORF 68, corresponding to 
nucleotides 110609-111982 of SEQ ID NO 1, which encodes a glycoprotein, and which has some 
similarity to KSHV ORF 68. 

SEQ ID NO 151 shows the amino acid sequence of the RRV ORF 68 protein, a 
20 glycoprotein, which has some similarity to KSHV ORF 68 protein. 

SEQ ID NO 152 shows the cDNA nucleotide sequence of RRV ORF 69, corresponding to 
nucleotides 112004-112897 of SEQ ID NO 1, which has some similarity to KSHV ORF 69. 

SEQ ID NO 153 shows the amino acid sequence of the RRV ORF 69 protein, which has 
some similarity to KSHV ORF 69 protein. 
25 SEQ ID NO 154 shows the cDNA nucleotide sequence of RRV ORF 71, corresponding to 

the complement of nucleotides 119211-119735 of SEQ ID NO 1, which encodes a FLIP protein, 
and which has some similarity to KSHV ORF 71. 

SEQ ID NO 155 shows the amino acid sequence of the RRV ORF 71 protein, a FLIP 
protein, which has some similarity to KSHV ORF 71 protein. 
30 SEQ ID NO 156 shows the cDNA nucleotide sequence of RRV ORF 72, corresponding to 

the complement of nucleotides 1 19794-120558 of SEQ ID NO 1 , which encodes a cyclin D 
homolog, and which has some similarity to KSHV ORF 72. 

SEQ ID NO 157 shows the amino acid sequence of the RRV ORF 72 protein, a cyclin D 
homolog protein, which has some similarity to KSHV ORF 72 protein. 
35 SEQ ID NO 158 shows the cDNA nucleotide sequence of RRV ORF 73, corresponding to 

the complement of nucleotides 120866-122212 of SEQ ID NO 1, which encodes a latent nuclear 
antigen, and which has some similarity to KSHV ORF 73. 
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SEQ ID NO 159 shows the amino acid sequence of the RRV ORF 73 protein, a latent 
nuclear antigen, which has some similarity to KSHV ORF 73 protein, 

8EQ ID NO 160 shows the cDNA nucleotide sequence of RRV R15, corresponding to 
nucleotides 122866-123627 of SEQ ID NO 1, which has some similarity to KSHV K14 and ox-2. 
5 SEQ ID NO 161 shows the amino acid sequence of the RRV R15 protein, which has some 

similarity to KSHV K14 and ox-2. 

SEQ ID NO 162 shows the cDNA nucleotide sequence of RRV ORF 74, corresponding to 
nucleotides 123924-124952 of SEQ ID NO 1, which encodes a G protein coupled receptor, and 
which has some similarity to KSHV ORF 74. 
10 SEQ ID NO 163 shows the amino acid sequence of the RRV ORF 74 protein, a G protein 

coupled receptor protein, which has some similarity to KSHV ORF 74 protein. 

SEQ ID NO 164 shows the cDNA nucleotide sequence of RRV ORF 75, corresponding to 
the complement of nucleotides 125057-128953 of SEQ ID NO 1, which encodes a tegument 
protein, FGARAT, and which has some similarity to KSHV ORF 75. 
15 SEQ ID NO 165 shows the amino acid sequence of the RRV ORF 75 protein, a tegument 

protein, FGARAT, which has some similarity to KSHV ORF 75 protein. 

The cDNA sequences given in each of the even numbered sequences SEQ ID NOs 2-164 
are the open reading frames of the RRV, with the nucleotide references in each of those sequences 
being given with reference to the nucleotide numbers set forth in SEQ ID NO 1. 
20 SEQ ID NOs 166-172 are PCR primers used in the present invention. 

SEQ ID NO 173 shows the coding sequence similar to that for MIP without AUG. 
Nucleic acid numbers correspond to those given in SEQ ID NO 1. 

SEQ ID NOs 174-179 show the repeat regions of the RRV genome. Nucleic acid 
numbers correspond to those given in SEQ ID NO 1 
25 SEQ ID NOs 180-181 are probes specific for the KSHV thymidylate synthase (TS) gene 

used for Southern blot hybridization. 

SEQ ID NOs 182183 are oligonucleotide PCR primers specific for the RRV MIP gene. 



ATCC DEPOSIT 

30 A Budapest Treaty deposit of RRV 17577 was made with the American Type Culture 

Collection (ATCC), Manassas, Virginia, on March 12, 1998, and has been accorded ATCC 
Accession No . VR-260 1 . 



DETAILED DESCRIPTION OF SEVERAL EMBODIMENTS 
35 Abbreviations and Definitions 

Animal: Living multicellular vertebrate organisms, a category which includes, for 
example, humans, non-human primates, mammals, and birds. 
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Cell: A plant, animal, insect, bacterial, or fungal cell. 

Homologs: two nucleotide or amino acid sequences that share a common ancestral 
sequence and diverged when a species carrying that ancestral sequence split into two species. 
Homologs frequently show a substantial degree of sequence identity. 
5 IL-6: Interleukin 6. IL-6 is a cytokine known to have pleiotropic immunological effects 

including anti-inflammatory and immunosuppressive effects {Human Cytokines, 1991, pg. 142- 
167, Blackwell Scientific Publications, Aggarwal and Gutterman, eds). Because IL-6 is a 
pleiotropic cytokine, IL-6 activity may be measured using a number of bioassays, including 
stimulation of immunoglobulin production in SKW6-CL4 cells as described by Hirano et al. 

10 (Nature 324:73-6, 1986) and stimulation of hybridoma cell growth as described by Matsuda et al. t 
1988 Eur. J. Immunol. 18:951-956, both of which are incorporated by reference. As used herein, 
the term "IL-6 biological activity" refers to the ability of a protein to show activity in at least one 
of these assay systems 

Immunocompromised: Lacking a normal immune response. Immuno-compromised 

15 refers to a condition in which some or all of an animal's immune system is inoperative, leaving the 
animal with an increased susceptibility to infection or disease. An animal may be rendered 
immuno-compromised by a biological agent such as, in the case of non-human primates, Simian 
Immunodeficiency Virus (SIV). Many strains of SIV have been isolated and characterized; any 
SIV strain that produces an immuno-compromised state can be used in the present invention 

20 including, but not limited to, for example, SIVmac239 (Kestler et al., 1990, Science 248: 1109- 
12), SIVmac251 (Daniels et al., 1985, Science 228: 1201-4), SIVdeltaB670 (Murphy-Corb et al., 
1986, Nature 321:435) and SIVmne (Benveniste et al., 1988, J. Virol 62:2091-101). In addition, 
hybrid SIV/HIV chimeras as known in the field can be employed, as can HIV-2. Simian type D 
retroviruses (SRVs) which cause an AIDS-like disease in rhesus monkeys, can alternatively be 

25 used to immuno-compromise the animals in place of SIV. These viral agents are administered to 
the animal using conventional means, such as intravenous or intramuscular injection, or oral, 
intrarectal or intravaginal inoculation (also see Example 24). Either intact viral particles or viral 
DNA may be administered. As known in the field, plasmid constructs containing the entire SIV 
genome are infectious when inoculated into animals and so may be employed in place of purified 

30 viral DNA. 

Alternatively, an animal may be rendered immuno-compromised by administration of 
agents that target the immune system, including byt not limited to anti-CD3 antibody (CD3 being 
the T-cell receptor) either alone or conjugated with a toxic moiety, or immunosuppressive 
compounds including prednisone, azathioprine, cyclosporine A, and cyclophosphamide. Where an 
35 immunosuppressive compound such as cyclosporine is employed, an allogenic stimulus (such as a 
blood transfusion) may be administered with the subsequent administration of RRV to activate 
infection. 
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Alternatively, other methods of rendering an animal immunocompromised may be used, 
including radiation treatment and surgical intervention. 

Isolated: An "isolated" biological component (such as a nucleic acid, peptide or protein) 
has been substantially separated, produced apart from, or purified away from other biological 
5 components in the cell of the organism in which the component naturally occurs, i.e. , other 

chromosomal and extrachromosomal DNA and RNA, and proteins. Nucleic acids, peptides and 
proteins which have been "isolated" thus include nucleic acids and proteins purified by standard 
purification methods. The term also embraces nucleic acids, peptides and proteins prepared by 
recombinant expression in a host cell as well as chemically synthesized nucleic acids. 

10 KSHV: Kaposi's sarcoma-associated herpesvirus. KSHV is a herpesvirus associated with 

(and thought to be the etiological agent of) Kaposi's sarcoma in humans. 

Lymphoproliferative Disorder: a group of disorders characterized by proliferation of 
lymphoid tissue, such as lymphocytic leukemia and malignant lymphoma, and characterized by 
such signs as lymphocytosis, lymphadenopathy, and splenomegaly. 

15 MIP: macrophage inflammatory protein. The acronym MIP is used to describe a family 

of cytokines that includes MIP1 (Davatelis et al., 1989, Science 243: 1066-8) and MIP2 (U.S. 
Patent No. 5,145,676). MIPs mediate pleiotropic immunological effects including activation of 
neutrophils to undergo an oxidative burst. MIPs are also intrinsically pyrogenic. MIP biological 
activity can be detected and quantified using bioassays as described in Kedal et al. (Science 

20 277: 1656-9, 1997) and Boshoff et al. (Science 278:290-4, 1997) that measure MIP concentrations 
using HIV inhibition and calcium mobilization, respectively. As used herein, the term "MIP 
biological activity" refers to the ability of a protein to show activity in at least one of these assay 
systems. 

Non-human primate: Simian primates including chimpanzees, orangutans, baboons, and 
25 macaques. Any non-human primate may be used to produce a KSHV-disease animal model by the 
methods disclosed herein. Thus, in addition to the rhesus macaque models described in detail 
below, pigtail and cynomologus macaques and baboons may also be used to produce KSHV- 
disease animal models by the methods disclosed herein. 

Oligonucleotide: A linear polynucleotide sequence of up to about 200 nucleotide bases in 
30 length, for example a polynucleotide (such as DNA or RNA) which is at least 6 nucleotides, for 
example at least 15, 25, 50, 100 or even 200 nucleotides long. 

Operably linked: A first nucleic acid sequence is operably linked with a second nucleic 
acid sequence when the first nucleic acid sequence is placed in a functional relationship with the 
second nucleic acid sequence. For instance, a promoter is operably linked to a coding sequence if 
35 the promoter affects the transcription or expression of the coding sequence. Generally, operably 
linked DNA sequences are contiguous and, where necessary to join two protein coding regions, in 
the same reading frame. 
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ORF: open reading frame. Contains a series of nucleotide triplets (codons) coding for 
amino acids without any termination codons. These sequences are usually translatable into 
protein. 

PCR: polymerase chain reaction. Describes a technique in which cycles of denaturation, 
5 annealing with primer, and then extension with DNA polymerase are used to amplify the number 
of copies of a target DNA sequence. 

Pharmaceutical^ acceptable carriers: The pharmaceutical^ acceptable carriers useful 
in this invention include conventional carriers. Remington's Pharmaceutical Sciences , by E. W. 
Martin, Mack Publishing Co., Easton, PA, 15th Edition (1975), describes compositions and 
10 formulations suitable for pharmaceutical delivery of the viruses, nucleic acids and/or proteins 
herein disclosed. 

In general, the nature of the carrier will depend on the particular mode of administration 
being employed. For instance, parenteral formulations usually comprise injectable fluids that 
include pharmaceutically and physiologically acceptable fluids such as water, physiological saline, 

15 balanced salt solutions, aqueous dextrose, glycerol, ethanol, combinations thereof, or the like, as a 
vehicle. The carrier and composition can be sterile, and the formulation suits the mode of 
administration. For solid compositions (e.g., powder, pill, tablet, or capsule forms), conventional 
non-toxic solid carriers can include, for example, pharmaceutical grades of mannitol, lactose, 
starch, sodium saccharine, cellulose, magnesium carbonate, or magnesium stearate. In addition to 

20 biologically-neutral carriers, pharmaceutical compositions to be administered can contain minor 
amounts of non-toxic auxiliary substances, such as wetting or emulsifying agents, preservatives, 
and pH buffering agents and the like, for example sodium acetate or sorbitan monolaurate. 

The composition can be a liquid solution, suspension, emulsion, tablet, pill, capsule, 
sustained release formulation, or powder. The composition can be formulated as a suppository, 

25 with traditional binders and carriers such as triglycerides. 

Probes and primers: Nucleic acid probes and primers may readily be prepared based on 
the amino acid sequences provided by this invention. A probe is an isolated nucleic acid attached 
to a detectable label or reporter molecule. Typical labels include radioactive isotopes, ligands, 
chemiluminescent agents, and enzymes. Methods for labeling and guidance in the choice of labels 

30 appropriate for various purposes are discussed, e.g., in Sambrook et al., in Molecular Cloning: A 
Laboratory Manual , Cold Spring (1989) and Ausubel et al., in Current Protocols in Molecular 
Biology , Greene Publishing Associates and Wiley-Intersciences (1987). 

Primers are short nucleic acids, such as DNA oligonucleotides 10 nucleotides or more in 
length. Primers may be annealed to a complementary target DNA strand by nucleic acid 

35 hybridization to form a hybrid between the primer and the target DNA strand, and then extended 
along the target DNA strand by a DNA polymerase enzyme. Primer pairs can be used for 
amplification of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR) or other 
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nucleic-acid amplification methods known in the art. 

Methods for preparing and using probes and primers are described, for example, in 
Sambrook et al. ( Molecular Cloning: A Laboratory Manual , 2 nd ed., vol. 1-3, Cold Spring 
Harbor, New York. 1989); Ausubel et al. ( Current Protocols in Molecular Biology , Greene Publ. 

5 Assoc. & Wiley-Intersciences. 1987) and Innis et al. (PCR Protocols, A Guide to Methods and 

Applications , 1990, Innis et al. (eds.), 21-27, Academic Press, Inc., San Diego, California). PCR 
primer pairs can be derived from a known sequence, for example, by using computer programs 
intended for that purpose such as Primer (Version 0.5, ® 1991, Whitehead Institute for Biomedical 
Research, Cambridge, MA). 

10 Probes and primers as used in the present invention typically comprise at least 15 

contiguous nucleotides of the RRV genome sequence (SEQ ID NO 1). One of skill in the art will 
appreciate that the specificity of a particular probe or primer increases with its length. Thus, for 
example, a primer comprising 20 consecutive nucleotides will anneal to a target with a higher 
specificity than a corresponding primer of only 15 nucleotides. Thus, in order to obtain greater 

15 specificity, probes and primers may be selected that comprise 20, 25, 30, 35, 40, 50 or more 
consecutive nucleotides. In order to enhance specificity, longer probes and primers may also be 
employed, such as probes and primers that comprise at least 20, 30, 40, 50, 60, 70, 80, 90, 100, 
or 150 consecutive nucleotides of the disclosed nucleic acid sequences. 

Alternatively, such probes and primers may comprise at least 15, 20, 30, 40, 50, 60, 70, 

20 80, 90, 100, or 150 consecutive nucleotides that share a defined level of sequence identity with the 
disclosed RRV sequence, for instance, at least a 50%, 60%, 70%, 80%, 90%, 95% or 98% 
sequence identity. Alternatively, such probes and primers may be nucleotide molecules which 
hybridize under wash conditions of 70°C and about 0.2 x SSC for 1 hour, or alternatively under 
less stringent conditions of 65°C, 60°C, or 55°C with from about 0.2 to 2 x SSC (with, for 

25 instance, about 0. 1 % SDS) for 1 hour with a portion of the RRV sequence. 

Purified: The term purified does not require absolute purity; rather, it is intended as a 
relative term. Thus, for example, a purified peptide preparation is one in which the peptide or 
protein is more enriched than the peptide or protein is in its natural environment within a cell. 
Preferably, a preparation is purified such that the protein or peptide represents at least 50% of the 

30 total peptide or protein content of the preparation. 

Recombinant: A recombinant nucleic acid is one that has a sequence that is not naturally 
occurring or has a sequence that is made by an artificial combination of two otherwise separated 
segments of sequence. This artificial combination is often accomplished by chemical synthesis or, 
more commonly, by the artificial manipulation of isolated segments of nucleic acids, e.g., by 

35 genetic engineering techniques. 

RRV 17577: Rhesus macaque rhadinovirus RRV isolate 17577. A Budapest Treaty 
deposit of RRV 17577 was made with the American Type Culture Collection, Manassas, Virginia, 
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on March 12, 1998, and has been accorded ATCC Accession No. VR-2601. This virus may be 
grown on primary rhesus fibroblasts, as described below (see Examples 1 and 14), using standard 
virological techniques. Alternatively, it may be grown on commercially available rhesus cell lines, 
including those available from ATCC, such as ATCC CRL-6306 and ATCC CL- 160. Infection of 
5 a non-human primate with RRV 17577 may be accomplished using any standard method, including 
intravenous injection (see Examples 13, 23 and 24). Typically, infection is achieved by 
intravenous injection of around 10 6 plaque forming units (PFUs) of RRV 17577. 

RRV: A virus having the virological and immunological characteristics of RRV 17577, 
and which causes Kaposi's sarcoma in immunocompromised Rheusus monkeys which are infected 

10 with the virus. In particular examples, the RRV has at least 85% (for example at least 90%, 95% 
or 98%) sequence identity to SEQ ID NO 1. 

Sequence Identity: The similarity between two nucleic acid sequences, or two amino 
acid sequences, is expressed in terms of the similarity between the sequences, otherwise referred to 
as sequence identity. Sequence identity is frequently measured in terms of percentage identity (or 

15 similarity or homology); the higher the percentage, the more similar the two sequences are. 

Homologs or orthologs of the RRV proteins and the corresponding DNA sequences, will possess a 
relatively high degree of sequence identity when aligned using standard methods. This homology 
will be more significant when the orthologous proteins or DNAs are derived from species which 
are more closely related (e.g., human and chimpanzee sequences), compared to species more 

20 distantly related (e.g., human and C. elegans sequences). 

Typically, RRV orthologs are at least 50% identical at the nucleotide level and at least 
50% identical at the amino acid level when comparing RRV to an orthologous RRV sequences. 

Methods of alignment of sequences for comparison are well known in the art. Various 
programs and alignment algorithms are described in: Smith & Waterman, Adv. AppL Math. 

25 2:482, 1981; Needleman & Wunsch, J. MoL Biol. 48:443, 1970; Pearson & Lipman, Proc. Natl. 
Acad. ScL USA 85:2444, 1988; Higgins & Sharp, Gene, 73:237-44, 1988; Higgins & Sharp, 
CABIOS 5:151-3, 1989; Corpetetal., Nuc. Acids Res. 16:10881-90, 1988; Huang et al. Computer 
Appls. in the Biosciences 8, 155-65, 1992; and Pearson et al., Meth. MoL Bio. 24:307-31, 1994. 
Altschul et al. {Nature Genetics 6:119-29, 1994), presents a detailed consideration of sequence 

30 alignment methods and homology calculations. 

The NCBI Basic Local Alignment Search Tool (BLAST) (Altschul et al. J. MoL Biol. 
215:403-10, 1990) is available from several sources, including the National Center for 
Biotechnology Information (NCBI, Bethesda, MD) and on the Internet, for use in connection with 
the sequence analysis programs blastp, blastn, blastx, tblastn and tblastx. It can be accessed at 

35 http://www.ncbi.nlm.nih.gov/BLAST/. A description of how to determine sequence identity using 
this program is available at http://www.ncbi.nlm.nih.gov/BLAST/blast_help.html. As used 
herein, sequence identity is commonly determined with the BLAST software set to default 
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parameters. For instance, blastn (version 2.0.6) software may be used to determine sequence 
identity between two nucleic acid sequences using default parameters. For comparison of two 
polypeptides, blastp (version 2.0:6) software may be used with default parameters. 

An alternative alignment tool is the ALIGN Global Optimal Alignment tool (version 3.0) 
5 available from Biology Workbench at http://biology.ncsa.uiuc.edu. This tool may be used with 
settings set to default parameters to align two known sequences. References for this tool include 
Meyers and Miller (CABIOS 4: 1 1-7, 1989). 

Homologs of the disclosed RRV nucleic acids typically possess at least 50% sequence 
identity counted over the length of one of the nucleic acids (the reference nucleic acid) using the 

10 NCBI Blast 2.0.6, gapped blastn set to default parameters. Nucleic acids showing substantial 

similarity when assessed by this method may show, for example, at least 50%, 60%, 70%, 80%, 
90%, 95% or even 98% or greater sequence identity. When less than the entire sequence is being 
compared for sequence identity, substantially similar nucleotide sequences will typically possess at 
least 70% sequence identity over short windows of 30-90 nucleic acids, and may possess sequence 

15 identities of at least 80%, 90%, 95% or 98% or greater. 

Homologs of the disclosed RRV proteins typically possess at least 50% sequence identity 
counted over full-length alignment with the amino acid sequence of RRV using the NCBI Blast 
2.0, gapped blastp set to default parameters. For comparisons of amino acid sequences of greater 
than about 30 amino acids, the Blast 2 sequences function is employed using the default 

20 BLOSUM62 matrix set to default parameters, (gap existence cost of 11, and a per residue gap cost 
of 1). When aligning short peptides (fewer than around 30 amino acids), the alignment should be 
performed using the Blast 2 sequences function, employing the PAM30 matrix set to default 
parameters (open gap 9, extension gap 1 penalties). Proteins with even greater similarity to the 
reference sequence will show increasing percentage identities when assessed by this method, such 

25 as at least 50%, at least 55%, at least 60%, at least 70%, at least 75%, at least 80%, at least 90%, 
at least 95%, at least 98%, or at least 99% sequence identity. When less than the entire sequence 
is being compared for sequence identity, homologs will typically possess at least 70% sequence 
identity over short windows of 10-20 amino acids, and may possess sequence identities of at least 
75%, at least 85% or at least 90%, at least 95% or 98% depending on their similarity to the 

30 reference sequence. Methods for determining sequence identity over such short windows are 
described at http://www.ncbi.nlm.nih.gov/BLAST/blast_FAQs.html. 

When comparing degrees of sequence identity between similar proteins, the degree of 
identity will be equal to or less than that the degree of similarity , due to the fact the similarity 
takes into account conservative amino acid substitutions. So, for instance, the degree of sequence 

35 identity between to substantially similar proteins may be at least 43%, 50%, 55%, 65%, 75%, 
85%, 95%, 98% or more. 

One of ordinary skill in the art will appreciate that these sequence identity ranges are 



WO 00/28040 



PCT/US99/26260 



-23- 

provided for guidance only; it is entirely possible that strongly significant homologs could be 
obtained that fall outside of the ranges provided. The present invention provides not only the 
peptide h6r$|ologs that are described above, but also nucleic acid molecules that encode such 
homologs. 

5 An alternative indication that two nucleic acid molecules are closely related is that the two 

molecules hybridize to each other under stringent conditions, as described in Example 23. 

Specific binding agent: An agent that binds substantially only to a defined target. As 
used herein, the term "RRV peptide specific binding agent" includes anti-RRV peptide antibodies 
and other agents that bind substantially only to the RRV peptide. Such "peptide specific binding 

10 agents" include anti-IL-6 and anti-MIP antibodies. The antibodies may be monoclonal or 

polyclonal antibodies that are specific for an RRV peptide, as well as immunologically effective 
portions ("fragments") thereof. In one embodiment, the antibodies used in the present invention 
are monoclonal antibodies (or immunologically effective portions thereof) and may also be 
humanized monoclonal antibodies (or immunologically effective portions thereof)- 

15 Immunologically effective portions of monoclonal antibodies include Fab, Fab', F(ab')2, Fabc and 
Fv portions (for a review, see Better and Horowitz, Methods. Enzymol. 178:476-96, 1989). Anti- 
inhibitory peptide antibodies may also be produced using standard procedures described in a 
number of texts, including Antibodies, A Laboratory Manual by Harlow and Lane, Cold Spring 
Harbor Laboratory (1988). 

20 Methods of making humanized monoclonal antibodies aTe well known, and include those 

described in U.S. Patent Nos. 5,585,089; 5,565,332; 5,225,539; 5,693,761; 5,693,762; 
5,585,089; and 5,530,101 and references cited therein. Similarly, methods of making and using 
immunologically effective portions of monoclonal antibodies, also referred to as antibody 
fragments, are well known and include those described in Better and Horowitz, 1989, Meth. 

25 Enzymol. 178:176-496; Better et al., 1990, Better and Horowitz, 1990, Advances in Gene 

technology: The Molecular Biology of Immune Disease & the Immune Response (ICSU Short 
Reports); Glockshuber et al., 1990, Biochemistry 29:1362-7; and U.S. Patent Nos. 5,648,237; 
4,946,778 and 5,455,030, and references cited therein. 

The determination that a particular agent binds substantially only to an RRV peptide may 

30 readily be made by using or adapting routine procedures. One suitable in vitro assay makes use of 
the Western blotting procedure (described in many standard texts, including Antibodies, A 
Laboratory Manual by Harlow and Lane). Western blotting may be used to determine that a given 
RRV peptide binding agent, such as an anti-IL-6 or MIP peptide monoclonal antibody, binds 
substantially only to the specific RRV protein. 

35 Subject: This term includes both human and non-human subjects. Similarly, the term 

"patient" includes both human and veterinary subjects. 
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Transformed: A transformed cell is a cell into which has been introduced a nucleic acid 
molecule by molecular biology techniques. As used herein, the term transformation encompasses 
all techniques by which a nucleic acid molecule might be introduced into such a cell, including 
transfection with viral vectors, transformation with plasmid vectors, and introduction of naked 

5 DNA by electroporation, lipofection, and particle gun acceleration. 

Transgenic Cell: Transformed cells which contain foreign, non-native DNA. 
Variants of Amino Acid and Nucleic Acid Sequences: The production of RRV proteins 
can be accomplished in a variety of ways (for example see Examples 17, 21 and 25). DNA 
sequences which encode the protein, or a fragment of the protein, can be engineered such that they 

10 allow the protein to be expressed in eukaryotic cells, bacteria, insects, and/or plants. In order to 
accomplish this expression, the DNA sequence can be altered and operably linked to other 
regulatory sequences. The final product, which contains the regulatory sequences and the 
therapeutic protein, is referred to as a vector. This vector can then be introduced into the 
eukaryotic cells, bacteria, insect, and/or plant. Once inside the cell the vector allows the protein 

15 to be produced. 

One of ordinary skill in the art will appreciate that the DNA can be altered in numerous 
ways without affecting the biological activity of the encoded protein. For example, PCR may be 
used to produce variations in the DNA sequence which encodes RRV proteins, such as IL-6 or 
MIP. Such variants may be variants that are optimized for codon preference in a host cell that is 

20 to be used to express the protein, or other sequence changes that facilitate expression. 

Two types of cDNA sequence variant may be produced. In the first type, the variation in 
the cDNA sequence is not manifested as a change in the amino acid sequence of the encoded 
polypeptide. These silent variations are simply a reflection of the degeneracy of the genetic code. 
In the second type, the cDNA sequence variation does result in a change in the amino acid 

25 sequence of the encoded protein. In such cases, the variant cDNA sequence produces a variant 

polypeptide sequence. In order to preserve the functional and immunologic identity of the encoded 
polypeptide, it is preferred that any such amino acid substitutions are conservative. Conservative 
substitutions replace one amino acid with another amino acid that has some homology in size, 
hydrophobicity, etc. Such substitutions generally are conservative when it is desired to finely 

30 modulate the characteristics of the protein. For example, conservative substitutions generally 

maintain (a) the structure of the polypeptide backbone in the area of the substitution, for example, 
as a sheet or helical conformation, (b) the charge or hydrophobicity of the molecule at the target 
site, or (c) the bulk of the side chain. 

Examples of amino acids which may be substituted for an original amino acid in a protein 

35 and which are regarded as conservative substitutions include: Ser for Ala; Lys for Arg; Gin or His 
for Asn; Glu for Asp; Ser for Cys; Asn for Gin; Asp for Glu; Pro for Gly; Asn or Gin for His; 
Leu or Val for He; He or Val for Leu; Arg or Gin for Lys; Leu or He for Met; Met, Leu or Tyr 





WO 00/28040 



PCT/US99/26260 



-25- 



for Phe; Thr for Ser; Ser for Thr; Tyr for Trp; Trp or Phe for Tyr; and He or Leu for Val. 

The substitutions which in general are expected to produce the greatest changes in protein 
properties will be non-conservative, for instance changes in which (a) a hydrophilic residue, e.g., 
seryl or threonyl, is substituted for (or by) a hydrophobic residue, e.g., leucyl, isoleucyl, 
5 phenylalanyl, valyl or alanyl; (b) a cysteine or proline is substituted for (or by) any other residue; 
(c) a residue having an electropositive side chain, e.g., lysyl, arginyl, or histadyl, is substituted for 
(or by) an electronegative residue, e.g., glutamyl or aspartyl; or (d) a residue having a bulky side 
chain, e.g., phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine. 



10 or not, should be minimized in order to preserve the functional and immunologic identity of the 
encoded protein. The immunologic identity of the protein may be assessed by determining 
whether it is recognized by an antibody to an RRV protein; a variant that is recognized by such an 
antibody is immunologically conserved. Any DNA sequence variant will preferably introduce no 
more than 20, and preferably fewer than 10 amino acid substitutions into the encoded polypeptide. 

15 Variant amino acid sequences can, for example, be 80%, 90%, 95% or even 98% identical to the 
native amino acid sequence. 

Vector: A nucleic acid molecule as introduced into a host cell, thereby producing a 
transformed host cell. A vector may include nucleic acid sequences that permit it to replicate in 
the host cell, such as an origin of replication. A vector may also include one or more selectable 

20 marker genes and other genetic elements known in the art. 

Virion: A complete viral particle including envelope, capsid (if any), and nucleic acid 
elements. 

The present invention utilizes standard laboratory practices for the cloning, manipulation 
and sequencing of nucleic acids, purification and analysis of proteins and other molecular 
25 biological and biochemical techniques, unless otherwise stipulated. Such techniques are explained 
in detail in standard laboratory manuals such as Sambrook et al. ( Molecular Cloning: A 
Laboratory Manual , 2 nd ed., vol. 1-3, Cold Spring Harbor, New York. 1989) and Ausubel et al. 
(Current Protocols in Molecular Biology , Greene Publ. Assoc. & Wiley-Inter sciences. 1987). 



This example describes how RRV was isolated from a rhesus macaque monkey. Fresh, 
dispersed bone marrow (BM) cells were isopynic gradient-purified (Ficoll-Paque, Pharmacia) from 
a 2 yr, 202 day old captive-reared rhesus macaque that was euthanized 503 days after intravenous 
35 infection with an SIVmac239 variant. Gradient-purified BM mononuclear cells were seeded into 
T-25 culture flasks and grown in the presence of Endothelial SFM media (GIBCO) supplemented 



Variations in the DNA sequence that result in amino acid changes, whether conservative 
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EXAMPLE 1 



Isolation of RRV 
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with 10% fetal bovine serum, 1% L-glutamine, 1% penicillm-streptomycin-neomycin and 30 
/tg/mL endothelial cell growth supplement. 

Cultures developing cytopathic effects (CPE) were rapidly frozen in liquid N2 and thawed, 
and supernatants clarified by centrifugation and filtered through a 0.45 \i membrane. Filtered 
5 extracts were then used as inoculum on primary rhesus macaque fibroblast cultures. Fibroblast 
cultures developing CPE were scraped free into medium, pelleted at 400 xg, washed in phosphate- 
buffered saline and suspended in cold Ito and Karnovsky's fixative (2.5% glutaraldehyde, 0.5% 
picric acid, 1.6% paraformaldehyde, 0.005% ruthenium red) in 0.1 M sodium cacodylate buffer, 
pH 7.4 for 2 hours. Fixed cells were washed in cacodylate buffer, post-fixed in 1 % Os04 and 

10 0.8% KjFe (Cn)& in cacodylate buffer for 1 hour, rinsed in distilled H2O and pre-stained in 2% 

aqueous uranyl acetate for 1 hour. Fixed and pre-stained cells were dehydrated in a graded series 
of acetone imbedded in Epon 812 epoxy resin, polymerized at 60 °C and sectioned at 60 nm on an 
MT 5000 ultramicrotome. Copper grid mounted sections were stained with lead citrate and Uranyl 
acetate and viewed on a Phillips 300 electron microscope. 

15 By electron microscopy, numerous herpesvirus particles were observed in the cells. This 

macaque developed LPD characterized as lymphocytic masses in myeloid and nonlymphoid tissues 
which were confirmed histopathologically as solid pleomorphic lymphoid masses. 

EXAMPLE 2 

20 Initial Characterization of RRV 

Infectious virus was purified from infected primary rhesus fibroblast cultures exhibiting 
100% CPE (see Example 1). Infected cells were harvested and disrupted by freeze- thawing and 
the cell debris removed by low speed centrifugation. Supernatants were centrifuged in a Beckman 
JA-14 rotor for 1 hour at 9000 rpm to pellet the virus, which was further purified through a six- 

25 step sorbitol gradient ranging from 20 to 70%, spun in a Beckman SW41 rotor for 2 hours at 
18,000 rpm. Virus was diluted in balanced buffered salts solution and then pelleted through a 
20% sorbitol cushion. Pelleted virus was resuspended in Tris-EDTA buffer (TE; 10 mM 
Tris-HCl, pH 8.0, and 1 mM EDTA) and lysed in TE with 0.6% SDS and proteinase K (200 fig) 
at 37 °C for 5 hours. Viral DNA was then isolated by CsCh gradient centrifugation in a Beckman 

30 Ti 75 rotor at 38.4 K rpm for 72 hours, collected and dialyzed against TE. 

The viral DNA was analyzed using degenerate primer polymerase chain reaction (PCR) 
amplification and Southern blot hybridization with a probe specific for the KSHV thymidylate 
synthase (TS) gene: (5'<TATACTGCCATTTC-3\ SEQ ID NO 180 and 5'- 
ATGTTTCCGTTTGTA-3 ' , SEQ ID NO 181). The probe itself had the sequence of the KSHV TS 

35 (Orf 70 gene). Four genes were identified by these methods. A fragment encoding a portion of 
the viral DNA polymerase was obtained and DNA sequence analysis revealed that the virus was 
most likely a gamma herpesvirus, as amino acid sequence identity was highest among this class of 
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herpes viruses. DNA sequence analysis of the viral DNA fragment found to hybridize to the 
KSHV TS probe revealed three open reading frames (ORFs) with homology to KSHV (Nicholas et 
aL, 1997, Nature Med. 3:287-92; Russo et al.; 1996, Proc Natl Acad Sci USA 93:14862-7). One 
ORF encodes a homologue of macrophage inflammatory protein MIP-1 with amino acid sequence 
5 identity with KSHV MIP-II, the second ORF encodes a thymidylate synthase homologue and the 
third ORF encodes a homologue of interleukin-6 (IL-6) with homology to the rhesus IL-6 and 
KSHV IL-6. The presence of an IL-6-like cytokine and an MIP-1 -like CC-chemokine flanking TS 
resembles the genomic organization of KSHV, indicating this virus is related to KSHV and is 
referred to herein as rhesus rhadinovirus (RRV). 
10 To determine if RRV is present in tissue containing the lymphocytic masses, 

oligonucleotide PCR primers specific for the RRV MIP gene (vMIP-1, 5- 

CCTATGGGCTCC ATG AG C-3 \ SEQ ID NO 182; and vMIP-2, 5-ATCGTCAATCAGGCTGCG-3 
SEQ ID NO 183) were designed in an attempt to detect viral DNA in tissue from the macaque. By 
semi-quantitative PCR analysis, viral DNA sequences were detected in DNA samples from bone 

15 marrow at approximately 590 copies per 0. 1 fig of tissue DNA. Because rhesus macaques held in 
captivity are reported to be naturally infected with a herpesvirus similar to KSHV, bone marrow 
DNA samples were isolated from normal and SIVmac239-infected macaques without LPD and 
analyzed by PCR. There was no evidence of viral DNA sequences. Additionally, since simian 
Epstein-Barr virus (EBV) has been found to be present in high copy number in lymphomas from 

20 SIV-infected macaques (Baskin et al., 1986, J. Natl. Cancer Inst. 77:127-39; Feichtinger et al., 
1990,. Amer. 7. Pathol. 137:131 1-5), the tissue samples from the macaque with disease were also 
analyzed by PCR for rhesus EBV (RhEBV) using oligonucleotide primers for RhEBV latent 
membrane protein 1. By this analysis, no signal for RhEBV was detected, suggesting that the 
RRV may be a contributing factor for LPD in this SIV-infected macaque. 



Primary rhesus fibroblasts grown in two 850 cm 2 roller bottles were infected with RRV at 
an MOI of 0. 1 and the virus was harvested from the culture supernatant and the infected 

30 monolayers 10 to 12 days post-infection. Cellular debris was removed from the culture 

supernatant by centrifugation at 1,000 x g for 10 minutes. Intracellular virus particles were 
released by sonication followed by centrifugation to pellet debris. 

The two clarified supematants were then combined and the virus was pelleted by 
centrifugation at 12,500 x g for 1 hour at 4°C, and further purified through a six-step sorbitol 

35 gradient ranging from 20 to 70%. Gradients were centrifuged in a Beckman SW41 rotor for 2 
hours at 18,000 rpm at 4°C. The interface containing the virus was collected and diluted with 
cold buffered saline solution. The virus was then pelleted by centrifugation in the SW41 for 50 
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Preparation of RRV DNA for Cloning 
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minutes at 18,000 rpm. The virus pellet was resuspended in 9.2 ml of TE (see Example 2) before 
the addition of 0.6 ml of 10% sodium dodecylsulfate (SDS) and 0.2 ml of proteinase K (10 
mg/mL) to release the viral DNA. Viral DNA was isolated by CsCh gradient centrifugation in a 
Beckman Ti75 rotor at 38,400 rpm for 72 hours, collected, and dialyzed against TE. 
5 To ensure that the DNA isolated contained all the necessary sequences required for RRV 

replication, DNA was transfected, in duplicate, into primary rhesus fibroblasts by the calcium 
phosphate method without dimethyl sulfoxide shock and observed for cytopathic effects (CPE). 
Control transfections, lacking viral DNA or calcium phosphate, did not develop CPE. 

10 EXAMPLE 4 

Construction of the Cosmid Library 
Approximately 100 fig of purified RRV DNA (Example 3) was partially digested with 
Sau3A L Aliquots taken at various time points were run on a 0.5% agarose gel and examined for 
the fraction which gave the desired range of fragments (30 - 42 kb). The selected fraction was 

15 dephosphorylated with calf intestinal alkaline phosphatase and 1 /xg ligated into the cosmid vector 
SuperCos 1, prepared essentially as described by the manufacturer (Stratagene, La Jolla, CA). 
The resulting ligation product was packaged using GigaPack II Gold packaging extract (Stratagene) 
and grown for the isolation of recombinant cosmids. 

Individual recombinant cosmids were grown in 3 ml cultures and the cosmid DNA was 

20 isolated by alkaline lysis. Cosmid DNA was digested with EcoRl and the DNA fragments 

separated on a 0.8% agarose gel. The separated fragments were transferred to nitrocellulose and 
probed with various PCR amplification products corresponding to specific KSHV ORFs. 
Hybridization of the probes to the transferred recombinant cosmids was done under conditions of 
moderate stringency (2x SSC-0. 1 %SDS at 55°C) with each of the KSHV-specific probes and at 

25 high stringency (0.2x SSC-0. 1 %SDS at 60°C) with the RRV-specific probes. By this analysis and 
restriction endonuclease mapping, the recombinant cosmids were aligned and a set of recombinants 
was identified that represented the entire viral genome when compared to digested viral DNA. 

EXAMPLE 5 

30 Cloning and Sequencing 

Ten micrograms of each purified recombinant cosmid (Example 4) were digested with 
EcoRI and the resulting fragments isolated from a 0.8% agarose gel using the QiaQuick gel 
extraction protocol (Qiagen). Recovered fragments were ligated into pSP73 (Promega). 
Individual clones were selected by restriction enzyme screening of DNA recovered by alkaline 
35 lysis from overnight cultures. Sequencing templates were prepared by alkaline lysis, followed by 
precipitation with 6.5% polyethylene glycol and 0.8 M NaCl. Templates were resuspended at a 
concentration of 0. 1 /tgVl and end sequences were determined using primers corresponding to the 
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SP6 and T7 promoters of pSP73. Internal sequences were determined using a combination of 
subcloning using convenient restriction sites and custom primers. DNA sequencing reactions were 
performed with the Applied Biosystems (ABI) PRISM Dye Terminator Cycle Sequencing Ready 
Reaction kits with AmpliTaq DNA polymerase per the manufacturer's instructions. Sequence data 
5 was acquired using an ABI 373A Sequencer in the Molecular Biology Core at the Oregon Regional 
Primate Research Center. The primary EcoRI fragments were sequentially arranged by 
sequencing across the EcoRI sites in the intact cosmids using custom primers. Except for those 
regions containing long, high GC repeat units, the entire viral DNA sequence was determined with 
a redundancy of 3- to 4-fold. 

10 Sequences not accessible to custom primers or restriction subcloning were determined 

following deletion subcloning using the Exo Size Deletion kit (New England Biolabs). To 
accommodate this protocol, fragments were subcloned into vectors with restriction sites capable of 
generating the needed 3* and 5' overhanging ends. Double restriction digests to generate 3' and 5* 
overhanging ends were performed on 10 fig of recombinant plasmid DNA, which was then 

15 subjected to exonuclease III digestion. Aliquots were removed from the exonuclease III digests at 
empirically-determined time points, frozen on dry ice, then, after all the time points had been 
collected, incubated for 15 minutes at 65°C to inactivate the enzyme. The DNA was then treated 
with Mung bean nuclease (MBN) for 30 minutes at 30°C. Prior to addition of 3 /xl of MBN to the 
12 jul exonuclease III product, the enzyme was diluted 1/25 to reduce nonspecific digestion. 

20 Nuclease-treated DNA was recovered using the Wizard prep system (Promega), then incubated for 
30 minutes with 2.5 units of T4 DNA polymerase (Life Technologies) and 1 pM dNTPs at 37°C. 
The final product was ligated overnight with T4 DNA ligase and used to transform competent 
XL1 blue bacteria. Deletion products were size selected by restriction digests of DNA recovered 
from 3 ml cultures. 

25 

EXAMPLE 6 

Assembly of the RRV Sequence, Assignment of ORFs, and Nomenclature 
Factura (ABI) and Autoassembler (ABI) were used to assemble the final sequence from 
individual sequencing runs. Open reading frames in the RRV sequence were determined with the 

30 program Mac Vector (Oxford Molecular Group), using a setting of 100 or more amino acids. 
Putative ORFs were then translated and compared to a database of KSHV ORFs. RRV ORFs 
which matched KSHV ORFs were then compared to GenBank using BLASTP to verify the 
similarity, followed by a Gap analysis (Wisconsin GCG analysis package; Oxford Molecular 
Group) to determine the levels of similarity and identity between the RRV and KSHV proteins. 

35 When a gap in the genome of RRV corresponded to the location of a KSHV ORF with less than 
100 amino acids, Mac Vector was reset to a lower limit. RRV ORFs were assigned the names of 
HVS ORFs when they showed similarity to KSHV ORFs with the same name. 
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The nucleotide sequence data from this study have been deposited in the GenBank, 
EMBL, and DDBJ nucleotide sequence databases under accession number AF083501 (SEQ ID NO 

1). 

5 EXAMPLE 7 

Primary Structure Of the RRV Genome 

The genomic nucleotide sequence of the RRV genome (as shown in SEQ ID NO 1) was 
determined using twenty-nine EcoRI fragments (as shown in FIG. 2) from seven overlapping 
isolates of a partial Sau3A I cosmid library. Cosmids were selected by hybridization with PCR 

10 products from KSHV ORFs. EcoRI fragments from each cosmid were subcloned into pSP73 

(Promega) and sequenced. The EcoRI fragments were arranged in the proper order by sequencing 
across the EcoRI junctions in the parent cosmids using custom primers. Greater than 98% of the 
viral genome was determined on both strands. The average sequencing redundancy was between 3 
and 4, but three regions were sequenced on only one strand. One of these regions is a 106 bp 

15 segment of ORF 61 (SEQ ID Nos 136 and 137) that was blocked on one side by an apparent 

hairpin. This segment was sequenced multiple times in one direction using templates derived from 
independent overlapping cosmids. The other two regions are 1 kb, high G + C, repetitive 
sequences. These segments, which are discussed in more detail below, were sequenced completely 
on one strand using a combination of custom primers and exonuclease III deletions. 

20 Terminal repeats were identified on both the left and right ends of the genome and the 

sequence between them was designated as the LUR of the genome. The first base to the right of 
the left terminal repeat was designated base one. The LUR is 133,719 bp long (SEQ ID NO 1). 
The G + C content of RRV is 52.2%, which is comparable to the 53.5% G + C content of 
KSHV, but considerably higher than the 34.5% G + C content of the HVS genome. The CpG 

25 ratio is 1.11, which is substantially higher than the ratio found for other gamma-herpes viruses. 

ORFs were identified by Mac Vector and compared to a database containing the full 
complement of known KSHV ORFs. Matches between RRV and KSHV proteins were verified by 
a BLASTP search of GenBank with the RRV proteins and then by Gap analysis. The initial 
screening for ORFs used a minimum size limit of 100 amino acids. This limit was reduced when 

30 smaller KSHV ORFs existed in locations corresponding to unassigned regions of RRV. Using this 
approach, 82 ORFs were identified (even-numbered SEQ ID Nos 2-164), with 67 of these 
corresponding to ORFs found in both KSHV and HVS. In accordance with the standard 
nomenclature for rhadinoviruses, these ORFs were labeled according to the HVS designation. The 
15 ORFs not found in HVS were assigned labels beginning with R (for rhesus), indicating their 

35 presence in RRV, but not HVS. Some of these genes have counterparts in KSHV. 

A map of the genome of RRV is presented in FIG. 3, with all identified ORFs and their 
orientations. The BamHI, EcoRI, and Hindlll restriction sites in relation to the genome are shown 
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in FIG. 2. The BamHI and Hind III maps were generated from the final compiled sequence. The 
EcoRI map was also generated from the final compiled sequence, but it was further characterized 
by sequencing across the EcoRI junctions in the parent cosmids. Fragment sizes for each 
restriction map are presented in FIG. 4. 

5 

EXAMPLE 8 
Genomic Organization of RRV 
The overall genomic organization of RRV "matches the general structure of gamma- 
herpesviruses, with blocks of shared ORFs interrupted at specific locations (referred to as 
10 divergent loci) where the viral genomes code for acquired cellular genes. The primate 

rhadinoviruses form a subset of the gamma-herpesviruses and their genomes are correspondingly 
more similar to each other than to other members of the family. 

The genomic sequence of RRV is presented in SEQ ID NO 1. FIG. 3 shows a schematic 
representation of the ORFs of RRV with a corresponding restriction map. FIG. 4 shows the 
15 location, size and description of the RRV ORFs. 

EXAMPLE 9 
Comparison of RRV and KSHV ORFs 

A comparison of corresponding repeats in RRV and KSHV is shown in FIG. 5. In 
20 addition, FIG. 5 presents data for RRV ORFs along with the results of the Gap analysis of ORFs 
shared by RRV, KSHV, and HVS. All HVS-like ORFs found in KSHV are found in RRV. A 
comparison table of interferon regulatory elements encoded by the RRV and KSHV genomes is 
shown in FIG. 6. 

EXAMPLE 10 

25 Comparison of RRV and HVS ORFs 

FIG. 7 shows the results of the Gap analysis of ORFs shared by RRV, KSHV, and HVS. 
In general, RRV and HVS ORFs are highly similar when the corresponding RRV and KSHV 
ORFs are highly similar, although the Gap values are generally lower. 

30 EXAMPLE 11 

ORFs Unique to RRV and KSHV 

RRV includes 14 genes which are not found in HVS (Rl SEQ ID NOS 2 and 3; R2 SEQ 
ID NOS 20 and 21; R3 SEQ ID NOS 24 and 25; R4 SEQ ID NOS 98 and 99; R5 SEQ ID NOS 
100 and 101; R6 SEQ ID NOS 114 and 115; R7 SEQ ID NOS 116 and 117; R8 SEQ ID NOS 118 
35 and 119; R9 SEQ ID NOS 120 and 121; RIO SEQ ID NOS 122 and 123; Rll SEQ ID NOS 124 
and 125; R12 SEQ ID NOS 126 and 127; R13 SEQ ID NOS 128 and 129; and R15 SEQ ID NOS 
160 and 161). These are designated in FIG. 3 as "R" ORFs. Of these fifteen genes, 11 have 
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counterpans in the genome of KSHV. R2 (SEQ ID NOs 20 and 21) and R3 (SEQ ID NOs 24 and 
25) are cytokine genes, R2 has functional homology to K2, the vIL-6 gene of KSHV. Gap 
analysis of the vIL-6 genes from KSHV and RRV shows no notable similarity, but both possess 
four conserved cysteines found in cellular IL-6. In addition, RRV vIL-6 has IL-6-like activity in 
5 cell culture. R3 has a small, but clear, similarity to KSHV K4, a vMIPip gene. It is the only 
vMIP gene in RRV, as compared to the three vMIP genes found in KSHV. 

RRV R6 through R13 are vIRFs as are KSHV K9 through Kl I (FIG. 6). K9, the most 
studied of the KSHV vIRFs, does not have a DNA binding domain, but has been demonstrated to 
inhibit the endogenous cellular interferon response pathways. Five of the RRV vIRFs (R6, R7, 

10 R8, R10, and Rl 1) are similar to K9, though only RIO has a similarity greater than 30%. The 

remaining similarities fall between 26% and 30%. There is no measurable similarity between any 
RRV vIRF and any KSHV vIRF other than K9. There is, however, a pattern of higher similarity 
between members of the RRV vIRF family. R6, R7, R8, and R9 are most similar to R10, Rll, 
R12, and R13, respectively, with the similarities falling between 50% and 62%. The pattern of 

15 similarity suggests a single, possibly recent, gene duplication event for RRV which increased the 
number of vIRFs in the genome from four to eight. 

The final RRV gene with a unique KSHV counterpart is R15, which has some similarity 
to K14, a viral NCAM Ox-2 homologue. The similarity between R15 and K14 (35.2%) is 
relatively low compared to most other shared proteins. 

20 A number of genes in RRV appear to be truly unique. Rl colocalizes with, but has no 

similarity to, Kl, a KSHV gene that has been demonstrated to have in vivo transforming ability. 
Kl and Rl both colocalize with ORF1, or STP (saimiri transforming protein), although both Kl 
and Rl are in opposite orientations compared to STP. A BLASTP search of GenBank using Rl 
reveals a limited amino-terminal similarity to a series of Fc receptors, including a potential 

25 transmembrane domain. These data suggest that Rl, like Kl and STP, may have transforming 
potential via transmembrane signaling. 

R4 and R5 are located between ORF 50 and 52, the same location as K8 and K8. 1 in 
KSHV; however, there is no similarity between either R4 or R5 and the KSHV proteins. A 
BLASTP search of GenBank failed to show any significant alignments with either R4 or R5, so 

30 their functions are unknown. 

RRV has no confirmed ORFs in the region corresponding to K12, the ubiquitously 
expressed kaposin gene. A large ORF exists to the right of ORF 71, but it has no apparent control 
regions (TATA box or polyadenylation signal), so it has not been designated as a true ORF, 
pending identification of transcripts from this region. No ORFs corresponding to KSHV K15 have 

35 been identified. 
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EXAMPLE 12 
Co-localization of Repeat Units in RRV and KSHV 

The RRV genome contains three highly repetitive regions, which correspond to three of 
5 the repetitive regions of KSHV: fink, zppa, and mdsk (FIG. 5). KSHV frnk and zppa, and the 
corresponding RRV repetitive regions, rDL-B and rDL-E, respectively, are tandem repeats. 

The first element of the RRV syko repeat is much lower inG + C content than the 
corresponding KSHV element, although the sizes are comparable (FIG. 5). The second element is 
over 700 bp longer than the corresponding KSHV element. The first element of the RRV vrtgo 
10 repeat is 30% longer than the corresponding KSHV element, and the second RRV element is over 
four times as long as the second KSHV element. There is no sequence similarity between the 
various elements of the two viruses nor is there any similarity between any two repeat sequences in 
RRV. 

Not all repeat elements found in KSHV have corresponding repeats in RRV. This 
15 includes the KSHV vnct and waka/jwka repeats. This also includes the moi repeat, which is 

located in the center of the KSHV ORF 73 and is responsible for the divergent lengths of RRV and 
KSHV ORF 73. Moi is described in the annotations to the KSHV GenBank entry as having 15 
different 1 1-16 bp repeats. The result of this repeat element is the presence in ORF 73 of a highly 
acidic central domain, with a large number of glutamate residues coded by a repeating GAG 
20 codon. KSHV ORF 73 is a potential leucine zipper protein, with a number of leucine zipper sites 
in the repeat region. RRV lacks the moi repeat and its concomitant acidic domain. It also lacks 
any evidence for a leucine zipper, indicating that the biology of ORF 73 in RRV may be 
substantially different than the biology of ORF 73 in KSHV. 

25 EXAMPLE 13 

Production of Simian Kaposi's Sarcoma (KS) and Lymphoprolifcrative Disorders Model 
This example describes how the RRV cloned above can be used to produce a non-human 
primate model for Kaposi's sarcoma and lymphoproliferative disorders. Four rhesus macaques 
(identification numbers 18483, 18503, 18540 and 18570) that were approximately 1.5 years old, 
30 and PCR- and seronegative for RRV were selected. To perform the antibody analysis, infected 

cells were solubilized with 0.5% Nonidet P-40 and 1% sodium deoxycholate in phosphate buffered 
saline, and clarified in a Beckman SW28 rotor at 23,500 rpm for 1 hour at 4°C. The clarified 
supernatant was used as antigen for coating enzyme-linked immunosorbent assay (ELISA) plates 
(500 ng/well). ELISAs were then performed essentially as described by Kodama et al. (AIDS Res 
35 Hum Retroviruses 5:337-43, 1989). 

All of the animals were then inoculated intravenously with cell-free supernatants 
containing the equivalent of 5 ng of p27 prepared from COS-1 cells transfected with an 
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SIVmac239 molecular clone (Endres et aL, 1995, SW. J Med. Primatol. 24:141-4). The PBMCs 
from all macaques were prescreened for in vitro susceptibility to virus infection as described by 
Naidu et al. (/. ViroL 62:4691-6, 1988). All inoculations and animal manipulations were 
performed according to institutional guidelines at the Oregon Regional Primate Research Center 
5 (Beaverton, OR). Every 3-4 days for 4 weeks, then at 2-week intervals, macaques were sedated 
with ketamine hydrochloride (10 mg per kilogram of body weight) and examined for fever, weight 
loss, cutaneous signs, lymphadenopathy, and hepatomegaly or splenomegaly. At these times, 
venipuncture was performed and blood specimens collected. Plasma was monitored for virus 
during the first 4 weeks with the SIV p27 enzyme-linked immunosorbent assay (ELISA) (Coulter 

10 Corp. Hiaieah, FL.). T cell subsets and B cells were measured by flow cytometry with the OKT4 
(CD4, Ortho), B9.ll (CD8, Coulter), and B-Ly-1 (CD20, Coulter) monoclonal antibodies. 

At 8 weeks post-SIV infection, rhesus macaques 18483 and 18570 were inoculated 
intravenously with 5 x 10 6 plaque forming units of gradient purified RRV that was grown and 
titered by plaque assay on primary rhesus fibroblasts. The two remaining macaques (18503 and 

15 18540) were kept as SIV-infected controls. Every 3-4 days for 2 weeks, once a week for 4 weeks, 
then at 2 week intervals, the macaques were examined and blood samples collected and analyzed. 
Virus isolations were performed by cocultivation of 2 x 10 5 PBMCs from each of the macaques 
with primary rhesus fibroblasts in duplicate. Cell cultures were monitored every 2-3 days for 3-4 
weeks for cytopathic effects characteristic of RRV. PBLs were also analyzed by PCR for the 

20 presence of viral DNA. PCR analysis for RRV was performed with the following oligonucleotide 
primers: vMIP-1, 5* CCTATGGGCTCCATGAGC 3' (SEQ ID NO 166); and vMIP-2, 5' 
ATCGTCAATCAGGCTGCG 3* (SEQ ID NO 167). The conditions for PCR were 94°C for 2 
minutes (1 cycle); 94 °C for 0.5 minutes, 50°C for 0.5 min, 72°C for 0.5 minutes (30 cycles); 
72°C extension for 5 minutes (1 cycle). Each PCR reaction used 0.1 Fg of total DNA, 50 pmole 

25 of each primer, 1 U of Vent polymerase, 40 /aM each of deoxynucleotide triphosphate, 10 mM 
KC1, 10 mM Tris-HCl (pH 8.8), 10 mM (NH 4 )2S04, 2 mM MgS0 4 and 0.1 % Triton X-100 in a 
final volume of 50 fiL. The PCR reactions were run out on a 1 % agarose gel, transferred to 
nitrocellulose, and probed with a 32 P-ATP-labeled oligonucleotide primer specific for vMIP-3 (5* 
ATATTAAACACTCGCCGC- 3' SEQ ID NO 168). Hybridizations were performed overnight at 

30 room temperature in 6X SSC, 0. 1 % SDS and 10 figimh E. coli tRNA. Southern blots were then 
washed with 2X SSC and 0.1% SDS twice at room temperature followed by two washes for 1 hour 
in 2X SSC and 0. 1 % SDS at 47°C. Bound probe was visualized by exposing NEN duPont 
reflection film to the washed membrane at 80°C with an NEN duPont Reflection screen. 

Infectious RRV was recovered from the peripheral blood mononuclear cells (PBMCs) of 

35 both RRV macaques injected with RRV as early as 4 weeks after inoculation for one macaque 
(18570) and 8 weeks for the other macaque (18483), but not from the control macaques. The 
peripheral blood leukocytes (PBL) from both macaques were also shown to harbor viral DNA as 
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detennined by PCR and Southern blot analysis for the viral MIP gene, as early as 4 weeks after 
inoculation for one macaque (18483) and as late as 14 weeks for the second macaque (18570). 
Additionally, antibody responses to RRV were observed as measured by ELISA in the RRV- 
infected macaques beginning 4 weeks post-infection, but not in the control macaques. 
5 Flow cytometry analysis (FACS) of PBLs at the indicated weeks post-infection (FIGS 8A- 

8D) showed there was limited CD4 + lymphocyte depletion after SIV infection in both groups of 
macaques followed by a rebound and sustained CD4+ lymphocyte counts. However, examination 
of CD20+ B lymphocytes revealed significant differences between the two groups. The two 
control macaques exhibited a dramatic and sustained decline in CD20+ B lymphocytes (FIGs. 8C 

10 and 8D), whereas both co-infected macaques exhibited a transient increase in B lymphocytes 
beginning 6 weeks after RRV infection (FIGs. 8A and 8B). The increase in CD20+ B 
lymphocytes correlated with the isolation and/or detection of RRV in both macaques; however, 
viral load did not appear to correlate with the increase in CD20+ B lymphocytes when all samples 
from each macaque were analyzed simultaneously. It has been reported that CD23, a B cell 

15 activation marker, is induced by RhEBV infection of macaques (Moghaddam et al. 1997, Science 
276:2020-33). FACS analysis of PBMCs from RRV-infected macaques revealed no detectable 
CD23+ cells. This would suggest that the mechanism responsible for increased numbers of 
CD20+ B lymphocytes following RRV infection differ from the activation of B lymphocytes by 
RhEBV. 

20 Routine physical examinations were performed on all four macaques, and early symptoms 

of SIV infection were observed in all four macaques by 2 weeks, including fever, rash and 
malaise. However, 11 weeks after inoculation with RRV, macaques 18483 and 18570 developed 
marked lymphadenopathy and splenomegaly, estimated to be enlarged 10 to 20 times the size of a 
normal spleen. In contrast, there was only slight lymph node enlargement in the control macaques 

25 not infected with RRV and no detectable enlargement of the spleen. Lymph node biopsies of the 
RRV-infected macaques revealed almost identical histology, characterized by a predominately 
follicular lesion with giant germinal centers and paracortical hyperplasia with increased 
vascularity, resembling angiofollicular lymph node hyperplasia associated with KSHV in 
Castleman's disease (Lachant et al. 1985, Am. J. Clin. Pathol. 83:27-33). In contrast, the lymph 

30 nodes of the control macaques exhibited atrophied lymphoid follicles and paracortical depletion 
characteristic of SIV-induced lymphoid atrophy (Chalifoux et al., 1987, Am. J. Pathol. 128:104- 
10; Ringler et al., 1989, Am. J, Pathol. 134:373-83; Wyand et al, 1989, Am. J. Pathol. 134:385- 
93). By FACS analysis, the majority of the lymph node mononuclear cells were CD20 + B 
lymphocytes in RRV-infected macaques, whereas CD4+ and CD8+ T lymphocytes predominated 

35 in the control macaques. 

The presence of viral DNA was determined by PCR analysis on DNA derived from PBLs. 
Detection of antibodies to RRV was determined by enzyme-linked immunosorbent assay (ELISA) 
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on plates coated with extracts derived from RRV-infected cells. By PCR analysis, RRV sequences 
were more prevalent in the lymph nodes than in the peripheral blood of RRV-infected macaques, 
whereas control macaques were negative for RRV sequences (FIGs. 9A and 9B). 

Additional disease manifestations were also observed in the RRV-infected macaques that 

5 parallel clinical features and B cell abnormalities observed in AIDS patients. 

Hypergammaglobulinemia was observed in the RRV-infected macaque that the virus was derived 
from, as well as in the macaques experimentally infected with RRV, whereas the two control 
macaques had gammaglobulin levels similar to those before SIV infection. In addition, one of two 
RRV-infected macaques (18570) developed severe autoimmune hemolytic anemia 30 weeks after 

10 RRV infection, a condition frequently observed in MCD patients (Parravicini et ah, 1997, Am. J. 
Pathol. 151:1517-22). 

The second of the two RRV-infected macaques developed other unique clinical 
manifestations that paralleled those of AIDS patients with KS. At 60 weeks post-RRV infection it 
developed a distended abdomen that was clinically evident upon physical examination. Palpation 

15 revealed a pronounced fluid accumulation in the peritoneal cavity. This animal was euthanized 
due to persistent fluid accumulation and hyperbilirubinemia. Necropsy analysis on this animal 
revealed an abundance of ascites fluid, which was comprised predominately of CD20 B cells, as 
identified by FACS analysis. In addition, this animal exhibited a mesenchymal proliferative lesion 
throughout the viscera, that was identified by histopathological examination to be retroperitoneal 

20 fibromatosis (RF). RF is an abnormal highly vascularized mesenchymal proliferative lesion that 
exhibits histological features resembling Kaposi's Sarcoma. Analysis of DNA isolated from the 
ascites and RF lesion by PCR with RRV MIP primers (given in Example 2) revealed a high viral 
load, implying RRV infection was responsible for these abnormal proliferations. 



Obtaining the RRV Viral Genome 

The RRV genome of the invention (SEQ ID NO 1) can be procured by de novo isolation 
from a viral culture. A biological sample of the virus (accession number VR-2601) may be 

30 obtained from the ATCC in Manassas, VA. This virus can be grown in vitro using primary rhesus 
fibroblasts (see Example 1). The virus is harvested from the culture supernatant and the infected 
host cells. Cellular debris is removed by centrifugation and intracellular virus particles may be 
released by sonication followed by centrifugation to pellet debris. The virus is then pelleted by 
centrifugation and further purified through a six-step sorbitol gradient. The interface containing 

35 the virus is collected and the virus then pelleted by centrifugation, and the viral DNA released by 
SDS disruption. Viral DNA may be isolated by CsCh gradient centrifugation. 



25 



EXAMPLE 14 



Other Methods to Prepare RRV Nucleic Acid Sequences 
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Obtaining Selected Polynucleotides from the Viral genome 

The isolated viral genome can be used as a source of polynucleotides as identified by the 
sequencers disclosed herein (SEQ ID NO 1). The polymerase chain reaction (PCR) may be used 
to amplify any polynucleotide selected from the known viral sequence using the viral genome as a 
5 source of template DNA. The template DNA may also be provided in the form of one or more 
cosmids that contain fragments of the viral genome. Alternately, cDNA, produced by reverse 
transcription of RNA extracted from RRV infected host cells, may be used as a template in a 
reverse-transcription PCR (RT-PCR) reaction. Methods and conditions for PCR and RT-PCR 
amplification are described in Innis et al. ( PCR Protocols , A Guide to Methods and 

10 Applications , 1990, Innis et al. (eds.), 21-27, Academic Press, Inc., San Diego, California). 

The selection of PCR primers may be made according to the portions of the genome to be 
amplified. Primers may be chosen to amplify small fragments of the genome, ORFs or fragments 
including many contiguous genes from the genome. Variations in amplification conditions may be 
required to accommodate primers of differing lengths, and such considerations are well known in 

15 the art and are discussed in Innis at al. ( PCR Protocols, A Guide to Methods and 

Applications , 1990, Innis et al. (eds.), 21-27, Academic Press, Inc., San Diego, California), 
Sambrook et al. (Molecular Cloning: A Laboratory Manual , 2 nd ed., vol. 1-3, Cold Spring 
Harbor, New York. 1989) and Ausubel et al. ( Current Protocols in Molecular Biology , Greene 
Pubi. Assoc. & Wiley-Intersciences. 1987). For example, the ORF corresponding to the MIP 

20 gene may be amplified from an RRV genomic (or appropriate eosmid) template using die 

following pair of primers: 5' ATGAGGGGCCTTTTCGTGTGC 3' (SEQ ID NO 169) and 5' 
CTGAATCCCGCTGCCAAGGCC 3' (SEQ ID NO 170). 

Likewise, the ORF corresponding to the IL-6 gene may be amplified from an RRV 
genomic (or appropriate eosmid) template using the following pair of primers: 5* 

25 ATGTTCCCTGTCTGGTTCGTC 3' (SEQ ID NO 171) and 5* TTACATCATAGCTATTGCGCG 
3' (SEQ ID NO 172). 

Such primers are illustrative only and it will be readily appreciated by one of ordinary 
skill in the art that many different primers may be selected from the sequence disclosed and used 
in PCR amplification reactions to amplify DNA sequences of interest from the RRV genome. 

30 Polynucleotides that may be obtained by the above methods include, for example: the 

entire polynucleotide genome of RRV as shown in SEQ ID NO 1; ORFs of this genome; 
oligonucleotides comprising at least 15, 20, 30, 40, 50, 70, 100 and 150 consecutive nucleotides 
of the genome sequence as shown in SEQ ID NO 1; nucleic acid sequences defined by nucleotides 
1 to 11031 of SEQ ID NO 1 and nucleotides 21625 to 133719 of SEQ ID NO 1; and ORFs 

35 selected from these nucleic acid sequences. It is readily apparent that fragments of any length may 
be made using the above methods and information. 
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EXAMPLE 15 

Therapeutic and Diagnostic Uses of the RRV IL-6 Protein 

As disclosed herein, the genome of RRV possesses an IL-6 gene (FIGS. 1 and 10 and 
5 SEQ ID NO 20) similar to that found in KSHV. The IL-6 and MIP proteins of KSHV are thought 
to be important in disease pathology, such as in Kaposi's sarcoma. The primary structure of the 
RRV IL-6 protein is shown in FIG. 10 (SEQ ID NO 21). Given this sequence information, one 
can readily make derivative proteins of RRV IL-6. Such derivative proteins include proteins that 
differ from the primary amino acid sequence as shown in FIG. 10 (SEQ ID NO 21) by one or 

10 more conservative amino acid substitutions. Examples of such conservative substitutions are given 
in the DEFINITIONS section of the specification. Derivative proteins also include proteins 
consisting of an amirfo acid sequence that has a defined degree of amino acid similarity with the 
RRV IL-6 protein. For instance, such derivative proteins will typically have at least 50% 
sequence similarity (and may have at least 60%, 70%, 80%, 90%, 95%, 98% or even 99% 

15 sequence similarity) with the RRV IL-6 protein. Such derivative proteins will not only share 
sequence similarity with KSHV IL-6 but will also possess IL-6 biological activity. 

IL-6 is a cytokine known to have pleiotropic immunological effects including anti- 
inflammatory and immunosuppressive effects, and may be used in several therapeutic and 
diagnostic applications. RRV IL-6 of the invention may be likewise be used. For instance, IL-6 

20 may be used to induce stimulation of hematopoietic stem cells, and to encourage proliferation, 
differentiation and terminal maturation of erythroid cells from hematopoietic cells. Thus, for 
instance, RRV IL-6 may be used in vivo or ex vivo to treat diseases that involve leukopenia and 
thrombocytopenia. Such uses include stimulation of hematopoietic cells of radiotherapy patients or 
people exposed to radiation accidentally. IL-6 may be used in such applications in conjunction 

25 with GM-CSF (granulocyte-macrophage stimulating factor) (see U.S. Patent Nos. 5,610,056 and 
5,620,685, herein incorporated by reference). IL-6 can also be used to stimulate growth of 
megakaryocytes and platelets, and for the inhibition of tumor growth (see U.S. Patent No. 
5,620,685, herein incorporated by reference). IL-6 can also be used for the treatment of 
leukemias, such as chronic myeloid leukemia (CML) and acute myeloid leukemia, by inducing 

30 terminal differentiation of cells with IL-6 (see WO 90/01943, herein incorporated by reference). 
RRV IL-6 may be used for all such applications. 

Therapeutic applications may involve the administration of RRV IL-6 in a number of 
ways. RRV IL-6 may be administered in vivo, e.g., by injection systemically or locally, for 
instance, into a subject. Many other forms of in vivo administration are possible including 

35 intravenous, subcutaneous, across a mucous membrane (anally, vaginally or sublingually), 

transdermal or by direct injection. Additionally, it may be administered ex vivo, by the removal of 
cells from a subject, the treatment of these cells in vitro with RRV IL-6, and the replacement of 
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ihese cells into the subject. Another recently developed method of delivery of a protein drug is by 
introducing the gene coding for the drug into a subject, for instance within the genome of a virus, 
such as an adenovirus or a retrovirus, whereby the protein is expressed in the subject. Other 
modes of administration are provided in Example 25. 
5 Such examples are provided for illustrative purposes only and it will be seen that RRV IL- 

6 may be used in a variety of topical and systemic immunological treatments where it would be 
desirable to stimulate cell proliferation or to induce anti-inflammatory or immunosuppressive 
effects. Additionally, IL-6 of the invention may be used for research and diagnostic purposes as 
discussed generally herein. For instance, IL-6 may be used to produce antibodies for diagnostic 
10 purposes to diagnose diseases characterized by increased or decreased production of IL-6, and the 
nucleic acid sequence encoding IL-6 may be used to produce probes and primers for diagnostic 
and research purposes or for gene therapy applications. The IL-6 could also be used as a targeting 
molecule for identifying cells with receptors for IL-6, and for directing therapeutic agents to these 
cells, for example by linking detector or therapeutic molecules to IL-6. 

15 

EXAMPLE 16 
Therapeutic and Diagnostic Uses of the RRV MIP Protein 

The genome of RRV as disclosed herein possesses an MIP gene (FIGS. 1 and 1 1 and SEQ 

20 ID NO 24) similar to that found in KSHV. The primary structure of the RRV MIP protein is 
shown in FIG. 1 1 (SEQ ID NO 25). Given this sequence information, one can readily make 
derivative proteins of RRV MIP. Such derivative proteins include proteins that differ from the 
primary amino acid sequence as shown in FIG. 1 1 (SEQ ID NO 25) by one or more conservative 
amino acid substitutions. Derivative proteins also include proteins consisting of an amino acid 

25 sequence that has a defined degree of amino acid similarity with the RRV MIP protein. Typically 
such derivative proteins will have at least 50% sequence similarity with the RRV MIP protein, and 
may have at least 60%, 70%, 80%, 90%, 95%, 98%, or even 99% sequence similarity. Such 
derivative proteins will not only share sequence similarity with KSHV MIP but will also possess 
MIP biological activity. MIP biological activity can be detected and quantified using bioassays as 

30 described in Kedal et al. (Science 277: 1656-9, 1997) and Boshoff et al. (Science 278:290-4, 1997) 
that measure MIP concentrations using HIV inhibition and calcium mobilization, respectively. 

MIP is a cytokine that activates neutrophils to undergo an oxidative burst and is also 
intrinsically pyrogenic. The MIP genes and proteins of the invention may be used in several 
therapeutic and diagnostic ways. The RRV MIP protein may be used for the same applications as 

35 other MIP proteins. Treatment of wounds to promote healing by application of MIP to the wound 
site is discussed in U.S. Patent No. 5,145,676. U.S. Patent No. 5,474,983 (herein incorporated 
by reference) discusses various methods of treatment of inflammatory diseases including asthma, 
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allergies and dermatitis. U.S. Patent No. 5,656,724 (herein incorporated by reference) discloses 
the use of MIP to suppress proliferation of dividing myeloid cells e.g., for the treatment of 
neutropenia. Use of MIP to inhibit HIV is discussed by Kedal et ah (Science 277:1656-9, 1997). 
RRV MIP may be used for all such applications. 
5 As illustrated for IL-6 above, MIP may be administered in various ways to provide a 

therapeutic effect including in vivo, ex vivo and by gene therapy. 

Such examples are provided for illustrative purposes only and it will be seen that MIP 
may be used in a variety of topical, systemic, in vivo and ex vivo immunological treatments where 
it would be desirable to activate neutrophils or to induce fever. Additionally, MIP of the invention 

10 may be used for diagnostic purposes as discussed generally herein. For instance, MIP may be 

used to produce antibodies for diagnostic purposes to diagnose diseases characterized by increased 
or decreased production of MIP, and the nucleic acid sequence encoding MIP may be used to 
produce probes and primers for diagnostic and research purposes, or for gene therapy applications. 

The MIP could also be used as a targeting molecule for identifying cells with receptors for 

15 MIP, and for directing therapeutic agents to these cells, for example by linking detector or 
therapeutic molecules to MIP. 

Although Examples 15 and 16 provide examples of therapeutic uses of the RRV IL-6 and 
MIP proteins, any of the other proteins encoded by the RRV can also be administered 
therapeutically, or diagnostically. For example, RRV proteins that induce pathological or 

20 physiological conditions in a recipient can be administered to stimulate that condition for study, or 
to provide an animal or human model of the condition. That model can then be used to study the 
condition, or treatments for it. 

EXAMPLE 17 
Expression of RRV cDNA Sequences 

25 With the provision of the RRV genomic (SEQ ID NO 1) and cDNAs (even-numbered 

SEQ ID Nos 2-164), the expression and purification of any of the RRV proteins (odd-numbered 
SEQ ID Nos 3-165), from any species, by standard laboratory techniques is now enabled. 
Fragments amplified as described herein can be cloned into standard cloning vectors and expressed 
in commonly used expression systems consisting of a cloning vector and a cell system in which the 

30 vector is replicated and expressed. Purified proteins may be used for functional analyses, antibody 
production, diagnosis, and patient therapy. Furthermore, the DNA sequences of the RRV cDNAs 
can be manipulated in studies to understand the expression of RRV genes and the function of their 
products. Mutant forms of RRV may be isolated based upon information contained herein, and 
may be studied in order to detect alteration in expression patterns in terms of relative quantities, 

35 and functional properties of the encoded mutant RRV protein. Partial or full-length cDNA 
sequences, which encode for the protein, may be ligated into bacterial expression vectors. 
Methods for expressing large amounts of protein from a cloned gene introduced into E. coil may 
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be utilized for the purification, localization and functional analysis of proteins. For example, 
fusion proteins consisting of amino terminal peptides encoded by a portion of the E. coli lacZ or 
trpE gene linked to RRV protein may be used to prepare polyclonal and monoclonal antibodies 
against this protein. Thereafter, these antibodies may be used to purify proteins by immunoaffmity 
5 chromatography, in diagnostic assays to quantitate the levels of protein and to localize proteins in 
tissues and individual cells by immunofluorescence and microscopy. 

Intact native protein may also be produced in E. coli in large amounts for functional 
studies. Standard prokaryotic cloning vectors may also be used, for example pBR322, pUC\ & or 
pUCl9 as described in Sambrook et al. ( Molecular Cloning: A Laboratory Manual , 2 nd ed. r vol. 1- 

10 3, Cold Spring Harbor, New York. 1989). Nucleic acids of RRV may be cloned into such vectors 
which may then be transformed into bacteria such as E. coli which may then be cultured so as to 
express the protein of interest. Other prokaryotic expression systems include, for instance, the 
arabinose-induced pBAD expression system that allows tightly controlled regulation of expression, 
the IPTG-induced pRSET system that facilitates rapid purification of recombinant proteins and the 

15 IPTG-induced pSE402 system that has been constructed for optimal translation of eukaryotic 

genes. These three systems are available commercially from Invitrogen and, when used according 
to the manufacturer's instructions, allow routine expression and purification of proteins. 

Methods and plasmid vectors for producing fusion proteins and intact native proteins in 
bacteria are described in Sambrook et al. (Molecular Cloning: A Laboratory Manual , Cold Spring 

20 Harbor, New York, 1989, Chapter 17). Such fusion proteins may be made in large amounts, are 
easy to purify, and can be used to elicit antibody response. Native proteins can be produced in 
bacteria by placing a strong, regulated promoter and an efficient ribosome binding site upstream of 
the cloned gene. If low levels of protein are produced, additional steps may be taken to increase 
protein production; if high levels of protein are produced, purification is relatively easy. Suitable 

25 methods are presented in Sambrook et al. ( Molecular Cloning: A Laboratory Manual , Cold 

Spring Harbor, New York, 1989) and are well known in the art. Often, proteins expressed at high 
levels are found in insoluble inclusion bodies. Methods for extracting proteins from these 
aggregates are described by Sambrook et al. (Molecular Cloning: A Laboratory Manual , Cold 
Spring Harbor, New York, 1989, Chapter 17). 

30 Vector systems suitable for the expression of lacZ fusion genes include the pUR series of 

vectors (Ruther and Muller-Hill, 1983, EMBO J. 2:1791), pEXl-3 (Stanley and Luzio, 1984, 
EMBO J. 3:1429) and pMRlOO (Gray et al., 1982, Proc. Nail. Acad. ScL USA 79:6598). Vectors 
suitable for the production of intact native proteins include pKC30 (Shimatake and Rosenberg, 
1981, Nature 292:128), pKK177-3 (Amann and Brosius, 1985, Gene 40:183) and pET-3 (Studiar 

35 and Moffatt, 1986, J. Mol. Biol. 189:1 13). The RRV fusion protein may be isolated from protein 
gels, lyophilized, ground into a powder and used as an antigen. The DNA sequence can also be 
transferred to other cloning vehicles, such as other plasmids, bacteriophages, cosmids, animal 
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viruses and yeast artificial chromosomes (YACs) (Burke et al., 1987, Science 236:806-12). These 
vectors may then be introduced into a variety of hosts including somatic cells, and simple or 
complex organisms, such as bacteria, fungi (Timberlake and Marshall, 1989, Science 244:1313-7), 
invertebrates, plants (Gasser and Fraley, 1989, Science 244:1293), and mammals (Pursel et al., 
5 1989, Science 244:1281-8), which cell or organisms are rendered transgenic by the introduction of 
one or more heterologous RRV DNAs. 

Various yeast strains and yeast-derived vectors are commonly used for expressing and purifying 
proteins, for example, Pichia pastoris expression systems are available from Invitrogen (Carlsbad, 
CA). Such systems include suitable Pichia pastoris strains, vectors, reagents, transformants, 

10 sequencing primers and media. 

Non-yeast eukaryotic vectors can also be used for expression of the RRV proteins. 
Examples of such systems are the well known Baculovirus system, the Ecdysone-inducible 
mammalian expression system that uses regulatory elements from Drosophila melanogaster to 
allow control of gene expression, and the Sindbis viral expression system that allows high level 

15 expression in a variety of mammalian cell lines. These expression systems are available from 
Invitrogen. 

For expression in mammalian cells, the cDNA sequence may be ligated to heterologous 
promoters, such as the simian virus SV40, promoter in the pSV2 vector (Mulligan and Berg, 
1981, Proc. Natl. Acad. Sci. USA 78:2072-6), and introduced into cells, such as monkey COS-l 

20 cells (Gluzman, 1981, Cell 23:175-82), to achieve transient or long-term expression. The stable 
integration of the chimeric gene construct may be maintained in mammalian cells by biochemical 
selection, such as neomycin (Southern and Berg, 1982, J. Mol. AppL Genet. 1:327-41) and 
mycophoenolic acid (Mulligan and Berg, 1981, Proc. Natl. Acad. Sci. USA 78:2072-6). 

DNA sequences can be manipulated with standard procedures such as restriction enzyme 

25 digestion, fill-in with DNA polymerase, deletion by exonuclease, extension by terminal 
deoxynucleotide transferase, ligation of synthetic or cloned DNA sequences, site-directed 
sequence-alteration via single-stranded bacteriophage intermediate or with the use of specific 
oligonucleotides in combination with PCR. 

The cDNA sequence (or portions derived from it) or a mini gene (a cDNA with an intron 

30 and its own promoter) may be introduced into eukaryotic expression vectors by conventional 

techniques. These vectors are designed to permit the transcription of the cDNA eukaryotic cells 
by providing regulatory sequences that initiate and enhance the transcription of the cDNA and 
ensure its proper splicing and polyadenylation. Vectors containing the promoter and enhancer 
regions of the SV40 or long terminal repeat (LTR) of the Rous Sarcoma virus and polyadenylation 

35 and splicing signal from SV40 are readily available (Mulligan and Berg, 1981, Proc. Natl. Acad. 
Sci. USA 78:2072-6; Gorman et al., 1982, Proc. Natl. Acad. Sci USA 78:6777-81). The level of 
expression of the cDNA can be manipulated with this type of vector, either by using promoters 
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that have different activities (for example, the baculovirus pAC373 can express cDNAs at high 
levels in S. frugiperda cells (Summers and Smith, 1985, Genetically Altered Viruses and the 
Environment , Fields et al. (Eds.) 22:319-328, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, New York.) or by using vectors that contain promoters amenable to modulation, for 
5 example, the giucocorticoid-responsive promoter from the mouse mammary tumor virus (Lee et 
al., 1982, Nature 294:228). The expression of the cDNA can be monitored in the recipient cells 
24 to 72 hours after introduction (transient expression). 

In addition, some vectors contain selectable markers such as the gpt (Mulligan and Berg, 
1981, Proc. NatL Acad. Sci. USA 78:2072-6) or neo (Southern and Berg, 1982, J. MoL Appl. 
10 Genet. 1:327-41) bacterial genes. These selectable markers permit selection of transfected cells 

that exhibit stable, long-term expression of the vectors (and therefore the cDNA). The vectors can 
be maintained in the cells as episomal, freely replicating entities by using regulatory elements of 
viruses such as papilloma (Sarver et al., 1981, MoL Cell BioL 1:486) or Epstein-Barr (Sugden et 
al., 1985, MoL Cell Biol. 5:410). Alternatively, one can also produce cell lines that have 
15 integrated the vector into genomic DNA. Both of these types of cell lines produce the gene 

product on a continuous basis. One can also produce cell lines that have amplified the number of 
copies of the vector (and therefore of the cDNA as well) to create cell lines that can produce high 
levels of the gene product (Alt et al., 1978, J. BioL Chem. 253:1357). 

The transfer of DNA into eukaryotic, in particular human or other mammalian cells, is 
20 now a conventional technique. The vectors are introduced into the recipient cells as pure DNA 
(transfection) by, for example, precipitation with calcium phosphate (Graham and vander Eb, 
1973, Virology 52:466) or strontium phosphate (Brash et al., 1987, MoL Cell BioL 7:2013), 
electroporation (Neumann et al., 1982, EMBO J. 1:841), lipofection (Feigner et al., 1987, Proc. 
Natl. Acad. Sci USA 84:7413), DEAE dextran (McCuthan et al., 1968, 7. Natl. Cancer Inst. 
25 41:351), microinjection (Mueller et al., 1978, Cell 15:579), protoplast fusion (Schafner, 1980, 
Proc. Natl. Acad. Sci. USA 77:2163-7), or pellet guns (Klein et al., 1987, Nature 327:70). 
Alternatively, the cDNA can be introduced by infection with virus vectors. Systems are developed 
that use, for example, retroviruses (Bernstein et ah, 1985, Gen. Engrg. 7:235), adenoviruses 
(Ahmad et aL, 1986, J. ViroL 57:267), or Herpes virus (Spaete et al., 1982, Cell 30:295). 
30 These eukaryotic expression systems can be used for studies of RRV genes and mutant 

forms of these genes, the RRV proteins and mutant forms of these proteins. Such uses include, for 
example, the identification of regulatory elements located in the 5' region of RRV genes on 
genomic clones that can be isolated from genomic DNA libraries, such as human or mouse 
libraries, using the information contained in the present invention. The eukaryotic expression 
35 systems may also be used to study the function of the normal complete protein, specific portions of 
the protein, or of naturally occurring or artificially produced mutant proteins. Naturally occurring 
RRV wild-type or mutant proteins may exist in a variety of cancers or diseases, while artificially 
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produced mutant proteins can be designed by site directed mutagenesis as described above. These 
latter studies may probe the function of any desired amino acid residue in the protein by mutating 



mammalian cells from other species or non-mammalian cells as desired. The choice of cell is 
determined by the purpose of the treatment. For example, monkey COS cells (Gluzman, 1981, 
Cell 23: 175-82) that produce high levels of the SV40 T antigen and permit the replication of 
vectors containing the SV40 origin of replication may be used. Similarly, Chinese hamster ovary 

10 (CHO), mouse NIH 3T3 fibroblasts or human fibroblasts or lymphoblasts may be used. 

One method that can be used to express RRV polypeptides from the cloned RRV cDNA 
sequence in mammalian cells is to use the cloning vector, pXTI. This vector is commercially 
available from Stratagene, contains the Long Terminal Repeats (LTRs) and a portion of the GAG 
gene from Moloney Murine Leukemia Virus. The position of the viral LTRs allows highly 

15 efficient, stable transfection of the region within the LTRs. The vector also contains the Herpes 
Simplex Thymidine Kinase promoter (TK), active in embryonal cells and in a wide variety of 
tissues in mice, and a selectable neomycin gene conferring G418 resistance. Two unique 
restriction sites Bglll and Xhol are directly downstream from the TK promoter. RRV cDNA, 
including the entire open reading frame for an RRV protein such as IL-6 and the 3' untranslated 

20 region of the cDNA is cloned into one of the two unique restriction sites downstream from the 
promoter. 

The ligated product is transfected into mouse NIH 3T3 cells using Lipofectin (Life 
Technologies, Inc.) under conditions outlined in the product specification. Positive transfectants 
are selected after growing the transfected cells in 600 /ig/ml G418 (Sigma, St. Louis, MO). The 

25 protein is released into the supernatant and may be purified by standard immunoaffinity 

chromatography techniques using antibodies raised against RRV proteins (see Examplel8). 

Expression of RRV proteins in eukaryotic cells can be used as a source of proteins to raise 
antibodies. The RRV proteins may be extracted following release of the protein into the 
supernatant as described above, or, the cDNA sequence may be incorporated into a eukaryotic 

30 expression vector and expressed as a chimeric protein with, for example, P-globin. Antibody to 
P-globin is thereafter used to purify the chimeric protein. Corresponding protease cleavage sites 
engineered between the p-globin gene and the cDNA are then used to separate the two polypeptide 
fragments from one another after translation. One useful expression vector for generating p- 
globin chimeric proteins is pSG5 (Stratagene). This vector encodes rabbit p-globin. 

35 The present invention thus encompasses recombinant vectors which comprise all or part of 

RRV genome or cDNA sequences, for expression in a suitable host. The RRV DNA is opcratively 
linked in the vector to an expression control sequence in the recombinant DNA molecule so that a 



5 



the nucle^de coding for that amino acid. 

Using the above techniques, the expression vectors containing RRV genes or cDNA 
sequence or fragments or variants or mutants thereof can be introduced into human cells, 
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RRV polypeptide can be expressed. The expression control sequence may be selected from the 
group consisting of sequences that control the expression of genes of prokaryotic or eukaryotic 
cells and their viruses and combinations thereof. The expression control sequence may be 
specifically selected from the group consisting of the lac system, the tip system, the tac system, the 
5 trc system, major operator and promoter regions of phage lambda, the control region of fd coat 
protein, the early and late promoters of SV40, promoters derived from polyoma, adenovirus, 
retrovirus, baculovirus and simian virus, the promoter for 3-phosphogiycerate kinase, the 
promoters of yeast acid phosphatase, the promoter of the yeast alpha-mating factors and 
combinations thereof. 

10 The host cell, which may be transfected with the vector of this invention, may be selected 

from the group consisting of: E. coli, Pseudomonas , Bacillus subtilis. Bacillus stearothermophilus 
or other bacilli; other bacteria; yeast; fungi; plant; insect; mouse or other animal; or human tissue 
cells. 

It is appreciated that for mutant or variant RRV DNA sequences, similar systems are 
15 employed to express and produce the mutant or variant product. 

EXAMPLE 18 
Production of Antibodies to RRV and RRV Proteins 

Polyclonal or monoclonal antibodies (including humanized monoclonal antibodies) and 
20 fragments of monoclonal antibodies such as Fab, F(ab*)2 and Fv fragments, as well as any other 
agent capable of specifically binding to an RRV protein, may be produced to the RRV virion or 
any of the RRV proteins (for example odd-numbered SEQ ID Nos 3-165). Optimally, antibodies 
raised against an RRV protein would specifically detect the RRV protein of interest (or a virion 
containing the protein of interest). That is, such antibodies would recognize and bind the protein 
25 and would not substantially recognize or bind to other proteins found in human or other cells. The 
determination that an antibody specifically detects the RRV protein is made by any one of a 
number of standard immunoassay methods; for instance, the Western blotting technique 
(Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual , Cold Spring Harbor 
Laboratory, Cold Spring Harbor, New York). 
30 To determine that a given antibody preparation (such as one produced in a mouse) 

specifically detects the RRV protein by Western blotting, total cellular protein is extracted from 
murine myeloma cells and electrophoresed on a SDS-polyacrylamide gel. The proteins are then 
transferred to a membrane (for example, nitrocellulose) by Western blotting, and the antibody 
preparation is incubated with the membrane. After washing the membrane to remove non- 
35 specifically bound antibodies, the presence of specifically bound antibodies is detected by the use 

of an anti-mouse antibody conjugated to an enzyme such as alkaline phosphatase; application of the 
substrate 5-bromo-4-chloro-3~indolyl phosphate/nitro blue tetrazolium results in the production of 
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a dense blue compound by immuno-locaJized alkaline phosphatase. Antibodies which specifically 
detect an RRV protein will, by this technique, be shown to bind to the RRV protein band (which 
will be localized at a given position on the gel determined by its molecular weight). Non-specific 
binding of the antibody to other proteins (such as serum albumin) may occur and may be 
5 detectable as a weak signal on the Western blot. The non-specific nature of this binding will be 
recognized by one skilled in the art by the weak signal obtained on the Western blot relative to the 
strong primary signal arising from the specific antibody-VIAP protein binding. 

A substantially pure virion can be obtained, or substantially pure RRV protein suitable for 
use as an immunogen is isolated by purification or recombinant expression. Concentration of 

10 protein in the final preparation is adjusted, for example, by concentration on an Amicon filter 

device, to the level of a few micrograms per milliliter. Monoclonal or polyclonal antibody to the 
protein can then be prepared as described by Harlow and Lane ( Antibodies, A Laboratory Manual, 
Cold Spring Harbor Press. 1988). 

Alternatively, antibodies may be raised against synthetic RRV peptides synthesized on a 

15 commercially available peptide synthesizer (see Example 26) based upon the predicted amino acid 
sequence of the RRV protein (Harlow and Lane, Antibodies, A Laboratory Manual, Cold Spring 
Harbor Press. 1988). 

Another method of raising antibodies against RRV proteins is by subcutaneous injection of 
a DNA vector which expresses the RRV protein into laboratory animals, such as mice. Delivery 
20 of the recombinant vector into the animals may be achieved using a hand-held form of the Biolistic 
system (Sanford et al. ? 1987, Particulate Sci. TechnoL 5:27-37) as described by Tang et al. 
{Nature 356:152-4, 1992). Expression vectors suitable for this purpose may include those which 
express the RRV protein under the transcriptional control of either the human p-actin promoter or 
the cytomegalovirus (CMV) promoter. 

25 

Monoclonal Antibody Production by Hybridoma Fusion 

Monoclonal antibody to epitopes of the RRV protein identified and isolated as described 
can be prepared from murine hybridomas according to the classical method of Kohler and Milstein 
{Nature 256:495, 1975) or derivative methods thereof. Briefly, a mouse is repetitively inoculated 

30 with a few micrograms of the selected protein over a period of a few weeks. The mouse is then 
sacrificed, and the antibody-producing cells of the spleen isolated. The spleen cells are fused by 
means of polyethylene glycol with mouse myeloma cells, and the excess unfused cells destroyed by 
growth of the system on selective media comprising aminoptcrin (HAT media). The successfully 
fused cells are diluted and aliquots of the dilution placed in wells of a microtiter plate where 

35 growth of the culture is continued. Antibody-producing clones are identified by detection of 
antibody in the supernatant fluid of the wells by immunoassay procedures, such as ELISA, as 
originally described by Engvall {Enzymol. 70:419, 1980), and derivative methods thereof. 
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Selected positive clones can be expanded and their monoclonal antibody product harvested for use. 



Detailed procedures for monoclonal antibody production are described in Harlow and Lane 
( Antibodies: A Laboratory Manual . 1988, Cold Spring Harbor Laboratory, New York). 

5 Polyclonal Antibody Production by Immunization 

Polyclonal antiserum containing antibodies to heterogeneous epitopes of a single protein 
can be prepared by immunizing suitable animals with the expressed protein (for example see 
Example 17), which can be unmodified or modified to enhance immunogenicity. Effective 
polyclonal antibody production is affected by many factors related both to the antigen and the host 

10 species. For example, small molecules tend to be less immunogenic than others and may require 
the use of carriers and adjuvant. Also, host animals vary in response to site of inoculations and 
dose, with both inadequate or excessive doses of antigen resulting in low titer antisera. Small 
doses (ng level) of antigen administered at multiple intradermal sites appears to be most reliable. 
An effective immunization protocol for rabbits can be found in Vaitukaitis et al. (J. Clin. 

15 Endocrinol. Metab. 33:988-91, 1971). 

Booster injections can be given at regular intervals, and antiserum harvested when 
antibody titer thereof, as determined semi-quantitatively, for example, by double immunodiffusion 
in agar against known concentrations of the antigen, begins to fall. See, for example, Ouchterlony 
et al. (In: Handbook of Experimental Immunology , Wier, D. (ed.). Chapter 19. Blackwell. 

20 1973). Plateau concentration of antibody is usually in the range of 0. 1 to 0.2 mg/ml of serum 
(about 12 ^M). Affinity of the antisera for the antigen is determined by preparing competitive 
binding curves, as described, for example, by Fisher ( Manual of Clinical Immunology , Chapter 
42. 1980). 

25 Labeled Antibodies 

Antibodies of the present invention can be conjugated with various labels for their direct 
detection (see Chapter 9, Harlow and Lane, Antibodies: A Laboratory Manual . 1988). The label, 
which may include, but is not limited to, a radiolabel, enzyme, fluorescent probe, or biotin, is 
chosen based on the method of detection available to the user. 



An embodiment of the present invention is a method for screening a subject to determine 
if the subject has been infected with RRV. One major application of the RRV sequence 
35 information presented herein is in the area of diagnostic testing for predisposition to a disease 

(such as Kaposi's Sarcoma and lymphopToIiferativc disorders) that develops in at least a sub-set of 
hosts infected with RRV. The gene sequence of the RRV genes, including intron-exon boundaries 



30 



EXAMPLE 19 



Diagnostic Methods 
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is also useful in such diagnostic methods. The method includes providing a biological sample 
obtained from the subject, in which sample includes DNA or RNA, and providing an assay for 
detecting in the biological sample the presence of any of the RRV genes or proteins. Suitable 
biological samples include samples obtained from body cells, such as those present in peripheral 
5 blood, urine, saliva, tissue biopsy, surgical specimen, fine needle aspirate specimen, amniocentesis 
samples and autopsy material. The detection in the biological sample may be performed by a 
number of methodologies, as outlined below. 

The foregoing assay may be assembled in the form of a diagnostic kit and preferably 
comprises either: hybridization with oligonucleotides; PCR amplification of the gene or a part 

10 thereof using oligonucleotide primers; RT-PCR amplification of the RNA or a part thereof using 
oligonucleotide primers; or direct sequencing of any of the RRV genes present in a subject using 
oligonucleotide primers. The efficiency of these molecular genetic methods should permit the 
rapid identification of patients infected with RRV. 

One embodiment of such detection techniques is the polymerase chain reaction 

15 amplification of reverse transcribed RNA (RT-PCR) of RNA isolated from cells (for example 

lymphocytes) followed by direct DNA sequence determination of the products. The presence of 
one or more RRV genes is taken as indicative of a potential RRV infection. 

Alternatively, DNA extracted from lymphocytes or other cells may be used directly for 
amplification. The direct amplification from genomic DNA would be appropriate for analysis of 

20 an entire RRV gene including regulatory sequences located upstream and downstream from the 
open reading frame. Recent reviews of direct DNA diagnosis have been presented by Caskey 
(Science 236:1223-1228, 1989) and by Landegren et al. (Science 242:229-37, 1989). 

Further studies of RRV genes isolated from subjects may reveal particular mutations, 
deletions or alterations in gene sequences, which occur at a high frequency within particular 

25 populations of individuals. In this case, rather than sequencing the entire RRV gene, it may be 
possible to design DNA diagnostic methods to specifically detect the most common RRV 
mutations, deletions or alterations in gene sequences. 

The detection of specific DNA mutations or alterations in gene sequences may be 
achieved by methods such as hybridization using specific oligonucleotides (Wallace et al., 1986, 

30 Cold Spring Harbor Symp. Quant. Biol. 51:257-61), direct DNA sequencing (Church and Gilbert, 
1984, Proc. Natl. Acad. ScL USA. 81:1991-5), the use of restriction enzymes (Flavell et aL, 1978, 
Cell 15:25; Geever et al., 1981, Proc. Natl. Acad. Sci USA 78:5081), discrimination on the basis 
of electrophoretic mobility in gels with denaturing reagent (Myers and Maniatis, 1986, Cold 
Spring Harbor Symp. Quant. Biol. 51:275-284), RNase protection (Myers ct al., 1985, Science 

35 230:1242), chemical cleavage (Cotton et aL, 1985, Proc. Natl. Acad. Sci. USA 85:4397-401), 
and the ligase-mediated detection procedure (Landegren et aL, 1988, Science 241:1077). 

Oligonucleotides specific to normal, mutant or alterative sequences are chemically 
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synthesized using commercially available machines, labeled radioactiveiy with isotopes (such as 
32 P) or non-radioactively, with tags such as biotin (Ward and Langer et al., 1981 . Proc. Natl. 
Acad. Sci. USA 78:6633-57), and hybridized to individual DNA samples immobilized on 
membranes or other solid supports by dot-blot or transfer from gels after electrophoresis. The 
5 presence of these specific sequences are visualized by methods such as autoradiography or 

fluorometric (Landegren et al., 1989, Science 242:229-37) or colorirnetric reactions (Gebeyehu et 
al., 1987, Nucleic Acids Res, 15:4513-34). The absence of hybridization would indicate a 
mutation in the particular region of the gene, or that the patient is not infected with RRV. 



10 revealed by the direct DNA sequencing method of Church and Gilbert (Proc. NaiL Acad. Sci. USA 
81:1991-5, 1988). Cloned DNA segments may be used as probes to detect specific DNA 
segments. The sensitivity of this method is greatly enhanced when combined with PCR (Wrichnik 
et aL, 1987, Nucleic Acids Res. 15:529-42; Wong et al., 1987, Nature 330:384-6; Stoflet et ah, 
1988, Science 239:491-4). In this approach, a sequencing primer which lies within the amplified 

15 sequence is used with double-stranded PCR product or single-stranded template generated by a 
modified PCR. The sequence determination is performed by conventional procedures with 
radiolabeled nucleotides or by automatic sequencing procedures with fluorescent tags. 

Sequence alterations may occasionally generate fortuitous restriction enzyme recognition 
sites or may eliminate existing restriction sites. Changes in restriction sites are revealed by the use 

20 of appropriate enzyme digestion followed by conventional gel-blot hybridization (Southern, 1975, 
J. Mol. Biol 98:503). DNA fragments carrying the site (either normal, mutant, or alternative) are 
detected by their reduction in size or increase of corresponding restriction fragment numbers. 
Genomic DNA samples may also be amplified by PCR prior to treatment with the appropriate 
restriction enzyme; fragments of different sizes are then visualized under UV light in the presence 

25 of ethidium bromide after gel electrophoresis. 

Screening based on DNA sequence differences may be achieved by detection of alteration 
in electrophoretic mobility of DNA fragments in gels with or without denaturing reagent. Small 
sequence deletions and insertions can be visualized by high-resolution gel electrophoresis. For 
example, a PCR product with small deletions is clearly distinguishable from a normal sequence on 

30 an 8% non-denaturing polyacrylamide gel (WO 91/10734; Nagamine et al., 1989, Am. J. Hum. 
Genet. 45:337-9). DNA fragments of different sequence compositions may be distinguished on 
denaturing formamide gradient gels in which the mobilities of different DNA fragments are 
retarded in the gel at different positions according to their specific "partial-melting" temperatures 
(Myers et al., 1985, Science 230: 1242). Alternatively, a method of detecting a mutation 

35 comprising a single base substitution or other small change could be based on differential primer 
length in a PCR. For example, an invariant primer could be used in addition to a primer specific 
for a mutation. The PCR products of the normal and mutant genes can then be differentially 



Sequence differences between disclosed and other forms of RRV genes may also be 
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detected in acrylamide gels. 

In addition to conventional gel-electrophoresis and blot-hybridization methods, DNA 
fragments may also be visualized by methods where the individual DNA samples are not 
immobilized on membranes. The probe and target sequences may be both in solution, or the probe 
5 sequence may be immobilized (Saiki et al., 1989, Proc. Nat. Acad. Sci. USA 86:6230-4). A 

variety of detection methods, such as autoradiography involving radioisotopes, direct detection of 
radioactive decay (in the presence or absence of scintillant), spectrophotometry involving 
calorigenic reactions and fluorometry involved fluorogenic reactions, may be used to identify 
specific individual genotypes. 

10 If more than one mutation or alternative sequence is frequently encountered in one or 

more RRV genes, a system capable of detecting such multiple mutations would be desirable. For 
example, a PCR with multiple, specific oligonucleotide primers and hybridization probes may be 
used to identify all possible mutations or alternative sequences at the same time (Chamberlain et 
al., 1988, NucL Acids Res. 16:1141-55). The procedure may involve immobilized sequence- 

15 specific oligonucleotides probes (Saiki et al., 1989, Proc. Nat. Acad. Sci. USA 86:6230-4). 

EXAMPLE 20 
Quantitation of RRV Proteins 

An alternative method of determining if a subject has been infected with RRV is to 

20 quantitate the level of one or more RRV proteins in the cells of a subject. This diagnostic tool 

would be useful for detecting the levels of RRV proteins which result from, for example, infection 
by RRV. These diagnostic methods, in addition to those described in EXAMPLE 19, provide an 
enhanced ability to diagnose susceptibility to diseases caused by RRV infection. 

The determination of RRV protein levels would be an alternative or supplemental 

25 approach to the direct determination of the presence of one or more RRV genes by the methods 
outlined above in EXAMPLE 19. The availability of antibodies specific to one or more of the 
RRV proteins (for example those described in Example 18) will facilitate the quantitation of 
cellular RRV proteins by one of a number of immunoassay methods which are well known in the 
art and are presented in Harlow and Lane ( Antibodies, A Laboratory Manual , Cold Spring Harbor 

30 Laboratory, New York. 1988). 

Such assays permit both the detection of RRV proteins in a biological sample and the 
quantitation of such proteins. Typical methods involve: providing a biological sample of the 
subject in which the sample contains cellular proteins, and providing an immunoassay for 
quantitating the level of RRV protein in the biological sample. This can be achieved by combining 

35 the biological sample with an RRV specific binding agent, such as an anti-RRV antibody (such as 
monoclonal or polyclonal antibodies), so that complexes form between the binding agent and the 
RRV protein present in the sample, and then detecting or quantitating such complexes. 
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In particular forms, these assays may be performed with (he RRV specific binding agent 
immobilized on a support surface, such as in the wells of a microtiter plate or on a column. The 
biological ^jnple is then introduced onto the support surface and allowed to interact with the 
specific binding agent so as to form complexes. Excess biological sample is then removed by 
5 washing, and the complexes are detected with a reagent, such as a second anti- RRV protein 
antibody that is conjugated with a detectable marker. 

In an alternative assay, the cellular proteins are isolated and subjected to SDS-PAGE 
followed by Western blotting, for example as described in Example 18. After resolving the 
proteins, the proteins are transferred to a membrane, which is probed with specific binding agents 

10 that recognize any of the RRV proteins. The proteins are detected, for example with HRP- 
conjugated secondary antibodies, and quantitated. 

In yet another assay, the level of one or more RRV proteins in cells is analyzed using 
microscopy. Using specific binding agents which recognize RRV, samples can be analyzed for the 
presence of one or more RRV proteins. For example, frozen biopsied tissue sections are thawed at 

15 room temperature and fixed with acetone at -200 °C for 5 minutes. Slides are washed twice in cold 
PBS for 5 minutes each, then air-dried. Sections are covered with 20-30 fd of antibody solution 
(15-45 ^g/ml) (diluted in PBS, 2% BSA at 15-50 jig/ml) and incubated at room temperature in 
humidified chamber for 30 minutes. Slides are washed three times with cold PBS 5 minutes each, 
allowed to air-dry briefly (5 minutes) before applying 20-30 fi\ of the second antibody solution 

20 (diluted in PBS, 2% BSA at 15-50 /xg/ml) and incubated at room temperature in humidified 
chamber for 30 minutes. The label on the second antibody may contain a fluorescent probe, 
enzyme, radiolabel, biotin, or other detectable marker. The slides are washed three times with 
cold PBS 5 minutes each then quickly dipped in distilled water, air-dried, and mounted with PBS 
containing 30% glycerol. Slides can be stored at 4°C prior to viewing. 

25 For samples prepared for electron microscopy (versus light microscopy), the second 

antibody is conjugated to gold particles. Tissue is fixed and embedded with epoxy plastics, then 
cut into very thin sections ( — 1-2 pm). The specimen is then applied to a metal grid, which is then 
incubated in the primary anti-RRV antibody, washed in a buffer containing BSA, then incubated in 
a secondary antibody conjugated to gold particles (usually 5-20 nm). These gold particles are 

30 visualized using electron microscopy methods. 

For the purposes of quantitating the RRV proteins, a biological sample of the subject, 
which sample includes cellular proteins, is required. Such a biological sample may be obtained 
from body cells, such as those present in which expression of the protein has been detected. The 
expression of RRV proteins in peripheral blood leukocytes is clearly the most accessible and 

35 convenient source from which specimens can be obtained. Specimens can be obtained from 
peripheral bloody urine, saliva, tissue biopsy, amniocentesis samples, surgical specimens, fine 
needle aspirates, and autopsy material, particularly cancer cells. Quantitation of RRV proteins 
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would be made by immunoassay and compared to levels of the protein found in non-RRV 
expressing cells or to the level of RRV proteins in non-RRV infected cells (cells of the same origin 
that are not infected). A significant (preferably 50% or greater) increase in the amount of one or 
more RRV proteins in the cells of a subject compared to the amount of one or more RRV proteins 
5 found in non-RRV infected cells or that found in normal cells, would be taken as an indication that 
the subject may have been infected with RRV. 

EXAMPLE 21 
Sequence Variants of RRV 

]0 The amino acid sequence of the RRV proteins which are encoded by the RRV cDNAs 

(even-numbered SEQ ID NOS 2-164), are shown in odd-numbered SEQ ID NOS 3-165. Having 
presented the nucleotide sequence of the RRV genome and cDNAs and the amino acid sequence of 
these proteins, this invention now also facilitates the creation of DNA molecules, and thereby 
proteins, which are derived from those disclosed but which vary in their precise nucleotide or 

15 amino acid sequence from those disclosed. Such variants may be obtained through a combination 
of standard molecular biology laboratory techniques and the nucleotide sequence information 
disclosed by this invention. 

Variant DNA molecules include those created by standard DNA mutagenesis techniques, 
for example, M13 primer mutagenesis. Details of these techniques are provided in Sambrook et 

20 aL (In: Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York, 1989, 

Chapter 15). By the use of such techniques, variants may be created which differ in minor ways 
from those disclosed. DNA molecules and nucleotide sequences which are derivatives of those 
specifically disclosed herein and which differ from those disclosed by the deletion, addition or 
substitution of nucleotides while still encoding a protein which possesses the functional 

25 characteristics of the RRV proteins are comprehended by this invention. Also within the scope of 
this invention are small DNA molecules which are derived from the disclosed DNA molecules. 
Such small DNA molecules include oligonucleotides suitable for use as hybridization probes or 
polymerase chain reaction (PCR) primers. As such, these small DNA molecules will comprise at 
least a segment of the RRV cDNA molecules or the RRV gene and, for the purposes of PCR, will 

30 comprise at least a 15 or a 20-50 nucleotide sequence of the RRV cDNAs (even-numbered SEQ ID 
Nos 2-164) or the RRV genes (i.e., at least 20-50 consecutive nucleotides of the RRV cDNA or 
gene sequences). DNA molecules and nucleotide sequences which are derived from the disclosed 
DNA molecules as described above may also be defined as DNA sequences which hybridize under 
stringent conditions to the DNA sequences disclosed, or fragments thereof. 
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Hybridization conditions resulting in particular degrees of stringency will vary depending 
upon the nature of the hybridization method of choice and the composition and length of the 
hybridizing DNA used. Generally, the temperature of hybridization and the ionic strength 
5 (especially the Na + concentration) of the hybridization buffer will determine the stringency of 
hybridization. Calculations regarding hybridization conditions required for attaining particular 
degrees of stringency are discussed by Sambrook et al. (In: Molecular Cloning: A Laboratory 
Manual , Cold Spring Harbor, New York, 1989 ch. 9 and 11), herein incorporated by reference. 
By way of illustration only, a hybridization experiment may be performed by hybridization of a 

10 DNA molecule (for example, a deviation of the RRV cDNA) to a target DNA molecule (for 

example, the RRV cDNA) which has been electrophoresed in an agarose gel and transferred to a 
nitrocellulose membrane by Southern blotting (Southern, 7. MoL BioL 98:503, 1975), a technique 
well known in the art and described in Sambrook et al. ( Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor, New York, 1989). Hybridization with a target probe labeled with 

15 [ 32 P]-dCTP is generally carried out in a solution of high ionic strength such as 6xSSC at a 
temperature that is 20-25 °C below the melting temperature, Tm, described below. For such 
Southern hybridization experiments where the target DNA molecule on the Southern blot contains 
10 ng of DNA or more, hybridization is typically carried out for 6-8 hours using 1-2 ng/ml 
radiolabeled probe (of specific activity equal to 10 9 CPM/^g or greater). Following hybridization, 

20 the nitrocellulose filter is washed to remove background hybridization. The washing conditions 
should be as stringent as possible to remove background hybridization but to retain a specific 
hybridization signal. The term Tm represents the temperature above which, under the prevailing 
ionic conditions, the radiolabeled probe molecule will not hybridize to its target DNA molecule. 
The Tm of such a hybrid molecule may be estimated from the following equation (Bolton and 

25 McCarthy, Proc. NatL Acad. Scz. USA 48:1390, 1962): Tm = 81.5°C - 16.6(logio[Na + ]) + 
0.41(%G + C) - 0.63(% formamide) - (600/1); where I = the length of the hybrid in base pairs. 

This equation is valid for concentrations of Na + in the range of 0.01 M to 0.4 M, and it is 
less accurate for calculations of T m in solutions of higher [Na + ]. The equation is also primarily 
valid for DNAs whose G + C content is in the range of 30% to 75%, and it applies to hybrids 

30 greater than 100 nucleotides in length (the behavior of oligonucleotide probes is described in detail 
in Ch. 1 1 of Sambrook et al. ( Molecular Cloning: A Laboratory Manual , Cold Spring Harbor, 
New York, 1989). 

Thus, by way of example, for a 150 base pair DNA probe derived from the open reading 
frame of the RRV cDNA (with a hypothetical %GC = 45%), a calculation of hybridization 
35 conditions required to give particular stringencies may be made as follows: For this example, it is 
assumed that the filter will be washed in 0.3 xSSC solution following hybridization, thereby: 
[Na + ] = 0.045M; %GC = 45%; Formamide concentration = 0; 1 = 150 base pairs; Tm = 81.5 - 
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16.6(logio[Na + ]) + (0.41 x 45) - (600/150); and so Tm = 74.4°C. 

The Tm of double-stranded DNA decreases by 1-1. 5°C with every 1 % decrease in 
homology (Bonner et al. t -/. MoL Biol. 81 : 123, 1973). Therefore, for this given example, 
washing the filter in 0.3 xSSC at 59.4-64.4°C will produce a stringency of hybridization 
5 equivalent to 90%; that is, DNA molecules with more than 10% sequence variation relative to the 
target RRV cDNA will not hybridize. Alternatively, washing the hybridized filter in 0.3 xSSC at 
a temperature of 65.4-68.4°C will yield a hybridization stringency of 94%; that is, DNA 
molecules with more than 6% sequence variation relative to the target RRV cDNA molecule will 
not hybridize. The above example is given entirely by way of theoretical illustration. One skilled 

10 in the art will appreciate that other hybridization techniques may be utilized and that variations in 
experimental conditions will necessitate alternative calculations for stringency. 

In particular embodiments of the present invention, stringent conditions may be defined as 
those under which DNA molecules with more than 25%, 15%, 10%, 6% or 2% sequence 
variation (also termed "mismatch'*) will not hybridize. 

15 The degeneracy of the genetic code further widens the scope of the present invention as it 

enables major variations in the nucleotide sequence of a DNA molecule while maintaining the 
amino acid sequence of the encoded protein. For example, the eleventh amino acid residue of the 
RRV MIP protein is alanine (SEQ ID NO 25). This is encoded in the RRV cDNA by the 
nucleotide codon triplet GCG. Because of the degeneracy of the genetic code, three other 

20 nucleotide codon triplets, GCT, GCA and GCC, also code for alanine. Thus, the nucleotide 

sequence of the RRV DNA could be changed at this position to any of these three codons without 
affecting the amino acid composition of the encoded protein or the characteristics of the protein. 
Based upon the degeneracy of the genetic code, variant DNA molecules may be derived from the 
DNA molecules disclosed herein using standard DNA mutagenesis techniques as described above, 

25 or by synthesis of DNA sequences. DNA sequences which do not hybridize under stringent 
conditions to the DNA sequences disclosed by virtue of sequence variation based on the 
degeneracy of the genetic code are herein also comprehended by this invention. 

The invention also includes DNA sequences that are substantially identical to any of the 
DNA sequences disclosed herein, where substantially identical means a sequence that has identical 

30 nucleotides in at least 75%, 80%, 85%, 90%, 95%, 98%, or even 99% of the aligned sequences. 

One skilled in the art will recognize that the DNA mutagenesis techniques described above 
may be used not only to produce variant DNA molecules, but will also facilitate the production of 
proteins which differ in certain structural aspects from the RRV proteins, yet which proteins arc 
clearly derivative of this protein and which maintain the essential characteristics of the RRV 

35 proteins. Newly derived proteins may also be selected in order to obtain variations on the 
characteristic of the RRV proteins, as described above. Such derivatives include those with 
variations in amino acid sequence including minor deletions, additions and substitutions. 
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While the site for introducing an amino acid sequence variation is predetermined, the 
mutation per se need not be predetermined. For example, in order to optimize the performance of 
a mutation at a given site, random mutagenesis may be conducted at the target codon or region and 
the expressed protein variants screened for the optimal combination of desired activity. 
5 Techniques for making substitution mutations at predetermined sites in DNA having a known 
sequence as described above are well known. 

Amino acid substitutions are typically of single residues; insertions usually will be on the 
order of about from 1 to 10 amino acid residues; and deletions will range about from 1 to 30 
residues. Deletions or insertions preferably are made in adjacent pairs, i.e., a deletion of two 
10 residues or insertion of two residues. Substitutions, deletions, insertions or any combination 

thereof may be combined to arrive at a final construct. Obviously, the mutations that are made in 
the DNA encoding the protein must not place the sequence out of reading frame and preferably 
will not create complementary regions that could produce secondary mRNA structure. 

Substitutional variants are those in which at least one residue in the amino acid sequence 
15 has been removed and a different residue inserted in its place. Such substitutions generally are 
made conservatively, as defined above. 

The effects of these amino acid substitutions or deletions or additions may be assessed for 
derivatives of the RRV proteins by assays in which DNA molecules encoding the derivative 
proteins are transfected into cells using routine procedures. These RRV proteins are expressed 
20 recombinantly (for example see Example 17), purified, and analyzed for their ability to cause 
symptoms associated with RRV infection, for example KS-like symptoms in rhesus macaque 
monkeys, as described in Examples 13 and 23. 

EXAMPLE 22 

25 Cloning RRV in Other Species 

Having presented the genomic (SEQ ID NO 1) and cDNA nucleotide sequences of the 
rhesus macaque RRV (even-numbered SEQ ID Nos 2-164) and the amino acid sequence of the 
encoded proteins (odd-numbered SEQ ID Nos 3-165), this invention now also facilitates the 
identification of DNA molecules, and thereby proteins, which are the RRV homologs in other 

30 species. These other homologs can be derived from those sequences disclosed, but which vary in 
their precise nucleotide or amino acid sequence from those disclosed. Such variants may be 
obtained through a combination of standard molecular biology laboratory techniques and the 
nucleotide and amino acid sequence information disclosed by this invention. 

The Japanese macaque RRV isolate was isolated from a lesion that was minced and co- 

35 cultured with primary rhesus fibroblasts. The isolate was then cloned by limiting dilution and a 
stock of virus generated from this clone. Total cellular DNA was harvested from virus infected 
cells and the DNA subjected to degenerate PGR for viral DNA polymerase, exactly as described 
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above for RRV. Once confirmed, a cosmid library of this virus was made from purified viral 
DNA (as described for RRV) and then a portion of the protein genes was cloned and sequenced. 
Results for this analysis are shown in the following Table 1 ; 

TABLE 1 

5 RRV Sequences from Japanese Macaque 

Total number of amino acid residues inferred: 972 
Number of differences compared to RRV: 29 
Percent identity: 97.02% 

10 

Japanese Macaque Data 

These are end sequences. For ORFs represented twice, section I is from one plasm id, section II is 
from another plasmid. These are non-overlapping sections. 

15 Orf 7 section I 

GLFNSIDDTINALSRDCSVTFFQQANYTNVMRKQNELFTRLNSILCQGSAGSQKPATPSEPRT 
ATVAATAASDVIKDAQYRKEQYMKKVARDGFKKLTECLQTQSAVLANALCMRVWGGVA 
YGEASELVNHFLLRRRFVALPWEARCRSNQILFENSKYIKNSLYSQRLSREHVEIITLQFYGLI 
TGPLTRQSDLFPGPANVVLAQCFEAAGMLPHHKMLVSEMIW 

20 

Orf 7 section II 

PIESLFCGGLFNSIDDTINALSRDCSVTFFQQANYTNXMRKQNELFTRLNSILCQGSAGSXKP 
ATPSEPRTATVXATAASDVIKDAQYRXEQYMKKVARDXFKKLTECLQTQSAVLANALCMR 
RMGGRRI 

25 

Orf 8 

YRKVATSVTVYRGWTETAVTGKQEVIRPVPQYEINHMDTTYQCFSSMRVNVNGIENTYTD 
RDFTNQTVFLQPVEGLTDNIQRYFSQPVLYTTPGWFPGIYRVRTTVNCEIVDMIARSAEPYS 
YFVTALGDTVEVSPFCLNDSTCSVADKAENGLGVRVLTNYTIVDFATRTPTTETRVFADSGE 
30 YTVSWKAEDPKSAVCALTLWKTFPRAIQTTHESQLPLCGQRR 

Orf 9 section I 

VPSRFQTDIIPSGTVLKLLGRTENGTSVCVNVFRQQVYFYAKVPAGVNVTHVLQQALKNTA 
GRAACGFSTRRVTKKILKTYDVAEHPVTEITLSSGSMLSTLSDRLVACGCEVFESNVDAVRR 
35 FVLDHGFTTFGWYSCARATPRLAXRDARTALEFDCSWEDLSV 

Orf 9 section II 

MDFFNPYLGPRGPRPPSHKCTDAPAPAGAVQPPPDVCRLIPACLRTPGAGGMIPVTIPFPPTY 
FENGARGDVLLAHERSMWTARGQRPVVPDPQDQSITFHAYDVVETTYAADRCAEV 

40 

Orf 10 

AQMKIIYAPGDPNAEIVLGQSGPVLPTHTGGRVLGVYADAEKTIQPGSSAEVRVQLIFPTGSA 
ARGDLAFLVAGVAPEPLFIVTPTLLLSGCTTHLRLFNPNGT 

45 Orf 29b 

NVAVEGNSSQDAGVAIATVLNEICSVPLSFLHHADKNTLIRSPIYMLGPEKAKAFESFIYALN 
SGTFSASQTVVSHTIKLSFDPVAYLIDQIKAIRCIPLKDGGHTYCAKQKTMSDDVLVATVMA 
HYMATNDKFVFKSLE 



50 

EXAMPLE 23 
Screening Assays for Pharmaceutical Agents of Interest 
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The present disclosure provides a virus that is involved in the causation or progression of 
certain diseases, such as KS, and therefore provides an animal model and assays directed to 
identifying potential pharmaceutical agents that inhibit the biological activity of the virus. Drug 
5 screening assays which determine whether or not a drug has activity against the virus can include 
incubating a compound to be evaluated for use in treatment of the condition with cells which 
express the RRV proteins or peptides, and determining the effect of the compound on the activity 
of the virus. In vitro assays in which the virus is maintained in suitable cell culture are preferred, 
though in vivo animal models would also be effective. 

10 In vitro assays include infecting cells such as rhesus fibroblasts, peripheral blood 

leukocytes or susceptible T cell lines such as MT-4 with the agent of interest in the presence of 
varying concentrations of compounds targeted against viral replication, including nucleoside 
analogs, chain terminators, antisense oligonucleotides and random polypeptides. (Asada et al., J. 
Clin. Microbiol. 27:2204, 1989; Kikuta et aL, Lancet 7:861, 1989). Infected cultures and their 

15 supernatants can be assayed for the total amount of virus, including the presence of the viral 
genome, by quantitative PCR, by dot blot assays, or by using immunologic methods. For 
example, a culture of susceptible cells could be infected with the RRV in the presence of various 
concentrations of drug, fixed on slides after a period of days, and examined for viral antigen by 
indirect immunofluorescence with monoclonal antibodies to viral polypeptides (Kikuta et al, 

20 supra). Alternatively, chemically adhered MT-4 cell monolayers can be used for an infectious 
agent assay using indirect immunofluorescent antibody staining to search for focus reduction 
(Higashi, J. Clin. Microbiol 27:2204, 1989, incorporated by reference). 

As an alternative to whole cell in vitro assays, purified enzymes isolated from the RRV 
can be used as targets for rational drug design to determine the effect of the potential drug on 

25 enzyme activity, such as thymidylate sunthase or DNA polymerase. The genes for these two 

enzymes are provided herein. A measure of enzyme activity indicates an effect on the infectious 
agent itself. Drug screens using herpes viral products are known and have been previously 
described in EP 0514830 (herpes proteases) and WO 94/04920 (Ul 13 gene product). 



30 anti-KS chemotherapeutics. Infected cells can be incubated in the presence of a chemical agent 
that is a potential chemotherapeutic against KS (e.g. acyclo-guanosinc). The level of virus in the 
cells is then determined after several days by IFA for antigens or Southern blotting for viral 
genome or Northern blotting for mRNA and compared to control cells. This assay can quickly 
screen large numbers of chemical compounds that may be useful against KS. This invention also 

35 provides an assay system that is employed to identify drugs or other molecules capable of binding 
to the DNA molecule or proteins, either in the cytoplasm or in the nucleus, thereby inhibiting or 
potentiating transcriptional activity. This assay would be useful in die development of drugs that 



In particular embodiments, this invention provides an assay for screening 
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are specific against particular cellular activity, or that would potentiate such activity, in time or in 
level of activity. Also included are drugs identified by this assay which have an anti-viral activity, 
and an eftfe^t against conditions associated with RRV infection, such as KS. 



Animal models are useful for resolving a number of fundamental problems of infectious 
diseases that include, but are not limited to, determinants of virulence of the organism, 
mechanisms of host resistance, mechanisms of pathogenicity, establishment and regulation of 

10 chronic infection, and antimicrobial and chemotherapeutic actions of drugs on infectious agents. 
Variables that are commonly manipulated to address fundamental problems include, but are not 
limited to, the strain of infectious agent, the infecting dose of infectious agent and the route of 
administration of the infectious agent, the species or subspecies of animal, the age of animal, and 
the genetic background of the animal (Viral pathogenesis, N. Nathanson, Lippincot-Raven, 

15 Philadelphia, 1997). 

In an embodiment in which one or more RRV strains are employed for generating an 
animal model, the RRV used may be naturally occurring variant isolates recovered from rhesus 
macaques and other non-human primate species, molecular clones generated from these naturally 
occurring variant isolates and recombinant viruses with introduced mutations, deletions or 

20 recombined genomes designed to address function of specific genes. 

By manipulating the infecting dose and route of RRV administration virus-host 
interactions dependent upon dose and tissue or organ-specific disease manifestations can be 
explored. Thus, the present invention includes various doses of RRV administered by oral, 
inhalation, intratracheal, intravaginal, intrarectal and parenteral routes including, but not limited to 

25 intravenous, intraarterial, intradermal, subcutaneous, intramuscular, intraperitoneal and organ- 
specific administration routes such and intracerebral and intraocular administration. 

Many disease manifestations with a given infections agent are highly influenced by age 
and species or subspecies of the host and the particular genetic makeup of the host. The present 
disclosure provides a virus that is involved in the causation or progression of certain diseases, such 

30 as KS, in the rhesus macaque, but is also useful for the study of and discovery of disease 
manifestations that are host species, age and genetic background dependent. In particular 
embodiments, one skilled in the art may vary the species of animal to which the RRV is 
administered to produce or discover a particular disease manifestation, or similarly vary the 
genetic background of the animal to produce or discover a particular disease manifestation, even 

35 including the use of genetically engineered animals. 



5 



EXAMPLE 24 



Generating Animal Models 



EXAMPLE 25 
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Pharmaceutical Compositions and Modes of Administration 
Various delivery systems for administering pharmaceutical proteins from the RRV include 
encapsulation in liposomes, microparticles, microcapsules, expression by recombinant cells, 
receptor-mediated endocytosis (see Wu and Wu, 7. Biol. Chem. 1987, 262:4429-32), and 
5 construction of a therapeutic nucleic acid (such as an anti-sense molecule) as part of a retroviral or 
other vector. Methods of introduction include, but are not limited to, intradermal, intramuscular, 
intraperitoneal, intravenous, subcutaneous, intranasal, and oral routes. The compounds may be 
administered by any convenient route, for example by infusion or bolus injection, by absorption 
through epithelial or mucocutaneous linings (e.g., oral mucosa, rectal and intestinal mucosa, etc.) 

10 and may be administered together with other biologically active agents. Administration can be 

systemic or local. In addition, the pharmaceutical compositions may be introduced into the central 
nervous system by any suitable route, including intraventricular and intrathecal injection; 
intraventricular injection may be facilitated by an intraventricular catheter, for example, attached 
to a reservoir, such as an Ommaya reservoir. 

15 The use of liposomes as a delivery vehicle is another delivery method of the present 

invention. The liposomes fuse with the target site and deliver the contents of the lumen 
intracellularly. The liposomes are maintained in contact with the target cells for a sufficient time 
for fusion to occur, using various means to maintain contact, such as isolation and binding agents. 
Liposomes may be prepared with purified proteins or peptides that mediate fusion of membranes, 

20 such as Sendai virus or influenza virus. The lipids may be any useful combination of known 

liposome forming lipids, including cationic lipids, such as phosphatidylcholine. Other potential 
lipids include neutral lipids, such as cholesterol, phosphatidyl serine, phosphatidyl glycerol, and 
the like. For preparing the liposomes, the procedure described by Kato et al. {J. Biol. Chem. 
1991, 266:3361) may be used. 

25 The present invention also provides pharmaceutical compositions which include a 

therapeutically effective amount of one or more RRV proteins or DNA, alone or with a 
pharmaceutially acceptable carrier. 

The pharmaceutical compositions or methods of treatment may be administered in 
combination with other therapeutic treatments, such as other antineoplastic or antirumorigenic 

30 therapies. 

Administration of Nucleic Acid Molecules 

In an embodiment in which one or more RRV nucleic acids are employed for generating 
an animal model, the analog may be delivered intracellularly (e.g., by expression from a nucleic 
35 acid vector or by receptor-mediated mechanisms). In a specific embodiment where the therapeutic 
molecule is a nucleic acid, administration may be achieved by an appropriate nucleic acid 
expression vector which is administered so that it becomes intracellular, e.g., by use of a retroviral 
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vector (see U.S. Patent No. 4,980,286), or by direct injection, or by use of microparticle 
bombardment (e.g., a gene gun; Biolistic, Dupont), or coating with lipids or cell-surface receptors 
or transfecting agents, or by administering it in linkage to a homeobox-like peptide which is 
known to enter the nucleus (see e.g., Joliot et aL, Proc. Natl. Acad. ScL USA 1991, 88:1864-8). 
5 Alternatively, the nucleic acid can be introduced intracellularly and incorporated within host cell 
DNA for expression, by homologous recombination. 

The vector pCDNA, is an example of a method of introducing the foreign cDNA into a 
cell under the control of a strong viral promoter (CMV) to drive the expression. However, other 
vectors can be used. Other retroviral vectors (such as pRETRO-ON, Clontech), also use this 

10 promoter but have the advantages of entering cells without any transfection aid, integrating into the 
genome of target cells ONLY when the target cell is dividing (as cancer cells do, especially during 
first remissions after chemotherapy) and they are regulated. It is also possible to turn on the 
expression of the RRV nucleic acid by administering tetracycline when these plasmids are used. 
Hence these plasmids can be allowed to transfect the cells, then administer a course of tetracycline 

15 with a course of chemotherapy to achieve better cytotoxicity. 

Other plasmid vectors, such as pMAM-neo (also from Clontech) or pMSG (Pharmacia) 
use the MMTV-LTR promoter (which can be regulated with steroids) or the SV10 late promoter 
(pSVL, Pharmacia) or metallothionein - responsive promoter (pBPV, Pharmacia) and other viral 
vectors, including retroviruses. Examples of other viral vectors include adenovirus, AAV (adeno- 

20 associated virus), recombinant HSV, poxviruses (vaccinia) and recombinant lentivirus (such as 
HIV). All these vectors achieve the basic goal of delivering into the target cell the cDNA 
sequence and control elements needed for transcription. The present invention includes all forms 
of nucleic acid delivery, including synthetic oligos, naked DNA, plasmid and viral, integrated into 
the genome or not. 

25 Also contemplated are inhibitory nucleic acid therapeutics which can inhibit the activity of 

RRV, for example in subject with KS or other diseases associated with RRV infection. Inhibitory 
nucleic acids may be single-stranded nucleic acids, which can specifically bind to a complementary 
nucleic acid sequence. By binding to the appropriate target sequence, an RNA-RNA, a DNA- 
DNA, or RNA-DNA duplex or triplex is formed. These nucleic acids are often termed "antisense" 

30 because they are usually complementary to the sense or coding strand of the gene, although 
recently approaches for use of "sense" nucleic acids have also been developed. The term 
"inhibitory nucleic acids" as used herein, refers to both "sense" and "antisense" nucleic acids. 

By binding to the target nucleic acid, the inhibitory nucleic acid can inhibit the function of 
the target nucleic acid. This could, for example, be a result of blocking DNA transcription, 

35 processing or poly(A) addition to mRNA, DNA replication, translation, or promoting inhibitory 
mechanisms of the cells, such as promoting RNA degradation. Inhibitory nucleic acid methods 
therefore encompass a number of different approaches to altering expression of RRV genes. 
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Cleavage, and therefore inactivation, of the target nucleic acids may be effected by attaching a 
substituent to the inhibitory nucleic acid which can be activated to induce cleavage reactions. The 
substituent can be one that affects either chemical, or enzymatic cleavage. Alternatively, cleavage 
can be induced by the use of ribozymes or catalytic RNA. In this approach, the inhibitory nucleic 
5 acids would include either naturally occurring RNA (ribozymes) or synthetic nucleic acids with 
catalytic activity. 

The inhibitory nucleic acid therapies can be used to target nucleic acids to sequences of 
RRV for use in treating conditions caused by the RRV, or proteins of the RRV, for example for 
treating KS or KS-like syndromes. 



Administration of Antibodies 

Therapeutic, intravenous, polyclonal or monoclonal antibodies has been used as a mode of 
passive immunotherapy of herpesviral diseases, such as infection with CMV. Immune globulin 

15 from subjects previously infected with the RRV and bearing a suitably high titer of antibodies 
against the virus can be given in combination with antiviral agents (e.g. ganciclovir), or in 
combination with other modes of immunotherapy that are currently being evaluated for the 
treatment of KS, which are targeted to modulating the immune response (i.e. treatment with 
copolymer-1, antiidiotypic monoclonal antibodies, T cell "vaccination"). Antibodies specific for 

20 an epitope expressed on cells infected with the RRV are preferred and can be obtained as described 
above. 

The present invention also provides pharmaceutical compositions which include a 
therapeutically effective amount of the antibody, and a pharmaceutically acceptable carrier or 
excipient. 

25 EXAMPLE 26 

Vaccines 

This invention provides substances suitable for use as vaccines for the prevention of 
diseases associated with RRV infection, such as KS, and methods for administering them. The 
vaccines are directed against RRV, and may include antigens obtained from RRV. In one 

30 embodiment, the vaccine contains attenuated RRV. In another embodiment, the vaccine contains 
killed RRV. In another embodiment, the vaccine contains a nucleic acid vector encoding RRV, or 
a surface protein, such as a capsid protein. In another embodiment, the vaccine is a subunit 
vaccine containing an RRV subunit, such as glycoprotein B, major capsid protein, or other gene 
products found to elicit appropriate humoral and/or cell mediated immune responses. 

35 This invention also provides a method of vaccinating a subject against Kaposi's sarcoma 

and lymphoproliferative disorders, comprising administering to a susceptible subject an effective 
amount of the peptide or polypeptide encoded by an isolated DNA molecule encoding a 
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polypeptide or combination of polypeptides expressed by the DNA molecule, and a suitable 
acceptable carrier. In one embodiment, naked DNA is administered to the subject in an effective 
amount to vaccinate the subject against Kaposi's sarcoma and lymphoproliferative disorders, or 
other disease associated with RRV infection. 
5 The vaccine can be made using synthetic peptide or recombinantly-produced polypeptide 

described above as antigen. Typically, a vaccine will include from about 1 to 50 micrograms of 
antigen, for example from about 15 to about 45 micrograms. Typically, the vaccine is formulated 
so that a dose includes about 0.5 milliliters. The vaccine may be administered by any route known 
in the art, for example parenteral, subcutaneous or intramuscular. 

10 There are a number of strategies for amplifying an antigen*s effectiveness, particularly as 

related to the art of vaccines. For example, cyclization of a peptide can increase the peptide's 
antigenic and immunogenic potency. See U.S. Pat. No. 5,001,049. More conventionally, an 
antigen can be conjugated to a suitable carrier, usually a protein molecule. This procedure can 
allow multiple copies of an antigen, such as a peptide, to be conjugated to a single larger carrier 

15 molecule. Additionally, the carrier may possess properties which facilitate transport, binding, 
absorption or transfer of the antigen. 

For parenteral administration, such as subcutaneous injection, examples of suitable 
carriers are the tetanus toxoid, the diphtheria toxoid, serum albumin and lamprey, or keyhole 
limpet, hemocyanin because they provide the resultant conjugate with minimum genetic restriction. 

20 Conjugates including these universal carriers can function as T cell clone activators in individuals 
having very different gene sets. The conjugation between a peptide and a carrier can be 
accomplished using one of the methods known in the art. Specifically, the conjugation can use 
Afunctional cross-linkers as binding agents as detailed, for example, by Means and Feeney, "A 
recent review of protein modification techniques," Bioconjugate Chem. 1:2-12 (1990). 

25 Vaccines against RRV can be made from the RRV envelope glycoproteins. These 

proteins can be purified and used for vaccination (Lasky, L. A., 1990, J. Med. Virol. 31:59). 
MHC-binding peptides from cells infected with the human herpesvirus can be identified for 
vaccine candidates per the methodology of Marloes, et al., 1991, Eur. J. Immunol. 21:2963-2970. 
The RRV antigen may be combined or mixed with various solutions and other compounds as is 

30 known in the art. For example, it may be administered in water, saline or buffered vehicles with 
or without various adjuvants or immunodiluting agents. Examples of such adjuvants or agents 
include aluminum hydroxide, aluminum phosphate, aluminum potassium sulfate (alum), beryllium 
sulfate, silica, kaolin, carbon, water-in-oil emulsions, oil-in-water emulsions, muramyl dipeptide, 
bacterial endotoxin, lipid X, Corynebacterium parvum (Propionibactcrium acnes), Bordetclla 

35 pertussis, polyribonucleotides, sodium alginate, lanolin, lysolecithin, vitamin A, saponin, 

liposomes, levamisole, DEAE-dextran, blocked copolymers or other synthetic adjuvants. Such 
adjuvants are available commercially from various sources, for example, Merck Adjuvant 65 
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(Merck and Company, Inc., Rahway, N.L) or Freund's Incomplete Adjuvant and Complete 
Adjuvant (Difco Laboratories, Detroit, Mich.). Other suitable adjuvants are Amphigen (oil-in- 
water), Alhydrogel (aluminum hydroxide), or a mixture of Amphigen and Alhydrogel. Only 
aluminum is approved for human use. 
5 The proportion of antigen and adjuvant can be varied over a broad range so long as both 

are present in effective amounts. For example, aluminum hydroxide can be present in an amount 
of about 0.5% of the vaccine mixture (AI2O3 basis). On a per-dose basis, the amount of the 
antigen can range from about 0.1 ug to about 100 ug protein per subject, for example about 1 ug 
to about 50 ug per dose, or about 15 ug to about 45 ug. A suitable dose size is about 0.5 ml. 
10 Accordingly, a dose for intramuscular injection, for example, would comprise 0.5 ml containing 
45 ug of antigen in admixture with 0.5% aluminum hydroxide. After formulation, the vaccine 
may be incorporated into a sterile container which is men sealed and stored at a low temperature, 
for example 4°C, or it may be freeze-dried. Lyophilization permits long- term storage in a 
stabilized form. 

15 The vaccines may be administered by any conventional method for the administration of 

vaccines including oral and parenteral (e.g., subcutaneous or intramuscular) injection. 
Intramuscular administration is preferred. The treatment may consist of a single dose of vaccine 
or a plurality of doses over a period of time. Also, the antigen could be a component of a 
recombinant vaccine which could be adaptable for oral administration. Vaccines of the invention 

20 may be combined with other vaccines for other diseases to produce multivalent vaccines. A 
pharmaceutically effective amount of the antigen can be employed with a pharmaceutical ly 
acceptable carrier such as a protein or diluent useful for the vaccination of mammals, particularly 
humans. Other vaccines may be prepared according to methods well-known to those skilled in the 
art. 

25 Those of skill will readily recognize that it is only necessary to expose a mammal to 

appropriate epitopes in order to elicit effective immunoprotection. The epitopes are typically 
segments of amino acids which are a small portion of the whole protein. Using recombinant 
genetics, it is routine to alter a natural protein's primary structure to create derivatives embracing 
epitopes that are identical to or substantially the same as (immunologically equivalent to) the 

30 naturally occurring epitopes. Such derivatives may include peptide fragments, amino acid 

substitutions, amino acid deletions and amino acid additions of the amino acid sequence for the 
viral polypeptides from the human herpesvirus. For example, it is known in the protein art that 
certain amino acid residues can be substituted with amino acids of similar size and polarity without 
an undue effect upon the biological activity of the protein. The human herpesvirus proteins have 

35 significant tertiary structure and the epitopes arc usually conformational. Thus, modifications 
should generally preserve conformation to produce a protective immune response. 





WO 00/28040 



PCT/US99/26260 



-64- 



EXAMPLE 27 



Peptide Synthesis and Purification 



The peptides provided by the present invention can be chemically synthesized by any 
of a number of manual or automated methods of synthesis known in the art. For example, solid 
5 phase peptide synthesis (SPPS) is carried out on a 0.25 millimole (mmole) scale using an Applied 
Biosystems Model 431 A Peptide Synthesizer and using 9-fluorenylmethyloxycarbonyl (Fmoc) 
amino-terminus protection, coupling with dicyclohexylcarbodiimide/ hydroxy benzotriazole or 2- 
(lH-benzo-triazol-l-yl)-l, 1 ,3,3-tetramethyluronium hexafluorophosphate/ hydroxy benzotriazole 
(HBTU/HOBT), and using p-hydroxymethyiphenoxymethylpolystyrene (HMP) or Sasrin resin for 
10 carboxyl-terminus acids or Rink amide resin for carboxyl-terminus amides. 

Fmoc-derivatized amino acids are prepared from the appropriate precursor amino 
acids by tritylation and triphenylmethanol in trifluoroacetic acid, followed by Fmoc derivitization 
as described by Atherton et al. (Solid Phase Peptide Synthesis , IRL Press: Oxford, 1989). 
Sasrin resin-bound peptides are cleaved using a solution of 1 % TFA in 
15 dichloromethane to yield the protected peptide. Where appropriate, protected peptide precursors 
are cyclized between the amino- and carboxyl-termini by reaction of the amino-terminal free amine 
and carboxyl-terminal free acid using diphenylphosphorylazide in nascent peptides wherein the 
amino acid sidechains are protected. 

HMP or Rink amide resin-bound products are routinely cleaved and protected 
20 sidechain-containing cyclized peptides deprotected using a solution comprised of trifluoroacetic 

acid (TFA), optionally also comprising water, thioanisole, and ethanedithiol, in ratios of 100 : 5 : 
5 : 2.5, for 0.5 - 3 hours at room temperature. 

Crude peptides are purified by preparative high pressure liquid chromatography 
(HPLC), for example using a Waters Delta-Pak C18 column and gradient elution with 0. 1 % TFA 
25 in water modified with acetonitrile. After column elution, acetonitrile is evaporated from the 
eluted fractions, which are then lyophilized. The identity of each product so produced and 
purified may be confirmed by fast atom bombardment mass spectroscopy (FABMS) or 
electrospray mass spectroscopy (ESMS). 

Having illustrated and described the principles of cloning the RRV genome, cDNA, 
30 proteins encoded by the cDNA, and modes of use of these biological molecules, it should be 
apparent to one skilled in the art that the invention can be modified in arrangement and detail 
without departing from such principles. In view of the many possible embodiments to which the 
principles of our invention may be applied, it should be recognized that the illustrated 
embodiments are only examples of the invention and should not be taken as a limitation on the 
35 scope of the invention. Rather, the scope of the invention is in accord with the following claims. 
We therefore claim as our invention all that comes within the scope and spirit of these claims. 




25 



35 



> • • • • . 

• • • • • • 



► • • • • 



65 




CLAIMS AS AMENDED UNDER ARTICLE 34 

An isolated vims (RRV) as deposited with ATCC as deposit accession 



A purified virus, having a nucleic acid sequence 
(a) shown in SEQ ID NO 1 or 
10 (b) a conservative variant thereof. 



3. The purified virus of claim 2, wherein the nucleic acid sequence has at 
least 95 % sequence identity to the nucleic acid sequence shown in SEQ ID NO 1 . 

„ ,. 

claim 2. 



5. A purified protein ofolaim 4, wherein the protein comprises an amino 
acid sequence selected from the group consisting of: 
20 (a) an amino acid sequence shown in odd numbered sequences of 

SEQ ID NOS. 3-165; and \ 

(b) amino acid sequences^that differ from those specified in (a) by 

one or more conservative amino acid substitutions wherein the function 

\ 



of the protein is preserved. 

\ 

6. A purified protein with an amino acid sequence that is at least 95% 
sequence identity to the sequences specified in claim 5(a) or\(b) 



7. The purified protein of claim 5, wherein the amino acid sequence is 

30 selected from odd numbered sequences within the group consisting of^SEQ ID NOS 3-19 
and 23-165. 




8. An isolated nucleic acid molecule encoding a protein according to claim 

5, 



9. An isolated nucleic acid molecule according to claim 8, wherein th 

molecule comprises a sequence selected from the group consisting of even numbered 
sequences of SEQ ID NOS 2-164. 
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10. The isolated nucleic acid molecule according to claim 9, wherein the 
mole v uile comprises a sequence selected from the group consisting of even numbered 
sequences of SEQ ID NOS 2-18 and 22-164. 



A recombinant nucleic acid molecule comprising a promoter sequence 




operably linked to a nucleic acid molecule according to claim 8, 




10 



15 



20 



25 



30 



12. A cell transformed with a recombinant nucleic acid molecule according 
to claim 8. \ 



35 



13. A non-human mammal purposefully infected with the virus of claim 2. 

\ 

14. The mammal of claim 13, wherein the mammal is a primate. 



\ 



\ 

15. An oligonucleotide comprising a sequence selected from the group 



consisting of: 



(a) at least 20 contiguous nucleotides of the nucleic acid sequence of 

\ 

the virus of claim 2; \^ 

(b) at least 30 contiguous nucleo tides of the nucleic acid sequence of 
the virus of claim 2; and 



\ 

(c) at least 50 contiguous nucleotides of the nucleic acid sequence of 



the virus of claim 2- 



\ 



16. An isolated nucleic acid molecule that: 



\ 



(a) hybridizes under stringent conditions with^a nucleic acid probe 
comprising the sequence of claim 15; and 

(b) encodes a protein of claim 6. 




17. An isolated nucleic acid molecule encoding a protein of claim\6. 



18. An isolated nucleic acid molecule encoding all proteins encoded by^he 
virus of claim 2, and having a biological activity of an RRV virus. 

19. A method for testing the efficacy of a drug in the treatment of a 
condition associated with the virus of claim 2. the method comprising: 

(a) administering the drug to a non-human primate infected with the virus 

of claim 2; and 
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(b) observing the primate to determine if the drug prevents or reduces the 
presentation of one or more symptoms associated with viral infection. 

20. The method of claim 19, wherein the primate is immunocompromised. 

21. The method of claim 20, wherein the drug is for the treatment of 
aposi's sarcoma and lymphoproliferative disorders. 

22. The method of claim 20, wherein the primate is immuno-compromised 
10 as a result of infection by Simian Immunodeficiency Virus (SIV). 

23. \ The method of claim 19, wherein the condition associated with infection 
with the virus of claim 2 is one or more of B-cell hyperplasia, lymphadenopathy, 
splenomegaly, hypergammaglobinulinemia or autoimmune hemolytic anemia. 




24. The method of claim 19, wherem the non-human primate is a Rhesus 

\ 

macaque monkey. 




25. A method for producing a non-human primate model for testing potential 
20 treatments for a condition associated an\mfection with the virus of claim 2, comprising 

(a) administering a treatment to the primate to render the primate 
immunocompromised; and \ 

(b) infecting the primate with the virus of claim 2. 



25 26 ■ The method of claim 25, wherein theXcondition is Kaposi's sarcoma and 

lymphoproliferative disorders. ^- 



27. The method of claim 25 wherein the treatmentuised to render the primate 
immuno-compromised is infection with SIV. 



28. The method of claim 25 wherein the non-human primate is a Rhesus 
macaque monkey. 




29. A method for testing the efficacy of a candidate vaccine againsrthe virus 
35 of claim 2, or conditions associated infection with virus of claim 2, the method 
comprising: 
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^ virus of claim 2; 




(a) administering the vaccine to a subject capable of infection with the 



(b) inoculating the subject with the virus; and 

(c) observing the subject to determine if the vaccine prevents or reduces 
5 an incidence of viral infection or presentation of one or more conditions associated with 

the viral infection. 





M 



30. \The method of claim 29, wherein the subject is a primate. 
10 31 • The^method of claim 30, wherein the primate is a non-human primate. 

32. The raethod^of claim 29, wherein the primate is immunocompromised. 



33. The method of clairn29, wherein the conditions associated with infection 
15 include B-cell hyperplasia, Iymphadenopauiy, splenomegaly, hypergammaglobinulinemia 
or autoimmune hemolytic anemia. 



20 





34. The method of claim 31 , wherein the non-human primate is a Rhesus 
macaque monkey. 
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(54) Title: CLONING OF RHADINOVIRUS GENOME AND METHODS FOR ITS USE 



(57) Abstract 

A novel rhesus macaque rhadinovirus, herein 
designated RRV, is disclosed. The genomic, cDNA 
and proteins sequences are provided. RRV has some 
similarity to human Kaposi's sarcoma-associated her- 
pesvirus and causes Kaposi's sarcoma-like symptoms 
in immunocompromised non-human primates. RRV 
possesses genes for both Interleukin-6 and macrophage 
inflammatory protein 1. The genome of RRV is useful 
for research, clinical and diagnostic applications aimed 
towards the rhadinoviruses and herpesviruses in general 
and KSHV in particular. In addition, methods for us- 
ing RRV to produce a non-human primate model for 
the testing of Kaposi's sarcoma-associated herpesvirus 
therapeutics and vaccines are presented. 
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FIG. 2 

Restriction Fragments of the RRV 17577 Genome 
BamHI EcoRI Hind W 
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FIG. 3 
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FIG. 5 

Comparison of Corresponding Repeals in 
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FIG. 10 

atg ttc cct gtc tgg ttc gtc ttg ttt tac ctg teg tgt tgg gcg gec 4 8 

Met Phe Pro Val Trp Phe Val Leu Phe Tyr Leu Ser Cys Trp Ala Ala 
15 10 15 

age cct acg ctg gcg cct ccc ccg act gec get gga att aac gtt etc 96 
Ser Pro Thr Leu Ala Pro Pro Pro Thr Ala Ala Gly lie Asn Val Leu 
20 25 30 

ccc cag tgg gee ggc aac cgc gec tct ctt gac agg acc agg ggg cgc 144 
Pro Gin Trp Ala Gly Asn Arg Ala Ser Leu Asp Arg Thr Arg Gly Arg 
35 40 45 

ctg tct gaa gtg ggg tta aac ata cag cgc tgg ttc gtt tac ctg tgc 192 
Leu Ser Glu Val Gly Leu Asn lie Gin Arg Trp Phe Val Tyr Leu Cys 
50 55 60 

cac cac tec act etc tgt egg gtg cgt gag tac ccg cgc ate atg teg 240 
His His Ser Thr Leu Cys Arg Val Arg Glu Tyr Pro Arg lie Met Ser 
65 70 75 80 

ttt gtt cac ttc cct ata ttg atg tct aac gtt gag tgc cag cgc cgc 288 
Phe Val His Phe Pro lie Leu Met Ser Asn Val Glu Cys Gin Arg Arg 
85 90 95 

gag ttt cgc ggg gec gag tgt atg aac gee atg gtt cgc ggg etc egg 336 
Glu Phe Arg Gly Ala Glu Cys Met Asn Ala Met Val Arg Gly Leu Arg 
100 105 . 110 

gec tac gag agt tac ctg acg cga ctg agg acg erg ctg gac gac gcg 384 
Ala Tyr Glu Ser Tyr Leu Thr Arg Leu Arg Met Leu Leu Asp Asp Ala 
IIS 120 125 

ccc ggg gac gcg gac gec gcg gec att ggc tec gcg gtg acc gtg gtg 432 
Pro Gly Asp Ala Asp Ala Ala Ala lie Gly Ser Ala Val Thr Val Val 
13 0' ' 135 140 

ctg tec gec etc gac tct eta att gag gag ctt ccc gta aat aac aag 480 
Leu Ser Ala Leu Asp Ser Leu lie Glu Glu Leu Pro Val Asn Asn Lys 
145 ' 150 155 160 

ata ggt ggc gcg gag tct aat gaa aaa acc gtg cgt gcg ttg gga ggg 528 
lie Gly Gly Ala Glu Ser Asn Glu Lys Thr Val Arg Ala Leu Gly Gly 
165 170 175 

cag age ccc egg gac gtt gtt etc age gcg ttt cgc ata ctg gaa tac 576 
Gin Ser Pro Arg Asp Val Val Leu Ser Ala Phe Arg lie Leu Glu Tyr 
180 " IBS 190 

eta cag atg ttt ttg egg gac ggg cgc cgc gca ata get atg atg taa 62*; 
Leu Gin Met Phe Leu Arg Asp Gly Arg Arg Ala lie Ala Met Met 
195 200 205 
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FIG. 11 

atg agg ggc ctt ttc gtg tgc gtt ttt ttt gcg gtg ttc gcg tgt gta 4 8 

Met Arg Gly Leu Pile Val Cys Val Phe Phe Ala Val Phe Ala Cys Val 
15 10 15 

gtt: gat tat gcc ttt cct atg ggc tec atg age gga ccc gcg ccc gaa 96 

Val Asp Tyr Ala Phe Pro Met Gly Ser Met Ser Gly Pro Ala Pro Glu 
20 25 30 

etc tgc tgt ttg ggg tat gta act cat ctg ccg cca ccc ggt tta gtg 144 

Leu Cys Cys Leu Gly Tyr Val Thr His Leu Pro Pro Pro Gly Leu Val 

35 40 45 



gtc tct tac tec cac acc teg teg cag tgc teg gtg gac gcc gtg ata 
Val Ser Tyr Ser His Thr Ser Ser Gin Cys Ser Val Asp Ala Val lie 
50 55 60 

tta aac act cgc cgc ggt aaa aag ctg tgt gcc aat ccc ggg 'gac gac 
Leu Asia Thr Arg Arg Gly Lys Lys Leu Cys Ala Asn Pro Gly Asp Asp 
65 70 75 80 

gca gtg aag aaa ctg ctt cag gcg gtg gac aag cgt ccc aaa aag ggc 
Ala Val Lys. Lys Leu Leu Gin Ala Val Asp Lys Arg Pro Lys Lys Gly 
85 90 95 

aga aga acc egg cgc age ctg att gac gat tec gaa gag ggc ctt ggc 
Arg Arg Thr Arg Arg Ser Leu lie Asp Asp Ser Glu Glu Gly Leu Gly 
100 105 110 

age ggg att tag 
Ser Gly lie 
115 



192 



240 



288 



336 



34 8 
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I hereby claim the benefit wider Title *5, United Stales Code, § 120 of any United States application^) or 
$ 365(c) of any PCT International applications) designating the United States, listed below and, insofar as 
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SEQUENCE LISTING 

<110> Wong et al . 

<12 0> CLONING OF RHADINOVIRUS GENOME AND METHODS FOR ITS USE 

<130> 53683 

<140> 60/107,507 
<141> 1998-11-06 

<140> 60/109,409 
<141> 1998-11-20 

<160> 179 

<170> 

<210> 1 
<211> 133719 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

1 GATCGGGAAA ACGCGAGGGG AGCGGGGGAC AGGGGACGGC GTGTGCGTGC TTGTGAGACA 
61 CCGGGTACGG CTGCCTGCCT GCTCG CTGGC CTGCTTGCTG AGGGGACAGT AGGCCTGCTT 
121 GCTCGCTGGC CTGCTTGCTG AGGGGACAGT AGGCCTGCTT GCTGAGGGGA CAGTAGGCCT 
181 GCTTGCTCGC TGGCCTGCTT GCTGAGGGGA CAGTAGGGCT GCTGGCTTGC TAGTAGGGCT 
241 GCTCGCTGGC CTG CTTGCTT GCTCGCTGGC CTGCTTGCTT GCTCGCTGGC CTGCTTGCTT 
3 01 GCTCGCTGGC CTGCTTGCTT GCTCGCTGGC CTGCTTGCTT GCTCGCTGGC CTGCTTGCTT 

3 61 GCTCGCTGGC CTGCTTGCTT GCTCGCTGGC CTGCTTGCTT GCTCGCTGGC CTGCTTGCTG 
421 AGGGGACAGT AGGGCTGCTT GCTTGCTAAG GGGACGGTAC GCCTGCCTGA TGGCTTGATA 

4 81 GTAGGGCTGC TGGGCTGCTA GTAGGGCTGC TGGGCTGCTA GTAGGGCTGC TGGGCTGCTA 
541 GTAGGGCTGC TGGGCTGCTA GTAGGGCTCC TGGGCTGCTA GTAGGGCTGC TGGGCTGCTA 
6 01 GTAGGGCTCC TGGGCTGCTA GTAGGGCTGC TGGGCTGCTA GTAGGGCTGC TGGGCTGCTA 
6 61 GTAGGGCTGC TGGGCTGCTA GTAGGGCTGC CTGCTGGCTT GCTTGCTTGC TTGCTAGTGG 
721 GGCCGCTTGC CTGCTACTAG GGCTGCTGTG CAGCTGGGAG AACAGAGTAG GGCTGCCGGC 
781 CAGCTGCGTG CGAGGGCGTC CGAGGGCCAG ACGAGGACAC GGGACCCGGG CCTCTCCCCC 
841 AGGCACAAAG CAGTAGGGCT GGCCAGGGGA AACAGTAAGG CTGATTGCTT G CTG AAAAAC 
901 AGTAGGGCTG CTGGTTTGTT GCTAACGGAA AAAGGGGAGG TGTGTGTATG CTTTTGTGAC 
961 TTTCCAGCGG AGGGGGTACA GTGCACGGCC AAGTTACAAG CACCTGCTTA ACTTG CTTTG 

1021 GCTCTGTGCG GTTTTGTTGC TAGGTACTAG TAGTAACACA TAGTATTTCA TCGCATGGCG 
10 81 CCTGCATACG CTCCACAGCA TCCGAAACAC GTTATTTCTA TAGACTAACT TTAGTGTTCT 
1141 TCTTGGTTGG TACCATTTTA CGAAGTTTGC TCCGTTGAAT AATAAGAAAT CCGTTGTGGT 
12 01 TACAATACAC CTGCCAACGT ATTGGATGGT TCCTTTTGCC ATCAACCTTT GCTTGCATTC 

12 61 TAACATTGAA TTTTCACATT ACACCCTAGT CTTCTCATGC AATTAGGTAT GTCTTTGCTC 
1321 GTGTTTAATT TTCTCTACAC ATTTCTGTGG AAATGTTTGT GTTGGTTTTA TTTATGTTAT 

13 81 TGCAACCTGT ATCGGTGGAG CTTTTGCCTG CAAAATTAAC TTCTGTTCCA ACGTGGTGTC 

14 41 CACCACATCC CGGAGATACC TACTTGCTAA CCTGCCGCGG GACGTCTACG GCCAGAGACC 
1501 AGCGAAGCAC ACAATGGTTT CGCAACAACA CGCTTATGCG TGGGAGTAAT TTCTACGGCA 

15 61 GACTGGTATC TGTGACTCCC AATGCTACGA TATCTGACCG GTATGCGTGT CAAACAAAAA 
1621 CAACAACGCG GAGTAACAAC ATCGATTTTC GGGTAAGCTC ATCGCGCCTC ACGCTCCAAG 
1681 AACGGTGCTC TTCATACGGC TATACTTACG CGAATAACAC AAGGGTATTG AGGTGTTACT 
1741 CTGGTGGAAA CGTAACTTTA AGAAACGTTG TCTTTCATTT AAACGGTACA GCGGTCATCA 
18 01 ACGGTACTAC AACAAACATA CATACATTTG TGTTAACAGA AAAGACAGGA GGGACGTATT 
1861 TCTGTTCTGC GTTTATTGGG AATGAAAAAT TCTATTCTCA GACAATTAAT GTGTTTTTTA 
1921 CTTCATTTAC CTTTAAACCT ACAAACGACA TTCCCAATGA GTCACATTTT AATAAAACTG 
1981 GGCAAATACA ACAAACAGCT AGTGTACAAC ATCCTGAAAA CTACGTTGTG TTCTCTGTTC 



1 



WO 00/28040 



PCT/US99/26260 



2 041 CCGTTTTTTC TATTGGCGTT TTAACAGGTA TTGCAATATC GTTGATTATG TGTTGGTTAT 
2101 TTACAATACG CTGCAACGAG AACTCTGAAT CATCAACTAA TAGTTATGCA AGCCAGACAA 
2161 GCTACATTCA ACCCTCCCAT AATCAGCGTT CCAATACTAA TGAATGTAGT CGCCATACCT 
2221 ACAGAAATGC TCATCAAGAA GAGAGTATTG AAGAACTACC AAACCAACAC ACAAGTGAAA 
22 SI CTGATTCTTG CTGTCAATTA GTTTTACTTG AAGTGAAAAA TGTAGCCTAC GATGGACCGC 
2341 AG G AAAAC AC AATTAACGAA GTTATGGAAC AGTATGATGA TGTGGTTGTA AAAAATATAG 
24 01 AACAAACATC ATATGAGGAT AATGTTGAGC ACATGGACTA TAGTGATACT AT AAATC C C A 

24 61 ATTTTAATTA CTACAGTGGA CTAATATTGG AAGAAGTAGA TGAAGTTTTT TACAATGAAC 
2521 TAGAAAATCA ATATCATGGA TTAATACTGG AGAATTTAGA TCACAATGAG TACAATCATT 

25 81 TAAATGAATT AAACATGATA GAACAATATG ATTGGTTAGA ATAAATAACT TGTGG GTT AT 
2 641 TTTTAAATTA AAACATAAAC AACATTAAAG C AG C ATTTTT GTGTAAATCC TTTATTTATT 
27 01 AAAATTTTTT TCATATACCT GAAACTTATA TTTAATT C C A TTTTCCTCAA AGTTTGTATC 
27 61 TTTGCCCGGT ATCTCTGATA ACATGGTGTA TTCGGTAAAA TTAATTGATG GAAAAAATAC 
2 821 ATCACAATCA AAACTCTCCA TAATGCGCGT AATATATAGT TTTAAGGGAC ACTTATAGTT 
2 8 81 AAGAACACTC TCATAAACAG ATTTTCCTCC AATAACCCAG ACCGTGTTTA ATTGTTCTTT 

2 941 TAGTTTGTAT TGTCTATAGA AATTAAACGC ATCGTCTAGC GTTCTCGCTA GAAAGTGTGC 

3 001 TCCGTGCGGT GGTTCACGTA ATTCTCTGCT CAAAATAATA TTAATTCTGT TCACCAGAGG 
3 061 GCGCTTCTTT TCAGGGATGG AAAACCATGT TCTTTTCCCC ATAATAACCA CATTCTTTTC 
3121 ACCTACAACC GATGGCGTAG ACGTCATTTT CTGAAAATAC ATCATTTCGT TTCTAAGGTA 
3181 CGGCCAAGGC ATTGTTCCGT TTTTACCAAT TCCTAATTGT TCATCAACTG CAACGATGCA 
3 241 GTTAACTGTA ATGTCCATGG TTTTAGTTGC CACTGACGGG TTTAACACAG AAGTATTTCA 
33 01 CAATTATAAA CAAATAACCC ACGTG A CATG TACTTACTAA TGTAAGTACG TAACGTGATA 

33 61 TAGTTAATCA TATTCGCTTT CACATAATGG ACAAAACGAA AAAATGTTAC CTGTGTTAAG 

34 21 CCTACCTTGT TTGGATGGAT TTATGGTTAC AATGAAAAAT AAAAATATAT ATATAACGTT 
34 81 TTTTACG TAA AACATATTGT AATTTAATCA CGTGATGTTT ATGTTGG CT A CTAACACACA 
3 541 ATGTTTG CAT AAAAGCACTT ATGTTAGTGA GCTTCAATCT TTATATTTTA TTCAAATTGT 
3 6 01 TTAAAGAGGC ATGTGAATAT ATTTAACACG CATTCAGAAA TAGGCGCAAC GCTGTGCCGT 
3661 ATAACTGGTA AAAACATGAC GTTTAAACTT TTTCCTCTGT TTTTATTACA CGCCATAATG 
3721 TACGTCCACT GCGATGAAAA CTGTAAACCT CCACATTTCA CGGAATATCG CGTCAAGTCT 
37 81 AACACAGAAA AGGACTTATA TAGTGTTGGA GAAACAGCTG AATTAATTTG TCGTCCTGGT 
3 841 TATGTTACAA ATACAAAAAT AATAACAACA GAATGTTTAC AAAATGGTAC GTGGTCAACA 
3 9 01 CCAAATTTTC CATG C G AC AG AAAAAGATGT CCCACACCTG CTGACTTGCT GAATGGAGCC 

3 961 GTGCACATTC ACGGGGGAGA TAATGCCTTA AAATTTG G AT CCAATATTTC CTATGAGTGT 

4 021 AATGAAGGTT ATGATTTAAT TGGTAGTAAT GTT CGTTTTT GTATTTTACA AGACACAGAA 
4 081 AACGTAAATT GGGATTCAAA TGAACCAGTC TGTGAAATTC AGAAATGTAT TAAACCACCG 
4141 GCAGTGGAAC ATGGGGACTA CCTACCTAAC CAAGATGTTT ATAACTATGG AGATGCAATT 
42 01 ACATTTAAAT GTTCATTGTC GTATACACTC GTTGGATCAA CAACATTAGT ATGCACGTCA 
42 61 AACAAAAAGT GGTCAAACTC TTTCCCAACG TGTTTAATGC TCGTATGTGA AAGTCCACAA 
4 321 ATAGACAATG GGTACATAGA CATTGGCTTG TCCAGAAGAT A CAA C CATGG ACAATCAATT 
4 3 81 ACTGTAAAGT GTAGCGACGG GTACAACATT GTTGGGCCTG AAACATTAAC GTGCACAAAC 
44 41 ACAACTTGGG TTCC AC C ATT ACCTAAGTGT GTATTAGTTA CAAATAACCC AAGCACACCC 
4 5 01 ATGCCAGAAA CACCCATGCC AGAAACACCC ACGCCAGATT ATCAAAAAAT AAATTTGTCA 
4 5 61 ACCGCTAAAA CTGCAACAAC ACCAAATGCG TTTGTTACAA CTGTTGTTTC TCCAGAGAAA 
4 621 GACGACGTTA CTTGTGTAAA GCCTCATTTT GAGCGATTCA TGGTAAAGGC TGAAAATGAC 
4 6 81 AAGGAAAAAT ACAGTGTTGG TGCGAGCGTT GAGCTAATAT GTCGACCAGG ATTTACTAAA 
4 741 ATGCAGTCTA CAGTTTCTGT TGAATGTTTG TCCAACGGAA CATGG ACTGC TCCAAATGCC 
4 8 01 AAGTGTCATA GAAlAAAAATG TCCAACCCCT CAAGAACTTT TAAACGGAGA GTATATAGTT 
4 8 61 ACAAGCGGAG AAGATGCTTT TAAGTACGGA ACAAATATAA CATATAAATG TAATGAAGGT 
4 921 TATCAACTTT TAGGAAGTAT GGTGCGGATT TGTATGCTTA AAGACGATTT AAAAACAGTT 

4 9 81 GACTGGGAGC CAAAAGCGCC TATATGTGAT ATTGAAAAAT GTAAGCCACC GCCACAAATT 

5 041 ACAAACGGAA AATACCATCC GGTGAAAGAC TTTTATCAGT ATTTGGACAC CGTAACATTT 
5101 TCGTGCAATC GTGACTTTTC TTTAGTTGGA GATGAAATGA CAACGTGTAT AAGTAATACG 
5161 TGGAATAAAC CGTTTCCAAG ATGTGAACAA ATCACTTGCA GCGCTCCTAA TATTGCACAC 
5221 GGAAAGCTGC TAACAGGTTC TTCAAGCGTT TACAAATACG GTCAATCTGT TACCATTGGT 

52 81 TGTGAAACTG GATTTACTCT AATTGGCAGT GAAATTTCTA CATGCAAGGA TTCATCGTGG 

53 41 GATCCACCAC TTCCTACGTG CGTGCCAGCT GTTTCAATGC CTTCTGACAC ACCTAAACCA 

54 01 GAAACCAAAA AACCAAACAC GCCAACGCCA GAAGCACCCA AACCAAACAC CCCAAACGTT 



9 



WO 00/28040 



PCT/US99/26260 



54 61 GGAACACATA CACCATTCAA ACCACCACCA 
5 52 1 AAATGGAAAA GGCATGTCGT GTTAGTTCTT 
5581 CTTGCTGCCC TTTATTGTTG TTTTCTAAAA 
5641 AGGCAAACTC GCACGCATTA ACCAATCTGC 
5701 CATTATTCAA GGTTGCCAAT AAAGGTGGTT 
57 61 ATGTTGACCC GTTTATAGTT ATCGCGCCAC 
5 821 TTCCACGTTA TACTTTCACG TACTATGACT 
5 881 TGTGGCCGGG GCTGAAAATA ACACAAGGGG 

5 941 ACACGGTTTA TAAAACTATA TCGGATGCGC 

6 001 GAAAAACATT TTAACGTTTG TTTAGCGAAC 
6 061 ACGCCGGACA ACCCCTGGAA GATAATCAGG 
6121 ACGTGTACGC GTATTCGAAA CAAGACTTTC 
6181 GACCATCTGG ATCTGGCGTT TTCTCGCTAC 
6241 AATTCCCTCT CACCGTAAAA GCCGCATACA 
63 01 AGGTGACGTG CTTTCACAGA GAGGTTATTG 

63 61 TGTTTGACGG TACCGGTCTT AACGAACTAT 
6421 CGCAGTTTAT AGAACCGGGT CCACCTCACA 

64 81 TACCGGACAA GGATGAGATG TTTCTTGGCG 
6541 TGTGGAGGGG CTGTCTCGTT CCCGCGGTGT 
6601 G CCAGG CGTT TAAAGTGCCG TTGTACGACG 
6 661 TGCCAAGGTT TTACCATAAA GACGTTAGCG 
6 721 TCGCCCAGGC CCTGAGACTC AAAGACGTGA 
67 81 TCATGCAGGA C CATTAC AAA ATTGCCAAGA 
6 841 TGCCGAAAAC GACAGACGGG TCGTCCCACA 

6 901 CCCTTAGTTA CGGCTGTATG TTTCTCGAGT 
6961 ACGATAGCTG GCCCATATTT GATGGTTGTG 

7 021 AGCGCTGGTC GG CCG AACAG GCCGTTCACG 
7081 TGCTGTACCT AACTAAAGTG CAGAAGCAAG 
7141 TGTACAACTC CTTTTTC CTC CAACACGGAC 
72 01 AAAACGGCAG CGAAGCCTTT AAGGGCGTAC 

72 61 CGCCGTATCA CCTGGCCTAC GCCGCGTCTT 
7321 ATTACATGCA GTTCTTGCAA CACCACAAAA 

73 81 ATTATGTCGG CACCGCCGCC AACTCAGAGA 

74 41 CAACGTGCCT CAACACGCTG TTCTATAGAC 

75 01 CTCAGCGCAG GGACCCCTAC GTGGTGACCG 
75 61 TTCTGGGCAA CTTCGCGAGC TTTAGAGACC 
7 621 ACCCAAAGTA CACGTACTGG C AG CTATGTC 
7681 GAATCACCGA AGACCACGAT AATCACGTGA 
774 1 GGGTGTTCAA GGGTATCGAC TCAATTGTGG 
7 8 01 TGATTAAAAA TAACTTTAAT TTCCGCGAGC 
7 8 61 TCTGCTGCAA CGTGTATTGG CAGGCGCCGT 
7 921 CCCTGCTGTG GATCATTCAG GATATCTGCC 

7 981 ATCCGGCGAT GGGCATCCTC CCCTCCGAGT 

8 04 1 CGAACTTTAA AGCGGCGTGT CTCGACCGCG 
8101 TACACCGGGA CATGTTCTGC GACTTCTTCG 
8161 CCCCCTTTAA AATGCAGGTT AGAATAGCCC 
8221 AAATAAAAAA TAGAATCATT TTTTCCAACA 
82 81 TCGTCAAACC GACGGGAACC AGGGACACTT 
8 34 1 TCAACTCGCT GCATCGCGCG CTGTTCCCCG 
84 01 ACAAGATCTC CCAGACCAAC AAAACCCCAG 

84 61 CGGAGCTGGT GTCGTACGTA AAGACCAACA 
8521 ACGTGGTTCC GGATTCACTC ATGTCGTACG 

85 81 GGGCATGTGG CCAGATTCAG TTCTACGCCA 
8641 AGACGATCGA TGCCGAGGAA TACCCCCACG 
87 01 TGGCTTACCT GGCAGAAATA CGCGGCCGCA 
87 61 AGCCGGTCGC CGCCACAGGG CGCCTGCGTC 
8 821 AATACACGGG GGTCAACGGG AACAACAACG 



CAAAATCCAC CAATAGCACC CCCAATGAGT 
TTTGCAAGTG TCGCGTCCTT GTTATTCGTA 
TAACTGTTTT TTGTCTTCAG CAGGTTCGCC 
CAACCGCCGT TGATTCCGGA ATTAAGTTTA 
TAAAAATATT CTATTGGTGT TCATTGTTTT 
CTTGTGGCTA CATTATATAG CACGATCACT 
CATACGCCTT AACGTCACGT GGCGTGCGAT 
T AC AT AATC C ATCCAGGCGG CACACATTAG 
CAACAATCAC TGTCGCTAGC GACACTGATA 
TTGAATAACA CATAATGGCT TCCAAAGGCA 
GGTCTCGTGC CCCGATAGGT GCGTGCGGAT 
CCTTTGCCGA GGCGTCCATA CTCGGCAACA 
CAATCCTTTA CGGACTTACA GTTGAACACG 
AAAAAGTTGA CACCACGACG CTCG CCGTTA 
TGTTTCACAA TGCAAGTTTA TTCAGGCCGG 
GCGAGGAAGC CAGGGCTCTC TTTGGGTACA 
GCATATGGAA CCCTCTGGAA TGTCCGCAGT 
TTGTTGTTAC GGAAGGGTTT AAGGAAAGAC 
TCCAGACCCA GCAGGTGCAG ATTGCCGGAC 
AAGACCTGTT TGCACCTCAC GGTCATAGAA 
CGTACCTCTA CGACTCCCTC TTTAC C AG C A 
CGGCGGTCAT CCACGCCACA GAAAAGCAAT 
TAGTGCAGGC AAAACAGTTT TCAACGACGC 
TGATTGTGGA CAG CGTCGTC GCCGAGCTCG 
GTCCCCAGGA CGCGTGCGAG TTGCTGAACT 
ACTCACCAGA GGCTAGGGTT AACGCGTTAG 
TGGCGGGTCA ACTGTTCGCT GCCAATTCGG 
CGCCCAGGGG AC AAAAG G G A GACGTAAACG 
TGGGGTTTTT AAATGAGGCC ACGATCAAGG 
CCTCAAACGC CCTCGATGGT TCTTCGTTCA 
TCTCGCCCCA TCTGCTGGCG AAGTTATGTT 
GCTCCACGAA CCAGGCGTTT AACATGGTCC 
TGTGCACGCT ATGTCACGGC AACACGCCGG 
TGAAGGATAG GTTTCCCGCC GTAACCACCC 
GAACAGCCGG GACCTTTAAC GACCTGGAGA 
GCGAAGAGGA CGGAAACCCG GCCGACGAGC 
AGACCGTGAC AGAAAAGCTA TCCGCGATTG 
ACCTCATCAC CAACATCCAA AG TTTTCTC A 
ACGGAGAGGT CATGAAGTTC GTTAATTCGA 
ACGTCAAATC GGTCCATCAC ATA CTC CAG T 
GCGCGGTGTT CCTGAATCTG TACTACAAAT 
TGCCGTACTG CATGATCTAC GAACAAGATA 
GGCTAAAGAT G C ATTTTC AG ACGTTGTGGA 
GGGTCCTCAC GGGGTGCGAA CTGAAAATCG 
ACACCGACGC GGGGTCCAAC GGCTTAATGG 
GAGCCATGAT GGTCGTTCCG AAATCAATTA 
CCGCGGGATC CGAGGCGGTG CAGTCGGGGT 
ACGTGGTGGC CGGACCGTAC ATGAAGTTTC 
ACACCAAGAC CGCCGCGCTG TACCTGTGGC 
TTCTGAAAGA CGTCCCGGAC GACGAGCTGG 
GCCTCGCGTT CGAGGAAACG AACGTG CTGG 
CGAGGATCAA ACTGAACGGG GCCATTCTAA 
CGACGCTGCA CTGCCTCACG CCGGTGCTAC 
TGCTGGGCTC CGCGG CAATC GCCACACCGG 
CCGCCCTCAC CGTCCAGACG ACGGCGCGTC 
CCGTGATAAC CGTTCCCATG GTAGTCAACA 
TTTTCCACTG CGG AAACCTG GGGTACTTCG 
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8881 CGGGGCGCGG CGTGGACCGC AACCTGTGGC CGGAAAGCTC CCCCTTTAAG AAAACGGG.CG 
8 941 TCAGCGCCAT GCTAAGAAAG AGACACGTCA TGATGACCCC CATTATCGAC CGCCTAATAA 
90 01 AGCGAGCCGC GGGACAGACA ATCAGCACGT TCGAGGCGGA AAGCGTTAAA AG G AGCGTG C 
90 61 AGGCGCTGTT AGAGGATAAG GACAACCCTA ACCTATTGAA GTCGGTAATC TTGGAGCTTA 
9121 TACGACACCT GGGGAAGGGC TGCCAGGACT TAAGCTCCGA GGACGTGCAA TATTACCTCG 
9181 GTGACTATTG TATGTTGACG GACGAGGTTT TATTTACGTT GGATAATATA GCACAGTCAG 
9241 GCGTGCCGTG GACTATCGAG GACGCGGGTG CCCTAATAGA GGATCGCCAG GACGCAGACG 
93 01 ATCTTCAGTT CGTAGACAGC GACGATATCG CCACCGCTTC CTGTCAGCCC CCCGAGGAAC 

93 61 AG CTACCG AC CCCTAGCGCC GGCGCCCTAC TGGCCGGGAA GAAGCGAAAA ATTAACGCGC 
9421 TGCTGAGCGA TCTAGACCTT TAGGAAAACC GTGGCAGGCG GGCAACAATG GCCAGGGAAC 

94 81 TCGCAGCATT ATACGCG CAG CTGTCGGCCC TCGCCGTCGA CTTGAGTCTG GTTATCTTTG 
9541 CGGACCCGCG AAGTATCGAC GGTGCCCGCA TTCTAAAAAC AAAAACACAG ATAGAGAACC 
96 01 TGAACCGCGA CCTTCTGCCG CTGCTACGCG AG C AAAACT C GGTAGAGACG TCCAGCCTGT 
9661 CGCTCGAAGT GGAGCACCTG GCCAAAAACA TCGAGGACAA ACTCGGCGAG CTGGAGCGCA 
9721 GTCTGCGGCA GAGATATTCG AGCCGAGAGC ATTTTGAAAC ACTACACCTG AGACCCGAAT 
9781 GTCACTATCA CTCTACGGTT ACTTTTCAGT TTTACGGGGG CGGGTTAATA GATGTAAACA 
9841 TGTGCCTAAT AAACGATGTA GAACTGCTGT GTAAAAGACT AGGGAGTGTG TTTTATTGCA 
9901 TCGGTGCAAA CGAAGCTCTG TCCGGATTGA ACCGGGTTCT GACGTTTCTG TCAACACTGC 
9961 GGGGTATCTC CCCGATCCCG CACCCAGACC TATACGTCAC GTCAGTGCCT TGCGTACAGT 

10 021 GCCTGAGGGA AATCGAACTC GTACCAAATC AGGGGTCCAG TTTACTCGCG GTGTTGGCAG 
10081 ACCGACACTG CGATCACCTC TGTAAGAAGG TTAGGGCGGA GCCAATACAC GGCCTGTTTG 
10141 AGACAGAACT GAGCCAGCTG GGTCTAAAAG TAACAAAACG TTCGGACGCC ACGCAGCACG 
10201 GCGTCCGGTC CTCTGCAGAT CAGTTAAGGG AGTCGTCGCT GGCGGCCATA CAAGATCACA 
10261 ATATATTCAA ACGGGTGTCC GCGTCAATCA TGGAACTATC CAATCTAATT TATTGGAACG 
10321 CCGGGCAAAC CGGCCTCCAG ACCGGGACCG AAAACGAGTG CTCACAAATG GCCAGACTGC 
10381 TAACACACGA GGCCGATATG CACGAGCACC GTGCGCTAAT AACACCCAAA CTAAGCGCGA 
10441 CTCACTTCTA CGACTGTTTC CGACCGGATC CCATAGAATC CCTGTTCTGC GGCGGTCTTT 
10501 TTAACTCTAT AG ACG AC AC C ATAAACG CAC TG AG CCGGG A TTG CTCCGTG ACGTTCTTTC 
.10561 AA CAG G C AAA CTATACCAAC GTTATGCGAA AACAAAACGA GCTGTTCACC AGACTCAATA 
10 621 GCATCCTGCG TCAGGGG AG C GCGGGATCGC AAAAACCGGC CACCCCCTCG GAGCCACGGA 
10681 CCACCACCGT GGCGGCAACC GCGGCAAGCG ACGTCATCAA AGACGCACAG TATCGCAAAG 
10741 AACAGTACAT GAAAAAGGTG GCCAGGGACG GCTTTAAAAA ACTAACAGAG TGTCTGCAGA 
108 01 CGCAGAGCGC GGTGTTGGCA AACGCACTCT GCATGCGCGT ATGGGGGGGC GTCGCCTACG 
10861 GCGAGGCGTC CG AG CTGGTG AACCATTTTC TCCTCAGGCG GCGCTTCGTC GCGCTTCCCT 
10 921 GGGAGGCGCG CTGCCGCTCG GATCAGATTT TATTCGAAAA CTCAAAGTAC ATTAAAAACT 
10981 CACTATATTC CCAGCGCCTC AGTCGCGAAC ACGTAGAGAT TATCACGCTG CAGTTTTACG 
11041 GCCTGATAAC CGGCCCCCTG ACGCGCCAGA GCGATCTCTT TCCCGGCCCC GCCAACGTCG 
11101 CGCTGGCCCA GTGTTTCGAG GCGGCCGGAA TGCTTCCGCA TCACAAGATG CTGGTGTCAG 
11161 AGATGATATG GCCCCAG ATT CAACCGAAAG ACTGGATAGA CCAGACATTT AATCGTTTTT 
112 21 ACCAACTTCC CGAAGGTG AT CTCAACG CGG TACAAAAGTC CGCCTGGTGC TTTATACGAG 
112 81 AGCTCGTCCT CTCGGTGGCG CTTTATAATC GCACGTGGGA AAAGACGCTG CGGATATTTT 
11341 CCCTAGCGCG CGAGAAACTC TCCATCTCCA ACCTAGACGT TAAAGGCCTG ACGTCCGGCC 
114 01 TGTATCTAAC GTACGAGCAA GACGCGCCGC TCGTTCTAAT TTCTCAAAAT ACCGGCTGGA 

114 61 TATTTAAAGA CCTGTACGCT CTTCTGTACC ATCACCTGCA ACTGTCCGAC GGCCATGATG 
11521 ATAACTAACC GAACGCGTCG TCTCCTGCGG GCGTGGGTCG TGATAATCGC GATCGGCACG 

115 81 GCGGTTGGCG AAAACGTCAC CACCCCCAAG GGCGCGACCA CCACCGCGAA GCCAACGCCG 
11641 GGCCCGTCGA CGCCCACACC TCCCGAGAAC CCACCTAGGG CCGAGGCGTT TAAGTTTCGC 
117 01 GTGTGCAGCG CCTCGGCCAC CGGCGAACTC TTCAGGTTTA ACCTGGAAAA AACGTGTCCG 
117 61 GGCACCGAGG ACAAGACGCA CCAAGAAGGC ATCCTGATGG TGTTTAAAAA AAATATTGTC 
11821 CCGCACATCT TTAAGGTCAG ACGGTACCGC AAGGTGGCCA CCTCGGTGAC CGTCTATCGA 
11881 GGGTGGACCG AGACCGCCGT GACCGGCAAG CAAGAGGTCA TCCGACCGGT GCCGCAGTAC 
11941 GAGATCAACC ACATGGACAC GACCTACCAG TGTTTCAGCT CCATGCGCGT AAACGTCAAC 
12 001 GGCATAGTAA ACACCTACAC GGACAGGGAC TTCACTAACC AGACCGTGTT TCTGCAACCG 
12 061 GTCGAGGGGC TCACGGATAA CATCCAGCGA TACTTCAGTC AGCCGGTGCT GTACACGACA 
12121 CCGGG ATGGT TTCCGGGAAT TTACAGGGTC AGAACCACGG TCAACTGCGA GATCGTGGAC 
12181 ATGATCGCGC GTTCGGCGGA ACCGTACTCG TATTTTGTCA CCGCCCTGGG AGACACGGTA 
1224 1 GAGGTGTCCC CGTTCTGCCA CAACGACTCA ACGTG CTCGG TCGCGGAGAA AACCGAAAAC 
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12301 
12361 
12421 
12481 
12541 
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12661 
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14881 
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15181 
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15481 
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GGCCTCGGCG 
ACCACCGAAA 
GACCCCAAGT 
ACGACGCACG 
CCGCTCTCCC 
AAAACCCTCA 
CAGTACTACG 
dk\&GCTGACG 
ACCGCCAACC 
GCGCCCCAGC 
GAGCTCTCCA 
AG C AAG ATTA 
TTCGTGGGCG 
ATCCACAAG A 
TTC AGGTTC C 
ATACTGACGG 
AG C AACGTAA 
ATATCCACCC 
AGGGTCATCG 
GAAACCATGT 
CTGGACAACA 
GCGGACCTGG 
TTCGGATCAA 
ATGATCCTGG 
GCCATCGCCC 
GGCGGTAAAG 
CAGGAAGAGC 
GCGTCAGACG 
CAAGAGTATG 
CCGCGCGGCC 
CGTCCGCGGT 
CGCCCCCACT 
CCGCCACCAG 
ATGATCCCGG 
GTGCTGCTCG 
GACCCCCAAG 
GCGGACAGGT 
GTGCTCAAGC 
CAACAGGTAT 
CAGGCCCTCA 
AAAAGAATTC 
TCCGGTTCCA 
GAGTCAAACG 
TGGTACTCGT 
GAGTTTGACT 
TACCGCATCG 
CGCGACGGCG 
CCCAATCCGC 
GTTTTGGAGT 
TTCGAGGTGG 
ACGCGAGCGT 
TCCATCTTTG 
AAAATTAAAA 
AGCCTCTCCG 
GACGTATCGT 
GTGGGCAGCT 
ATACACGTGG 



CCCGCGTGCT 
CGCGGGTCTT 
CGGCGGTCTG 
AGGCCAGCTA 
AGGTAACTAA 
ACGCCACCAT 
AAACCGAGGG 
AGATGCGCGA 
GCGTTCGAAG 
TGCAGTTCGC 
GGGCGTGGTG 
ACCCCACCAG 
ACGCCATCTC 
GCCTCCGCAC 
TCAACAGCAC 
ACAACCAGGT 
CCTACTACTA 
TCGGTACGTT 
AG CTGTACAG 
TCAGGGAATA 
CGATCGACCT 
GCGATGTCGG 
TCGTGAGCGG 
TGATTGTGGC 
AGGCCCCCAT 
TCGACCAGGA 
GTAGG CGGTT 
GACTAAAACG 
AAATGAGCAA 
AATCGATTCG 
CTGTATCAAT 
CACACAGAGG 
ACGTTTGCAG 
TCACGATCCC 
CCAACGAACG 
ACCAATCCAT 
GTGCCGAGGT 
TCCTGGGGCG 
ATTTCTACGC 
AG AAC AC AG C 
TCAAAACGTA 
TGCTCTCGAC 
TGGACGCCGT 
GCGCGCGCGC 
GCAGCTGGGA 
TGGCCTTTGA 
ACGCGGTGAT 
CAAACATACT 
TTCCGTCGGA 
ACTTTTTAAC 
CCCAGGTGTA 
AAGTTCACGA 
TAGCGGGCAT 
ACTACAAACT 
ACAAGGACAT 
ATTGCGTGAT 
AGATTTCGGA 



CACAAATTAC 
CGCCGACTCG 
CGCGCTGACG 
CCACTTCGTG 
CTTCACGGGC 
CAAGAAGCTG 
GGGTCTGTTT 
ATTAAACGGC 
AAGCGTCGGT 
CTACGACAAG 
CCGAGAACAG 
CGTAATGACG 
CGTGACGGAC 
GTCCACCCCG 
CACGCTGTTC 
GGAGGCGTGC 
CAAAGACTAC 
CATCGCCCTG 
CCGCGCGGAG 
CAACTACTAC 
GAACCGCGAC 
CCGCACGGTC 
GTTCATTAAC 
GGTCGTCCTG 
CAGGATGATC 
G C AG ATT AAA 
AGACGAACAG 
TCG CTTTAGG 
ATAACCACAC 
CGCACCCGGC 
CATGGATTTC 
CACCGATGCT 
GCTCATCCCC 
GTTCCCGCCA 
GTCCATGTGG 
CACGTTTCAC 
ACCTAGCCGC 
AACCGAGGAC 
GAAGGTTCCA 
CGGCCGTGCC 
CGACGTCGCG 
CCTCAGCGAC 
TCGCCGGTTC 
CACGCCCCGC 
GGACCTCAGC 
TATCGAGTGC 
CCAGATCTCC 
GTTCAGCGTC 
ATATGACATG 
CGGCTATAAC 
CAACCTTCGA 
GCCCCGTGGC 
CGTCCCCATA 
GGACACGGTG 
TCCCCCTCTG 
GGACTCGGTC 
GATAGCCAAG 



ACCATCGTCG 
GGAGAATACA 
CTCTGGAAAA 
GCCAACGACG 
ACGTACCCCT 
TCCGATACCC 
CTCCTGTGGC 
ACCACGCCAG 
ACGAACGAGC 
CTCCGCGCGA 
GTGAGGGACA 
GCGATATACG 
TGCGTGGCGG 
GGGATCTGCT 
AAGGGCCAGC 
AAAGAGACGT 
GTCTTCGTGA 
AACCTGTCGT 
AAAAAG CTGT 
ACGCAACGCC 
CGCCTGGCCC 
GTTAACGTGG 
TTT AT AAAG A 
ATCGTGTTTG 
TACCCCGACA 
AACATTCTCG 
CAGAGGTCAG 
GGATATAAAC 
CCACACGCCT 
CGGTCCCCGA 
TTTAACCCGT 
CCCGCCCCTG 
GCCTGCCTCC 
ACGTACTTCG 
ACGGCGCGCG 
GCGTACGACG 
TTCCAAACGG 
GGCACCAGCG 
G CCGGC ATTA 
GCGTGCGGCT 
GAGCATCCCG 
CGCCTCGTCG 
GTTCTGGATC 
CTGGCGGCCA 
GTTCAAGCGG 
ACTGGAGAGG 
TGCGTCTTCT 
GGGACGTGCG 
CTGGTGTCGT 
ATCTCAAACT 
TTAAACGAAT 
GGGGGAGGGG 
GACATGTACC 
GCCAGGCAGT 
TTTCGCTCAG 
CTGGTGATGG 
CTGGCCAAGA 



ACTTCGCGAC 
CCGTATCGTG 
CCTTCCCCAG 
TGACGGCGAC 
GCCTCAATGA 
ACG C AAC AAA 
AGCCGTTAAC 
CACCCCCCAC 
AG G C AACGG A 
G CATC AAC AA 
CCTACATGTG 
GGCGGCCGGT 
TGGACCAGGC 
ACTCGCGCCC 
TGGGACCCAG 
GCGAACACTA 
AAAAAATTAA 
TTATAGAGAA 
CCGGGAGCGT 
TGGCGGGACT 
GCGACCTGTC 
CCAGTAGCGT 
GTCCGTTCGG 
CGCTAAACCG 
TAGACAAAAT 
CCGGCATGCA 
CGCCCTCGCT 
CGCTGGAAAA 
GTACTTGCCG 
CACCCTCTAG 
ACCTGGGCCC 
CCGGCGCCGG 
GAACGCCAGG 
AGAACGGTGC 
ACCGCAAGCC 
TCGTTGAAAC 
ACATTATCCC 
TGTGCGTGAA 
ACGTCACCCA 
TCTCGACCAG 
TCACGGAAAT 
CGTGCGGGTG 
ACGGGTTTAC 
GAGATGCCAG 
ACCGCAGCGA 
CGGGATTTCC 
ACACGACCAG 
ACCCCATCCC 
TCTTCG C C AT 
TCGATCTCCC 
ACACAAAAAT 
GGTTCATGAG 
AGGTGTGTCG 
GTCTGGGTGG 
GTCCGGGCGG 
ACCTCTTAAA 
TTCAGGCCAG 



CCGCCAGCCC 
GAAGGCGGAG 
GGCGATACAG 
CTTCACGTCC 
TGTTATTCAG 
CGGATCGGAG 
GCCGCTAAGC 
CACAACCTCA 
CGACCTAGCG 
GGTGCTGGAG 
GTACGAACTG 
GTCGG CCAAG 
GTCCGTCAGC 
CCCGGTCACG 
AAACGAGATC 
CTT CAT AG CG 
CACCTCCGAG 
CATAGATTTC 
TTTCGATATA 
CCGGGAGGAC 
CGAGATAGTC 
GATAACCCTG 
GGGCATGCTC 
GCGCACCAAC 
GCAGCCCTCT 
CC AG CT AC AG 
TTTCCGGCGC 
CG AAG AGG CT 
CCCGCCAGAG 
CGCCCCCGGG 
TCGCGGACCA 
AG CCGTTCAG 
GGCAGGCGGG 
TCGCGGAGAC 
CGTCGCCCCG 
AACGTACGCG 
AAGCGGAACC 
CGTGTTCCGT 
CATCCTCCAG 
AAGAGTGAAC 
CACGCTATCG 
CGAGGTGTTC 
CACGTTCGGG 
GACGGCCCTG 
CTGGCCCCCG 
GTGCGCCACG 
GGAAGGCGCG 
GGACACCGAC 
GATCCGCGAC 
GTACCTAATC 
AAAAACCGGC 
GTCGGTCTCA 
CGAAAAGCTC 
GAAAAAAGAG 
CAGGGCTAAG 
AATGTTTATG 
GCGCGTCCTG 
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15721 ACGGACGGCC AACAGCTCCG CGTGTTCTCC TGCCTGCTGG AGGCCGCGGC CAGGGAGAAC 
15781 TTTATCCTCC CGGTTCCAAC GCCCGAGGGA CAGGGGGGCT ATCAGGGCGC GACGGTGATC 
15 841 AACCCCATTC CGGGGTTTTA CGACGAGCCG GTCCTGGTGG TCGATTTTGC CAGCCTGTAC 
15901 C CG AG CATC A TCCAGGCGCA CAACCTGTGC TACTCCACCA TGATACACGG ACGAGACCTG 

15 961 CACCTGCACC CCAACCTGAC GCCGGACGAC TACGAGACGT TCGTGCTGAG CGGCGGACCG 

16 021 GTACATTTTG TAAAAAAACA CAAGCGGGAG TCTCTGCTGG GAAGACTGCT AACCGTGTGG 
16081 TTAGAAAAGC GAAGGGCGAT CCGGCGCACC CTGGCGGCGT GCGATGACCC GTCGCTAAAA 
16141 ACCATCTTAG ATAAACAACA GCTGGCCATC AAGGTGACAT GTAACGCGGT TTACGGGTTC 
16201 ACCGGGGTGG CCAGCGGCCT CCTCCCATGC ATTAACATAG CGGAAACCGT GACGCTCCGG 
16261 GGGCGCACGA TGCTGGAGAT GTCAAAGTCT TACGTGGAGG CCCTGACGAC GGAAGACCTG 
16321 CGAACGCGTC TCGGTCGCGA GGTGACCGCC CGTCACGGCG CGCGGTTTCG CGTCGTCTAC 
16381 GGTGACACCG ACTCCCTCTT TATCGCGTGC GACGGTTATT CCGCGGAAGC CGTTTCCGCT 
16441 TTCTGTGACG ATCTGGCCGC CAGGATCACT GCGGACCTGT TCCCCCCACC CATTAAGCTA 
16501 GAGGCGGAAA AGACGTTCAA GTGTCTGCTG CTGCTGACGA AAAAGCGCTA CATCGGGGTC 
165 61 CTATTGAACG ACAAAATGGT CATG AAAGGG GTCGACCTCA TTCGCAAAAC GGCCTGCAAG 
16621 TTTGTCCAGG AGCGATGCCG CGCCATCCTG GACCTGGTGC TCCACGATCC GGAGGTCAAG 
16681 GCTGCGGCGC GG CTGTTGTG CAAGCGGCCG CCGCACGCGG TATACGAGGA GGGGCTGCCG 
16741 GCTGGCTTTA TAAAAATCGT AGAGGTCCTC AACGCGAGCT ATCTGGACCT CCGAAACAGC 
16801 GTCGTGCCCA TCGAGCAGTT AACGTTCTCC ACCGAGCTCA GCCGCCCCGT CTGCGATTAC 
168 61 AAGACCACCA ACCTGCCCCA CCTGGCGGTG TACCAAAAGC TGGCGAGCAG GTGCGAGGAG 
16 921 CTGCCCCAGG TGCACGATAG AATCCCCTAC GTGTTCGTTG ACGCGCCCGG GTCCCTAAAG 

16 981 TCGGACCTGG CCGAACACCC GGATTACGTC AGACAGCACC AGATTCCCGT CGCGGTCGAC 

17 041 CTATATTTCG ACAAACTGGT GCACGGCGCG GCCAACATCC TCCAGTGTCT GTTCGGCAAC 
17101 AACGCGGACA CCACGGTGGC CATCCTCTAC AATTTT CTC A ACGTCCCGTA TAAGCTGTTC 
17161 TCGTGAACGC CAATTGGAGA ACGCCAACAT AAGACGCCGC GCCAGCGGAG TCCGCAGGGA 
17221 GAGCTCGAGC GGCGAGGAGC GACCAACGGA GACCGCCACC ATGCTGGTTA ACGAACTGTC 
172 81 GGTGGTCCTC GGCGACTGGG AGGTGACTTT TCACCGGGGT AG ATTC AG CT TCGTCAACCT 
17341 CACCCGCCTG CAAACGTTCA AGGGCCACGG GGGCTACGCC AGGGTCCGAC TCCCCTTCTC 
174 01 GCTCGACCAG TTACTCCACC AACATTTCGC GTTCGGACTC GTGACGCGTC TCAAGGAACT 

174 61 GCCCCCCTTC TCCGACTGCG TGGCCCTTAT CGCCCCGTTG G ATTCCGG CG GCGACGCGGA 

175 21 CGCGGCGCGC GTGGCCCCCG GGTTCGTGCT GGACTCCTCT CGCCCGCTGA CCGTGTGGGT 
17 581 AAACGCGAGC GGGCGGCACA CGATCCGGTT CTGCCTCCTC TTTCTAAAGC CGATCGACCT 
17 641 GGAGCGCGCG GTCACGTACG TCTTCGGCGA GAACGGCGGC GCGCGCTCGG AGGGCACCCC 
17701 AAAGCCCACC TG CG CGACCG AAAGCCTGCC CGGTGGACCC CTGCGCGTCT CCGGCGAGGC 
17 7 61 GTCTCAGACG TCGCCCCATT CTTTCGTTGC GTATTTTCCC ACGGCCAACT CGGTGGCCTG 
17 821 CCTAAGCCTG TTGCGGTTAC AGGTGAGGCC GTTTTCGGAT GACGCGGCGC ACAGGGACGC 
17881 GCGGATCTCC CCGAAATACG TCACGTTTAG TAACTCCGGG GGTAACGTCT GCAAGGCGTC 

17 94 1 CGTTCACACG CTGTCCCCGT CGCGGTGTAA AACGGCGCAA ATGGAAATCA TCTACGCTCC 
18001 CGGGGACCCC AACGCCGAGA TAGTCCTGGG CCAGTCCGGA CCCGTCCTGC CCACCCACAC 

18 061 CGGCGGCCGC GTATTGGGGG TCTACGCCGA CGCCGAAAAA ACCATCCAAC CTGGAAGCTC 
18121 CG CGGAAGTC CGGGTTCAGT TAATCTTCCA ACAGGGAGCG GCCGCTCGGG GCGATCTGGC 
18181 GTTTCTGGTC ACGGGCGTGG CACCGGAGCC CCTATTCGTC GTCACCCCGG CACTCTTGCT 
18241 TTCCGGTTGC ACAACCCACC TG CG C CT ATT CAACCCCAAC GGTACCCCCA CG ACT AT AAA 
183 01 AAGAGACACC CTTGTGGCCG CCGCCGCGCC CTGCCCCGTG GTGCGATTAA GCTCCGCCGA 
183 61 CGACGCGCCG CGAGACCTCG TCGCGTCACC AGACACCGGG GCGCTCTCCA TTAACGCGTT 
18421 CACAATCCCG GTCGGTTTCC CAGGGGTGGT CTCGGCGGAG TGTCACGTGT CCCTACGCGA 
18481 CAACGGGGTC CACGAACGCA TGAACCATTG ACGGCAACGA TGGGAACACC GGTGCGTTTC 
18 541 TTTCGCGGCG AGTGGCAGAC CTCG AGTCTA GTGGACAACG GCACGCCACG GTACAGCTCC 
18 601 CTGGTGTGGG CCGCCACTAT TCACGACGGC TACCTGACAC TGGTGAACAG GTCAGAGCTG 
18 661 TGCGTCACGG AGAGGTCTCC GTGTCTGCCG GCATGCCCCA GCATCGGGAG ACTGGTCGGG 
18 721 AAGAGGTTTC CCGGCTTCGC CTTTGCCAGC GCCACTCTGG GCGATCGGGG AACACGCACC 
18 781 GTGTTCTACG CGTTCGGTCA CCGCGACAAC CCACTGGACA TAGTACCCGC CGTGGTCGAG 
1884 1 CGCGCGGATC GCGAG CTGGT GCTGCGGGTT CACGCTCCGC AAACAACGCG GGTGTCGCGA 

18 901 TACGGACTTA AGGTATTCGT GGCGATCGTT ACGGTGGTGC GCCCGCCCGG GGTGTTCCTA 
18961 CACTTTCCAC AAGACCGCGT TCCGATCGCG CTGACAGACG CGTGCAGCCA GGAGGGCTCC 

19 021 AGGCTAACCT CTGAAGAGCC GTGGATAAAA ATTCAAGGCT TTCCCGTCCT ATCTGACGAG 
19 081 ACCGCGCACC CATTTCTCTT AACCCAGAAG ACCAAGCCCT TTACCGAGCG AAAGTTTTGC 
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CGCCTGATCA 
CACGTGAGGG 
GCGACCCTGT 
CTCCCGTGGT 
TGCATCCCAC 
AACCGCAAGC 
GACTGCGAGT 
CCGGTGTACA 
TTCGGTGGCC 
CCGGGCGGGG 
TCCACGTAAA 
TCATTCTCGG 
ACCCTCGCCA 
TTAC AT CAT A 
GCGAAACGCG 
TTCATTAGAC 
GAGGGCGGAC 
CGCGTCGTCC 
AACCATGGCG 
CATCAATATA 
AGTGGAGTGG 
GCGCCCCCTG 
TCCAGCGGCA 
CAAGACGAAC 
GGCCGGTCAA 
CGCCCTGGAA 
GAACCCATTT 
CCCAACGGGG 
CCTCGAGACT 
TCAGCCGCGG 
TGTTGTAAAC 
CGTGAGCGAT 
CCATGTCGGC 
GACACAAAAC 
ACATAACGAT 
GGTCCACCCC 
TCCACTGGAA 
CCGCCAAAAA 
CGGTGGAGTC 
ACACCCTTTT 
CGAACACGGA 
TTAAATCCAG 
CGGCCGCGGC 
CGCAAACGGG 
TTACAACCGA 
AGGCAGCTTG 
CGTGGTCTCC 
AGATAATAAA 
AACAGGGGCC 
CACCGGCGCT 
ACCCTCTGCG 
CATTTTGAGC 
AGGCCCTCTT 
TTGTCCACCG 
TTACCGCGGC 
TAAGAGACCA 
GGCGCGGGTC 



TGGACAACGA 
TGACCGTGAC 
CCCTCACCGG 
CGTCCACAAC 
CCAACTGTAG 
TGACGGCGAT 
GGCCACCGAA 
TCAGCACCGG 
CGGCGAAACT 
TGACAGTGGA 
TTACTAATAA 
GGTCGCGCGC 
CACAAAACCA 
GCTATTGCGC 
CTGAGAACAA 
TCCGCGCCAC 
AGCACCACGG 
AGCAGCATCC 
TTCATACACT 
GGGAAGTGAA 
TGGCACAGGT 
GTCCTGTCAA 
GTCGGGGGAG 
CAGACAGGGA 
CGCCAGGTCC 
CGCGGTTCCT 
TGTGCCATCG 
GTGTGCTCAA 
CAGATCCGCG 
AAACGGACGC 
GTGGGCGTCA 
CAGATACGTC 
GGACCTCTGG 
GTGACAAGGA 
GCGCCGATCG 
CTGACCCTCG 
CCCGTACACC 
GGCGCGGGAC 
GGTGGAGCCC 
GGTGGTTAAC 
CCTGGTGCCC 
GTGCGCCAAG 
ACCTTCGGGT 
TATAATTTTT 
GGTGCTTTAC 
GGCGGGGCCG 
CCAAGCGTCT 
ATAAATAACA 
AATAGAAGCA 
CGTAACAGCG 
TCCCGAGCCA 
CCTTGGCACG 
CGGAATCGTC 
CCTGAAGCAG 
GAGTGTTTAA 
CTAAACCGGG 
CGCTCATGGA 



CCAGCGCAGC 
CCGCCCCCCG 
TAATGCGCCA 
GG CG AT ATT C 
CAAATTCGTA 
TATTAGCAAT 
CCGGGAGATA 
GACGCAGCTG 
GCGGCAGCTC 
CAG CC AAAAA 
ACCGTTTGCT 
CTCGCGAACA 
GTTAACGCCT 
GGCGCCCGTC 
CGTCCCGGGG 
CTATCTTGTT 
TCACCGCGGA 
TCAGTCGCGT 
CGGCCCCGCG 
CAAACGACAT 
AAACGAACCA 
GAGAGGCGCG 
GCGCCAGCGT 
ACATGACCGG 
ATTAAAAACA 
TCCAATCGCA 
GGTTTCGTTT 
ACGGCCATCT 
CGCGTAAAGT 
GGGGTCCTCC 
CCCAAGGTGT 
AGGAGGGCGT 
TACAG CTGGC 
GGGAGAGCCA 
TGGGGCCG CC 
TAGTTGGCGT 
GGCCCCAGAT 
CCGTGCGCGT 
CTGATAAACC 
AGAGGAAACT 
ACGCCCGTCC 
TACTGCAGCT 
GCGGAAAAAA 
TTAACGTAGC 
AACTATAACC 
ACCACAGGCG 
CCCCGAGCGT 
TGTTTATTTT 
ACGTGCGAGG 
TAAACAAACA 
TTCGCCCAGC 
GCCGCGTCTG 
AATCAGGCTG 
TTTCTTCACT 
TATCACGGCG 
TGGCGGCAGA 
GCCCATAGGa\ 



GCCGTCAACA 
GAAACAATCG 
ATCGCCTTTC 
ACGCCCGTGG 
AGGTACGGTA 
CACGCCCACA 
GAGATTTTGG 
GGGCAAGCCA 
CTCGGCCACC 
CTGTGTAGGT 
CGTATCGCTC 
CACAAGGTGG 
TCCGTTAGAT 
CCGCAAAAAC 
GCTCTGCCCT 
ATTTACGGGA 
GCCAATGGCC 
CAGGTAACTC 
AAACTCGCGG 
GATGCGCGGG 
GCGCTGTATG 
GTTGCCGGCC 
AGGGCTGGCC 
TTAAATTACT 
CACCGGCCGC 
AAGAACCGCG 
TCAGATACGG 
CCATCTCTAT 
CCTCCAGACG 
GCAGCTGAAG 
GCACAAAGTC 
AGCTGGCGAT 
AGGACAGCTC 
TCCGCGCGAG 
TGTTAATTAG 
CGGCCCCCCT 
CGCCCTCGCG 
CCCAAATTTT 
ACAGCAACTC 
CGTCCCTGAG 
TGTCCTCCCT 
CGCCGTGGTC 
CCCGGAGTCG 
AGATACCGAG 
GTAAACGGCT 
CGGCCCGGCT 
GAGCGTCTCC 
TTATAAACTT 
CGGCATTCGC 
ACTATTAGCC 
TGCGTCTACG 
CGTTAATGGT 
CGCCGGGTTC 
GCGTCGTCCC 
TCCACCGAGC 
TGAGTTACAT 
AAGGCATAAT 



CCGTCTACCT 
TCACCGACGG 
GCCACAACCC 
TGTACGTGGG 
ACACCTACGT 
ACGGCGGGTT 
TAACCAACGT 
TCTTCGTGTT 
GATCGCGCGC 
TTGAGACCAT 
ACACAACGCC 
CTCAAACACC 
GCAGTTTATT 
ATCTGTAGAT 
CCCAACGCAC 
AGCTCCTCAA 
GCGGCGTCCG 
TCGTAGGCCC 
CGCTGGCACT 
TACTCACGCA 
TTTAACCCCA 
CACTGGGGGA 
GCCCAACACG 
CGCTTACAAT 
CAACCCCCAA 
TCACAAAAAG 
AAACGGCTCG 
GTGGGGATGG 
CGCCACCTTT 
CAGCAGGGGA 
TCCCGGGGTC 
GTTAAACGGG 
CCCCCGAGCC 
GTCCGCGGGG 
ATCCACCACG 
GTACTCCGCC 
GCGGTCCCCG 
TACGCCGCGG 
CTCCACGACG 
GTTATATCGG 
CTGCACGCCG 
GCCGCGGCAC 
TGCGGCCGCG 
ATGCACCAGG 
GAAGACGAAA 
GTTAGCGAGT 
CCAAACGCCA 
AACGCGCGAT 
GCGCTCAAAG 
ATTCGTAACC 
TCGAGGGTGT 
GTTTCTAAAT 
TTCTG CC CTT 
CGGGATTGGC 
ACTGCGACGA 
ACCCCAAACA 
CAACTACACA 



GGGAAAGCAG 
CCCCGTGACG 
ATACTTTGAA 
CCTGACCGTG 
CTCGGCATTT 
CCGGATTCAG 
GTCCCAGGCC 
CGCGCCGCGG 
CCTGGAGCTG 
GTACCTGTTT 
AAACCGTCTC 
CCCCTCCCGC 
TATTATTTTA 
ATTCCAGTAT 
GCACGGTTTT 
TTAGAGAGTC 
CGTCCCCGGG 
GGAGCCCGCG 
CAACGTTAGA 
CCCGACAGAG 
C TT CAG A CAG 
GAACGTTAAT 
ACAGGTAAAA 
CGCGGGCGGC 
CGCGGGGCCG 
GGCTCGTTTT 
TCCAAAAACA 
GGGTCGTAGC 
CTCAAAATCT 
TCAACGTGGT 
AGGCCCGTGA 
ACCCCGAGGC 
ACGTAAAACT 
TTCCACGCGC 
TAACGCAGCT 
CCAAAATGTC 
AAGCCCTGCG 
CGCGACAGTT 
CCCCTCCAAA 
GCCTGAAGCC 
TGTTTAATAA 
GCGCAGCGGA 
ATTCCAGCCA 
ACAATCATAA 
CTTATTTTAA 
AATAAACCCA 
AAAACAACAC 
TATGCTTCTT 
GCAACCGCAA 
GTAAAATCCA 
TTCTAAGTCG 
CCCGCTGCCA 
TTTGGGACGC 
AC AC AG CTTT 
GGTGTGGGAG 
GCAGAGTTCG 
CGCGAACACC 
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22561 GCAAAAAAAA CGCACACGAA AAGGCCCCTC ATGGTGACAA AAACACAAAA CAAAGACGGG 
22 621 GTCGCTCGTG CGTTGCCACC AGCGCCACGC TCGCCGTGCT CGGCCACGGT TGACGGCGTC 
22681 CATTAATACC CGCGGCCCGG GCCCGCCTCG ACGGAAACCA GGCGTGGTTA GCCACCTGAC 
22741 GCACGTGCCT CCCGTCCAGA TGTGGCATTG CGAGTCCCAA AACGGGCGTG TCTAACTACG 
22 801 GTCGCCCCGG GGTTCAAAGA ACGGGGATCC CCCAAACAGG TAAAAAGCTT TTTGTCCAAA 
22 861 CAGAAAGCCA ACAGCAACCG AAAGGAATCT CCAAAACAAG CAAGGGTCCC GTGCCGGCGA 
22 921 TAAACGGAAA ATTTCCCTCA CCTAAACACA CGCGCCGCGT AAACTGTTAA AACACGCAAT 

22 981 ACTTCCTAGG CCTGTGTTTA ATAAAACACA CAAGGGTTGT AATTTCGCGG GGCCCTTTGG 

23 041 GCCCCGCGCG TCTGGGGGTG CGTTGTCTCG TTGTGGTTCG CAATCACCAC CCCGCCTGCC 
23101 CGAAAACAGG GCAACAACCC CCTGGCCTAG TTTTTTAAAA ACTTAACACC GGCAAGGGGA 
23161 GAGGGAGAAG GGGTGCGGCT AAATGGGCTG TTAGCAGCCA ACCGCAGCGG GGAAAAAAGG 
23221 GGGCAAACGC CGGGTTAACA CAAACAGCAA CAGGCGCCAA ACCCAATATA ACACAACGCA 
232 81 TATTCCGTGT GTTTCGCGGG CCGGGGTCGG TACGGCTAAC CCGCGTTGGG CGTTTG C AC A 
2 3341 AGCACGCCCC CGCCCTGCAC CCGGCTCCCC GAGGTCACCG AAGGGTCAGG AAACATAGTT 
23401 TTCACGGTCG CCAGTCGCGC GCGTGGCAAC ATCTTTCGGT GCCCTCCGAA CGACCCGGAA 
23461 AAATCCCCGC CGCGCCGGGT CGTTCGGAGG GCACCGAAAG ATATCCCCAA ACGCAACCTA 
2 3521 AAG CATCATG TTTGGGGTTT CGGTGACGCG CGCCGCAGAG GAGGCCGGTG GTGGCGCTGG 
2 3581 CGAAGATAGT GGCCGCAGAG CACGGGTTTC GGTAGTGACG CGCCTGGACG GCTGCCACCT 
2 3 641 GGCAGCAGGC CAAAAACACG GGCGCCGCGA GCGGCCGGCA GGTGGCGCCG CGCTTTCCCC 
2 3 701 CACGCCGCGA TTTGGCAAAA TGGCGGACCG GCAGCGCCGG TTGGACGCGG CGGACGTCAA 
23761 TCACGACGGA GAG AAG CG CA GTTGGTCAGT TGTCCCGTCA ATCACCGAGG GCCGCGCCGG 

23 821 TTGGCGGATA ATATGAAACC GGCGCGGCCA TTGGACGCTG GCGGCAGCCA ATGGGCGCGT 
2 3 881 TGGGTTTTGT TTACAAGTTC CCTATATATA TTATATTATA CCTTCCCTGT TAAGGGAATC 
2 3941 CCATGTTCTT AAATCTAATC AGCTGGGTTA TATAATTAGA ATATCAGTAA TTACTATTAT 

24 001 TACTGTATTA TTTATTAAAT TATTTATTTT ATTATTTATT TAATAAATTA ATTCTTGCGT 
24061 TATTATGTAA TCAGATACTG GGAAATCTAC TCCCAGTTTA ATAAAAATAA ATAATTAATT 
24121 CTATTATGGT GGGGGTTCTA AATATTGTAC AAAACAACAA TATTTTTTAA ATTTATTCAC 
24181 TAAAAATGCT CCCATTTTAC TTGTGACGTA CATGTTAGGA ATGTGGGAGT GGTTTGTAAC 
24241 TATGTTTTCA AACAACACCC TTTTGGGCCG TCTGAAACTC ATATAAGCGG AAGGCTCGGC 
24 301 GTTAATTGCC ACTGGCCGCT AACCAGTCCA CCTTGCCAGT TGGAGTTTTA TTGCTG CTT A 
24 361 TGGCTTTACC TCGCATTCCG GGCACAAGCT GATTTACGTA GCACGCATTG GGGGTTTAAC 
24421 CCGGTTGAAT TTAGTGGTTT GCACGTGTAA AAAAGCCCAC CGAGGCTTGG GTTTAAGCGT 
244 81 TTAAAG CGTT GCTGGTTTTA TAACATGCGA TTTAATTGCT ATGGGAGCTA GCTTGTATTC 
2 4 541 CGCATGCTTT CGGTTTAGTA GTTGTACAAA ACAGAATTGT AGCCCGCAAA TTGTTACTGG 
24 601 TACCCAAAAC GGCAAGCCCG CTTAACACGC CCCGCGCTGC GGCTTTTGTT CCTAGCCCGC 
24 661 TCGCAATACA ATCGAGATTT TAATTTAAAA TGGGGCAATG GCGTAGCGCA AATTTAAGCG 
24721 CGTCAAAAAT TTAAAATGGA CTAACGTAAC CGGTGCTTTT AGTGACCTAT GGCGAATTTT 
24 781 AAAATTTAAA GCCGTGGGTT TTAACACAGA GCTGCCAGCT TGTAATACGT TG CAT AT AC A 
24 841 GTGCCGTTGT GATTTTATGG TTAAGTGGAT TTTAACATTG AATTTTGTAA TGGTGTACAC 
24 901 GTGGATTTTT AAATTGAATT TAAAATTGTA ATTACGTGCG CGTGGATTTT AAAATTGTAT 

24 961 TTATGTAACA TTGTTAAATT TTAACTG CGT TATGGTTGTA TTTTGGTTAA TTTATGTAAT 
2 5 021 TTTGCTTTTA GATTTGTTCG GGTGTGAATT CCAGACAGGT AAG AG CCTAG CTC CTAATGT 

25 081 TTGCCTTGCC GCCTAGCTCC TAATGTTTGC CTTGCCGCCT AGCTCCTAAT GTTTGCCTTG 
2 5141 CCGCCTAGCT CCTAATGTTT GCCTTGCCGC CTAGCTCCTA ATGTTTGCCT TGCCG CCTAG 
25201 CTCCTAATGT TTGCCTTGCC GCCTAGCTCC TAATGTTTGC CTTGCCGCCT AGCTCCTAAT 
2 5261 GTTTGCCTTG CCGCCTAGCT CCTAATGTTT GCCTTGCCGC CTAGCTCCTA ATGTTTGCCT 
25321 TGCCGCCTAG CTCCTAATGT TTGCCTTGCC GCCTAGCTCC TAATGTTTAA CAACATTAAT 
253 81 GTTTAAGCAC ACTAAAATTT AAAGACGTTT GTGTTGGTTT TTATGACCAG CTTGGTACAA 
2 5441 AACCTGCTGG TGATTTTTTA CCCAACAAAT AATAAATAAA AAAGTTAAAA CTTATTTCTG 
25501 TTGTCTGTGG TAATTGGGTT CCCCGAGGGT CCCGGGCTCC CCGTTCCCCG AGGGTCCCGG 
2 5561 GCTCCCCGTT CCCCGAGGGT CCCGGGCTCC CCGTTCCCCG AGGGTCCCGG GCTCCCCGTT 
2 5 621 CCCCGAGGGT CCCGGGCTCC CCGTTCCCCG AGGGTCCCGG GCTCCCCGTT CCCCGAGGGT 
2 5681 CCCGGGCTCC CCGTTCCCCG AGGGTCCCGG GCTCCCCGTT CCCCGAGGGT CCCGGGCTCC 
2 5741 CCGTTCCCCG AGGGTCCCGG GCTCCCCGTT CCCCGAGGGT CCCGGGCTCC CCGTTCCCCG 
25 801 AGGGTCCCGG GCTCCCCGTT CCCCGAGGGT CCCGGGCTCC CCGTTCCCCG AGGGTCCCGG 
2 5 861 GCTCCCCGTT CCCCGAGGGT CCCGGGCTCC CCGTTCCCCG AGGGTCCCGG GCTCCCCGTT 
2 5 921 CCCCGAGGGT CCCGGGCTCC CCGTTCCCCG AGGGTCCCGG GCTCCCCGTT CCCCGAGGGT 
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CCCGGGCTCC 
CCGTTCCCCG 
AGGGTCCCGG 
GCTCCCCGTT 
CCCCGAGGGT 
CCCGGGCTCC 
CCGTTCCCCG 
AGGGTCCCGG 
GCTCCCCGTT 
CCCCGATTCA 
TTGTACATAC 
GAAACGCGTA 
GTTTTTTGGT 
CCGCAGTGTT 
TACCTTAAGT 
GGAAGAAAAT 
CTGTGGGCTC 
TATCAGCGCC 
TTACCGCTCT 
CGGACGCCAC 
TAAGGTTAAA 
GTACGAGGCG 
TTACTGCGAG 
TGCGGCTGGT 
ACCGCGTATT 
AAACAAGCTA 
TTTTTTGTAT 
ACTGTCCGCA 
GAATGGCGTT 
CGAGACCGGC 
GCGCGCTCTG 
GCTGGTACGC 
GTGGTTG CGG 
GGTCCCTCAG 
GTATGCCGAC 
GCTTCTGGTG 
CATACGGAGA 
GTGCGTACGA 
CCGGCATCGA 
GACTGCTCTG 
GGGCGCTCAT 
CCTTTAAATA 
TGCGATCTCG 
CGAATTCTGG 
CGGCTTCCTC 
AAACGGCGAT 
GTGGGTGATT 
GCCATGTGTG 
CCGCTCGCGA 
TGCCCAGGCA 
CTTCGGGGAC 
CGATTGAGGG 
ATCCCCCAAC 
GCGCCGGTCT 
TCCCTGGCGA 
ACGTGTACCT 
GGCGTAAGGT 



CCGTTCCCCG 
AGGGTCCCGG 
GCTCCCCGTT 
CCCCGAGGGT 
CCCGGGCTCC 
CCGTTCCCCG 
AGGGTCCCGG 
GCTCCCCGTT 
CCCCGAGGGT 
ACGTTAAAGT 
ACGCGGGTGC 
GGTGCTTTAG 
AGCGAATGCA 
ACCGCGGTGC 
TTACGTGTAG 
GAAAATTCTT 
GGGCAGGCGG 
TTTCTCAAGG 
CTGCGGGGAC 
AACTGGGGTC 
GACGACGAGC 
CTGGAGTCTC 
CGAATACTGA 
CTTCTGGCGC 
CCCCCACCCT 
ACGTGGTATT 
CTGTGACACG 
TTCTGGATCG 
TGACATTACA 
GTGGTGGCTC 
GTACGGCTGT 
TAGGTACGGC 
TCGTTGGTCG 
ATCCGCCTGA 
CTCCTCGCCT 
GCGAGACGGT 
GGGTTGGGGG 
TGGGACGGTC 
GGGTGGCGCA 
CACCATGGTC 
GGTTCGGGTG 
CGTACTTCTC 
GATAAATCCG 
CGGTGTCACC 
TCGCGTAAGA 
TGACCCCCTA 
GGGGTCTTGC 
TAACATCTCC 
GTCCCCCGCG 
AAATATCCCG 
GTGCTCTATG 
CTCTAAATAC 
GTACACGGGA 
TCGGGGGCTG 
GCTCAGGTTT 
TCTGAAGGAG 
TCCGGTGGAG 



AGGGTCCCGG 
GCTCCCCGTT 
CCCCGAGGGT 
CCCGGGCTCC 
CCGTTCCCCG 
AGGGTCCCGG 
GCTCCCCGTT 
CCCCGAGGGT 
CCCGGGCTCC 
GGTTTATTTA 
GCATTGGCCC 
TGTTTCATGG 
ACCAGGTAGC 
GGCATACGGT 
CAATTATGGC 
TGCCAGTTGA 
AGTACTTGCA 
ACAGCGCTCG 
TAAACGCCAT 
GCGCTTTGGC 
GTCTTTTAAC 
AGTGGTTTCG 
GGCATCGCCG 
TAGTGGCGCT 
GACCTTAAGT 
CGTTGGAACA 
GGGGCCGGCT 
AGGGGCGGGC 
GATTCTTGGG 
GCCGGCTGCG 
GGGG CGTGGA 
GCAGACGCGT 
GCGTCCTGTG 
ATGTCCAATA 
GGAAACACGG 
GCCGGAGCGT 
GCAAACGCCG 
AGCTCAGGCG 
GAAACGTACG 
AG AAAGGTG C 
TCGCGGTTAC 
GCGACTCTGG 
GCGTCGATCG 
TGACGTTCGC 
GAATCAAACT 
CGCCTCCCCA 
AGCTCTTCTG 
AACAGGGGGT 
AGCCGTGACG 
TGTGTTACTT 
TTTATCGGTA 
AGCTCCTTTT 
GTCATGTTGG 
ATGTGGCGCG 
CTTCACCTGG 
GCAATTGCCA 
TTCTGGAAAA 



GCTCCCCGTT 
CCCCGAGGGT 
CCCGGGCTCC 
CCGTTCCCCG 
AGGGTCCCGG 
GCTCCCCGTT 
CCCCGAGGGT 
CCCGGGCTCC 
CCGTTCCCCG 
G C AAACAC AC 
GCCCATGCAA 
CACGTTTTCG 
TCGCCTGACC 
TTAACGTGCA 
GGCTGTTCAG 
CGTTTATGCT 
CCATCCCGTG 
CCTGTACGAG 
AGGCCAAGGG 
CGTGTTGGGT 
GTTCGCCATA 
TTCACACGGC 
CAACGCGAGG 
GTTTGCCATC 
GAGCCGATTG 
TTTTTTATTT 
GTGCGCTGGC 
ACGCGCCAGC 
CGGCGGGGGC 
GGGGCGGTGG 
GATGGGTGTC 
AGGTCTGTGG 
CGCCGGAGGT 
TGTTTTTGGA 
GCTCGTCGGA 
AGCCGTACGT 
ACTGGAAAGG 
GCGGTATCCA 
TGCGGTGGCC 
TCTTGGGAAT 
CGCAGTGCTG 
CCACGCCTTT 
CCTTGGCCAT 
AAAGATCTAA 
TGGAGACGAC 
GTGCGCACAG 
GGTGGAGGGA 
CCCGAGG CAT 
CCAGGGCCAA 
GGAAGAGACC 
GAGGGTCCGT 
CTATCTTTGG 
GTCGGTGTCT 
TTATCCGTTG 
TGCTGTGCGA 
AT AC CGG AAC 
TCGTTTACGA 



CCCCGAGGGT 
CCCGGGCTCC 
CCGTTCCCCG 
AGGGTCCCGG 
GCTCCCCGTT 
CCCCGAGGGT 
CCCGGGCTCC 
CCGTTCCCCG 
AGGGTCCCGG 
GTTTTTTGCC 
AAAAATAACG 
TGTCACGGTA 
GGGAAAGGGA 
ACTGCCGGTC 
GGCCCTCCGC 
ATAGAGGGCA 
TTTAGCCCTA 
AGG CTCTTGC 
ATG CTG C AAA 
CTTGGTGCGT 
GCCGTTCTAC 
GAATGGGAGG 
AGACACATGT 
AGG CGATAGG 
TATATGAGAC 
GAGCAGTTCC 
CTCCACCGGG 
GCGGGGGGCA 
TGGAGTTGCC 
TTGCGTTTGC 
CGTTCCGGAA 
CCACGAGAAC 
CTGCGACGCG 
CAGCGCCATG 
AAGCTCCACG 
CGGAGAATAG 
GAACGGTGGC 
CGATGGATAC 
CTGGTTGCGC 
GGTGATGTTG 
AGCGCAGGGA 
ATCTGTTTTG 
CAGTGTCTCA 
ACAATTGACC 
CCATGTAGGG 
AGAGACGTGC 
CGAGAGGGAG 
GGGGTTTAGC 
GAAATCGTGG 
GATTGTGTGA 
ATACGGGAGC 
TAGGCTGACC 
GCGTGAGCCG 
TGAGAATTTA 
AATGTACAAC 
GAGGGACGAT 
TGGGTTAAAG 



CCCGGGCTCC 
CCGTTCCCCG 
AGGGTCCCGG 
GCTCCCCGTT 
CCCCGAGGGT 
CCCGGGCTCC 
CCGTTCCCCG 
AGGGTCCCGG 
GCTCCCCGTT 
GCGGTTTGGG 
TTGGGTTTCC 
GGAGTGGTTA 
ACGTTTTAAA 
CCGCGCGTGG 
CGCCCCCTGA 
TCTTTCTTTA 
TTAAGGAATT 
GCCACACCGA 
TAAACACGGA 
ATGTGGTGGA 
CCGTGTACGC 
GACTCAGGAA 
GCTACGGAGT 
TGTG CGTTT A 
CAATAAAACA 
TCGCAGAACA 
GCCCGCGCGG 
GAGGCTGAAG 
GGCTGCGTGG 
TGGATGCCCT 
TACCACTGCG 
TGAACGGGCG 
GCTCGTTTTA 
ACGTCCTTGT 
TCCCGTTTGC 
TACGTGGCCG 
GCGACGGGCG 
GCCGCCGTTG 
ATGTTATCGA 
TCCTCCGGGA 
AACTGGCTTG 
AGCAGGTCTG 
AGAGGGGCCG 
GTTATTTTAC 
TCTGGTCCGT 
TGAAACGCGG 
AGCTCCGGGA 
TGGATCTGGG 
CTCGTTAGCT 
CCTACGTGGG 
AGGGTCGCCA 
ACGTCCACGT 
AAGGAGATGA 
AATACGTTTC 
TACGGTCTTA 
ATTGTCCTCG 
GAGATGGGGG 
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2 9401 TTTCTGACGC AACGTTGCTC TCAGAAACCA AAAGGGGGGC TCTGTGGCTT TACTTCAACG 
2 9461 GGCGTCCGTG CCTGTTAAAA GGTCTAGGCG ACTACGTTTT TTGCCAACTG GGGTTAAGCC 
2 9521 ACAGCGTACG CGTGGTTCCG GAAAACCTAA CGGATGGGAA TTATCTCTAT AACCTAGGCA 
2 9581 GCGTCATACC ATGTCGCCTG CTTGTGGCAC TGAGCTACTG TTTGGCGTTT TGGGGGCACG 
2 9641 CTGACCACGA GCCCTGGGTC AGGTTGTTCG CCGGCAAGAT TTTTATACTG TACTTAATAA 
2 9701 TTTCCGGCCA CATTATGCCA CGGAAGTCAA TCTTAGAACA GGTGGGAACG TCCGGGTACG 
2 9761 GTGGCTTCGT CGAGGCAGTG TGTCGCGATG TTCGCGCGGT ACACGGGATT CCGGCGTGGG 
2 9821 ATTTTGCGTC CGCGGCTCCG GCGTTAACGT CTCAACAGAC GGACTACTTG TTTGCGTTCA 
2 98 81 ATAACAGCGT AGTGTAGGGT CGCCTCACAC TACCGACGTG GCGACCGGCA GGTATC C C AG 

2 9941 GACCAAAAAA TAAACTCTGT CGTAGTCGTC GTAAACTCCG AATTGG CTTT TCATTATCTC 

3 0001 GTGTGC CAG A CCGGGGGGCG ATGGGTGCGA AAGGATGGCG GCCAGGCCTT CCTCGGCCGT 
3 00 61 TAAACGTAGC GACGTTTGAG CCTGAATCAG GCTGCCAGCG TCCCTGAACG TGAGCTTTTG 
3 0121 GTTCAGGATG TCAAAGATTT CGTTAAATCT GGAACCGGTC AGGTTGACGA ACGGACGAGT 
3 0181 GTGTTTTAGA TGGGATATGG CCTTTGCCTC CAGCGCCGCT AGATACGCAC CCGGGAACAA 
3 0241 AAACGACGTG GGGGCGTCTG GCGCGCGCGA AAGAATTGGG GAGATGTAAT TTTCTGTCAG 
3 0301 GAAATCAAAA AGTTGTCCCT TTTTAAGGTA TTGTTTGGAC CCGGACGGGT CTTCTACCTT 
3 0361 GGAAAATCTC CCACCCAGGA GCGAGGCCAG GTTGAGGCGC CGGTGGGGCG ATCTGGGTGT 
3 0421 GATTTGGGCC AGCTCCAATA GCACGTAAAC CAAAACTAAA CAGCCTAGTG CCGAGAGGCC 
3 0481 GGCTCGGAGG TTCCATTGGT ACGCTTGAAA GGGAGGCACG TCTTCGCTGA ACAAAGTGGT 
3 0541 CGCAAGGATG TATACGAGAT CTGGTCCAGG TGGCTGACCG* GTACCGGGGT TCTTGGAACC 
3 0601 GGGTAGCAGT CCGCTCTGTG CCAACGTCGC GTATATCACG TGGTTTGTAA AGAAGTCCGC 
3 0 661 GTTGTACTTT GTTTCTTTGT TTAGGGGCGC TAGAGAGGCG CGTTGCCTCG GATTCGAATA 
3 0721 TGTAAACTCG AAACCCAGG C CCTTCGTGTT TTGTTTTAGG TGAGAGACAA GCGTCGCCAG 
3 0781 CTTTGTGCCG AGGTTATCCC AGAGGCCGTT TCGAGTTGGC GGCAACGGGG TACCTGTCCG 
3 0841 GTGACAGTAG TACCCGTTGA GG AGG C AC AG CGCGGCCTGC GTGTCCGATA CGTTGCGGAC 
3 0901 ATCCGCGTAA AAGTCTTGTG TGATCGAGGC GGCTGTTGAG ATTACGGCAG ACATTAGCTT 
3 0 961 TAGAGACGTG GAGTTTTGAA AGTTTGTTGT GCCTCGCAAA TCTCTGGGGA ACACCCTGCG 
31021 TTGCATGGCG TTATCTGTGA GGCTACTGTA CCACGGCCCG TATTGTGGCA GCCATTGGTT 
310 81 TTGGTTCATG TACAGGCTTG GCAGGAGCTC CAGACGGAGA TCCTGGTTTA CCGTGAATCC 
31141 CGGGTCTCCG GGGGCTATTG TGACAACGAA CGCGTCTCCA TGTGTCTGCG GGGCGATTTT 

312 01 GGGTTGCGGG TTTAGTGATG TTTGGGCATT GGCGGTTTCC CTGCCCGGGG ACCGCAGATC 
31261 CCCGACCATG TTTTGAATTG TTTCTAGGTC GCGCATGACC ACTTCTGTTT CCCGTTGGTG 
31321 CGTTTGCAGC ACGTTGTCTA GTTCTAGCCG GAGCAGGTTT TTTCGCAGCG TGTGAACCCT 

313 81 TAGCGCCAGC TGTTGATTTC GCATCACAAA CGCGTTTCTT CTTTCTTCTA TCAGAGATGG 
31441 TTTGTGAATA AACAGAACGT TGTTTGGATG GGGGGCCCAA AAAGTTG CGG TAATTTGCCG 
31501 CGCTGGTTTT CGTGGGTATC TC AT AC AG C A TTTTTCTGAC GTGCGCATCT TCCCATTGGC 
31561 TAAAAAACCC GTTAAGCGCC GCCGACGTGG TGTTGATAAA TGTAGGCGGA AACGCTGATG 
31621 ACTCAGTATG AATTGTTTTA AGTGTTTTCT GGTTCTTAAA TAGCAGCCAG GGCCTTACCG 
31681 ACCAGTTTTC GTGTC CTCTT GGCGCGGCTC TGCCTATATA TGCCACGGAA TCTGTCAGTT 
31741 GGTGTAAGCC CTGCCATCGC TGTGATATTT TTATTGCGTT GCCGGGAAAT ACAGCCGCGC 
31801 TTAGG CACGT CTTGAGTTCA ATTACGT AG C ACGTTTTTTG ATGTTCG CT A TTAACGGTAA 
31861 GCACACAATC TGCTATTCGC CTACCTAGGG AAACCTCGAA AAAAATCCTA AATTTAACGT 
31921 TTTTGGTTGC CCTGAGTGTT GGGCGAATTG AAAGAAACTT TAAAATATCA TCAAGCGACC 
31981 GGTGTTTGAT TAACCGTCGG TATACTGCTA AGTGGGCGCG AGTTCCAGCC CGTTTGCGTG 
32 041 AGGCCGGAAG GGCTTCGAGG GTCGCGACGT GCTTACATTG ATTTGCAAAC GCCATGGCCG 
32101 AAGGAGGGTC TGGCTTTGGC GACGAGCTTG TCCGTCAGAT GCGGGATCGG AAGCCTAGAT 
32161 GGGACGAGTC TTCCGACGAT ACTGATGACG TTGACACCGA AAGCACCGAT CTTGAGTATG 
32221 ATGATGTGTT CCCCGTGGTA GATACGCACG GCTTAATGAG CCCTGGAAGT CAAAACTATG 
322 81 ACGTACCCAC GTCTCCGTCC GGAACGCCGT GGGAATTACT GCACCCGGAC GCCCTGTATG 
32341 CACATCCGAG GTGCCCGCCT AAAAGGGCGG TGGTGCCGGG CGGTGGCGCG CGTCCCAAGG 
32401 TGTCCGCGTT CTCGGCTAGA CTTCAATATG TTGGACGACA GAGTTTTGGG GATAGAGAGA 
3 2461 CGCGGCAGCT TACCGGGGCT CAGTTTTCTT CCGAGAGCGA GCACGAATAT GCGGAAATAC 
32521 CGGAGCGTAC T ACT AC ACG C CCGGTTGAAA GCGGTGACAA GAGAAATTTC ACCTCCGGGC 
325 81 GTAGGGGTGC AATCTCGGGA CCCTCGTCGA CAAAACCTAG TCACGGTGCG GGGTTAACAC 
32641 GTAAGACTAA AACGTCGCTG AGTGTTAGTC TTAAAAACTT GCTGCGGATA AAGGACGATG 
3 2701 ATGCTAAGGT TGATGTACCC AGGCCGGTTA CCGTGCCGGT TCATCTCATG CAGCCGCATC 
3 2761 CTATGACGGA ATATAGAAAT GCCTTTTTAA TCTATCTAGA GGGGGTCATG GGTGTTGGAA 



10 



WO 00/28040 PCT7US99/26260 



3 2 821 AAACAACGCT ATTGAATTCA ATGACTGGCA 

32 881 CCGAGCCCAT GAAATTTTGG ACGTGTGTTT 
3 2 941 TAG TTAAGCA AGGCACCCAC GGGAAATTGA 
3 3 001 GCAAGTTTGC GCTACCGTTT CGTGCGACAG 
3 3 061 GGCTGGTGGG AAACGGTAGC ACAAAGCCAG 
3 3121 TGTCCGCAAC GGTGGTTTTT CCGCTGGTTC 
33181 ATCTGTTTCA AATCCTATCT CTTTTCTCGG 
3 3241 CGCTCAACAG TTCGGAGGCG CACAGGCGCA 
333 01 GAATCACGCA AAACTACTTG CGACAGGTAG 
33361 GGGTGATGAT GCAATATCTC ACACCGGAGC 
3 3421 CCATAGAGGA CATATGCAAT ATGAATTCCA 
3 3481 AACTACATGA ACAGAGTATG ATACCCATGG 
3 3 541 ACGTGACCTT AATGGAGGTC TGTTTGGGAC 

33 6 01 TAATTGTTGA CGCAGGAGAA CATCTAGATG 
33 661 GGCAGGTGAT GTCAAATGAG GCTATTAAAC 
3 3 721 GCTACATTCA AACGCTAACC AAATTGGAAG 
33781 TTTACTATAA TAAGGTGTTC TGTAACGGAC 

33 841 GAGTTGGAAT TTAATGGAAC AATTTAC CAA 
3 3 901 ACATCTATTG TAATGGAAGA TGCGTGGTAC 
3 3 961 ACTTTGGAAA AAAGAAAAAG TTTGCTGCGA 

34 021 TATACATTTT GTAAATCTTC AAGTGACCAT 
34 0 81 AGTTCTTTGC CAGGTTTTAC TGGGAATTTT 
34141 GTTTTATTAA CGAAACGTGA ATTATTTACC 
34201 GCGGAAAAAA TAAATGCAGA AATGTTCAAG 
342 61 TCCGGCATTA TAACAGAAAA TTGGATATTA 
34 321 AATATGCAAT GTGTTGCACT GTTGTTTGGT 
34 3 81 TATGTATCTT ATCGTGATTT ATTGGTCGTT 
34441 GCGCCGAAAT CTTATAATAC TCTTGATTTG 
345 01 TTTGTTTCAG TTATAGATAG TCCGCTTAAT 
34 561 GCTATTGAAG CAAAGGGTGC CTGTCAAAAT 
34 621 TTTTTTGAAG TTACTGCAGT AAATTTTTTA 
34 681 GTTAACGTTG, GGTGTGTTGT ACGGCATGTT 
34 741 AGGCTCTGCT ACCCAACGTT TAAGTTATAT 
34 8 01 GCAGAGTCCC AGGTTTTTAA CTTGCCGGCA 
34 861 CAAGAGGTCG TATTTTCAAT GTTTAAAATT 
34 921 ATTCTAAACG AAATTGTATA TATTACTAAT 

34 981 CAGCTAACCA ACACATTTAG GCGGAATGTA 
35041 AAATTAAACG TCACAGATTC TTCTGTGTTT 
35101 AATAATGTAG AG AT AT C ATA TATGATTAAT 
35161 TTTCGCGTGT TTTCGCCTTG TTTTTTAAGC 
35221 CGATCCGACG CACCGCAGAC GTCAAAAAGA 
35281 GGTTTTTGGC GGCTGTTGCA CGCTTTTCAC 
35341 AATTGTACCA GGCTGGCATG GAAGCAAGTA 
354 01 TACGTCATAA GCTCCGTGCG CCCTGATCAC 
35461 CTTAACAGCG CGATGTTTGT CTCGGCCGTT 
3 5521 GGTACCGCCC TACACATTCC GATATTGTAT 
3 5581 CTGTGTGATT CGATTGTTTT GAGCTATGAT 
35641 GTGTCAAACC CCACAGTTCA AGCCAACCTG 
3 5701 GATAATTTTC ATATCCATTA TTTGTGGTTA 
35761 GGATTGTACA GAAGACACGC ACTCAGTGCT 

35 821 ATGTCTGCTC TGTATTTTCT GTTTAAGCTG 
358 81 GCGTAAAACT TTAAAAGGTT GTTTCCCGTT 
35941 TTAGCGAGAA CCTCGGATAG AGGCGTGCCG 

36 0 01 ACTTGAATGG CCAGCTCTCG ATTTTGGGGA 
3 6061 GTTAATAATC GCTGCGGAGC CCGCGACAAA 
3 6121 TCTATGGAGT TTAAACAAAA CACCATGTCT 
3 6181 GATTCGAAGT ATTCTCTCTG ATACGGGCCC 



TGGTGCCGCA GGAAAACGTC TTAAGCTGTC 
ATTCAAATTG CCTTAAAGAA CAGCGCAGCA 
TCACTTCTGC TCGCGTATAC GCGTGTCAGA 
CCGCCGGCAT CGGTCGCAAC CTGCAACCGT 
CGAATTGGAT TG TTTTTG AC AGACACCTAC 
ACGTAAAGTA CAACAGGCTA ACGCCGGATC 
CACACGACGG CGATGTGGTC GTTTTGCTAA 
TTCAAAGTCG AGGCCGTAAG GAAGAAAAAG 
CGTGGGCGTA CCATGCCGTG TTCTGTACGT 
AAATGGTTCA G CTGTGTGT A CAAACTGTGT 
GATTGACTCA TCGGTTTCTA ACCCTAACAA 
TGGCAGAAAT GTTAGTATCG GTTAAAGAAC 
TCTTTAAAGA GCTACGAAAG CTTCAAATTT 
ATGCGTGTGG CCTCTGGGGA AATATTTATG 
CACGAGCAGT GAACTGGCCA GCTCTTGAAA 
GCAATGGCGC GTATTAGTTT TATATTTTTT 
AAATACGTGT ATGATGAAAA ATCAAATGTA 
ATTAATTGGA GAAACGTAAG CAAAGAATTA 
GATTCTTTAC TGCTGGAACC ATTGTCTGTG 
TCAAGTATAG TTAATGTGCA TAATAATGAC 
GTAATTAACC TTACGGTGGA TTTTAATTAC 
AATGTAATGA CACATGCTTT AACACAAGGC 
AACTCAACAA ACATTATGGA TCTTTTTTAC 
ATTACTTTTG ATTATTCCAA TGTAATAATT 
GTATCAGTTA CAAATTCAAG TGTAAAATCA 
GTTCCTAGCA CATTTCCAGC GTTAAAAGGC 
AAAAATTCAA ACTATG CTCT GGGTGTAATT 
GCATTTTTAC CAAAAAATTT TACAGAAATG 
GCCATTGATT ATCTCAAAGG AAAACTTTTG 
CCTTCAAATG AAAATGATAT CTTATCTTTC 
TTTATAAAAA ATCTTCAAAA GCAGCAATTA 
GCAG CGTTAG AAAGTTTAAT GCATTTATTA 
GAG CTAAATT TGGAAACGTT GTCGCATATT 
AATAGCATGT TATCGTTATC AGTAAATGAT 
GTATATAATA CACCTAAGGT TGGTGGTAAA 
TACATGTACA CCAAATACAG TGAAAATTAT 
ATGAACATGT ATGAAGTTTT AACAACAATT 
TATCCGTATA TATTGTTCAC ATCCATGTGT 
CAAATAGCAA AGCCCGATGA CATCACCATA 
TTAAGGTTTG ATTTAGATGA AAACAAGCTA 
ACCGGCTCGG AACTGGCACA AGGAGCGTCT 
GCCACGCGTA TAAACGAGTT TTCAGTCATA 
ACCGCGCTGA TGCCGCTTAC GAACATCACG 
GCTCGCGTTT ACGAGGTTTC GGAGGTGTTC 
TATCCAAATT GTTCTCATTT TACACCGCCC 
AACTTTTCCG CCCCGAGGAT AGGATGTCCG 
GAGAACCAGG GACTACAGAC CATGATGTAC 
TTTTCCCCGT ATTCTCCTTT CTTTGACAAC 
ATGAACAACG GTACCGTTGT GGAAATTCGG 
ATTGCATTAG TATTTGCTTT TATTGGAACA 
TTTTCCATCC TGGCCTAAAC TGTTAATAAA 
TCTTTTTGGA TGGTTACGCG GGGACTGAGA 
TCACATATTG ATTCAATCAC CTCTGCGCCA 
GGAAACATTA AAAAATATTT GTCGGTCGCG 
ACTCGAAAAT CCTTCTGAAA GTCGTGGGAC 
TCCAGGCGCT GTTTTACGCG CG AG AG CATG 
GCCGCCTTAC ACGTATTCGC GTATAGCATT 



11 



WO 00/28040 



PCT/US99/26260 



36241 
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36361 
36421 
36481 
36541 
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38341 
38401 
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38821 
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38941 
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39361 
39421 
3 9481 
39541 
39601 



TTAAAGTGAA 
AGGTGTTGGA 
TGCATAATTG 
GAACCCATGG 
GTCGCTGGCG 
CATAAGGTCG 
GTTATCTGTT 
AA-rGGCCTGC 

att^Sgtgtgt 

CTCTCGCCTT 
ACCATAATTC 
GAATAGATGG 
CCGACCATTG 
AGCTGCACCG 
CCTTTGGTAG 
TCTCCAGTGA 
AAACGCGTGA 
TAACCGTATT 
CAGAATAGCG 
CGTACCAAGG 
TTCCACCAAA 
CTCCAGCGTT 
TATTGGGTTC 
TGTCGCAGTT 
AGGGAAATTA 
AAGCCCGTGG 
CACCGGTAAA 
TATGTTAACG 
GAATGACGGT 
GTCGATAATT 
CATATTGGTA 
GAACATGTCT 
GTTTTGATCG 
CGAAAACATG 
ATACAG CCTG 
GAATATTGGT 
TACCTTGTAC 
AAGCGATTGA 
CTGAAATTTG 
AATAATTAC C 
CAAAATGTTA 
TAGCGTGAGG 
CAGATGTTCC 
AAGGTAACGA 
CGAACAGTAC 
TTCGTAGGTG 
GATAGCATCA 
AC CAC CTTGG 
ACGGCACGCG 
GTTTTCAAGA 
ATCGGAATTG 
AACAGGGCCC 
GGCCACGGAG 
AAGTTTTCAA 
TTTGGGCGTG 
TGCGTGCGTT 
TAAGGTATCT 



ACAGAATATT 
TAGG CTTTAG 
TCACGGTGTT 
CAAAAAATAT 
GTTGCTTGAG 
GTGGCGCATT 
TGTTATTTGT 
AGTGCAAAAA 
CCTCAAAGTC 
CGTCCGTTCC 
GC AG CATTGA 
TGCTGAATTT 
CGG CAAGTGC 
CCCCCCCAAC 
TTTCAGGAAC 
AGTCCCAAAC 
CCGGTCGTAG 
TTTTTACTTC 
CATTCGTGTT 
CGGGCGG CGC 
CACTCGATAA 
GTGAATATTG 
AATTCATCCG 
GCCGCCTTAA 
GTGGTCCAGA 
AGGTCTAACG 
AAGCACCTGA 
TTAATTG CAG 
TCTTTCACGG 
GACTCGAAGG 
TTGAAAACTG 
GGAAGAGGTT 
GCGTTAAAAA 
GCCTCTAGGA 
TCGCACGAAT 
AC CT CGGGTG 
GGTTTTTTGC 
TTTACTATGT 
TTTTGCTTTG 
GTTTGGACGC 
AATTTTAAGT 
ATTTTGAATG 
GGCAGTGTGC 
CAGAGGATTC 
AGCGACCGTA 
AAAACGGTGG 
TCAATTATTA 
CGGTGTATAA 
TAAAAGGTGG 
ATCTCATTAT 
GCCGGTGACG 
TGCAACAGCA 
GCCAATCTGC 
CTGCTGTTGG 
TATACCAACG 
AACACCGAGT 
GTACCGACCA 



TTCCAAACCC 
TTTTAAATTA 
AAACGCGGTA 
GTCTGCCAGT 
GATGGATCGA 
TGGATCGTAT 
AAGAGTTACC 
AAGAGAGTTA 
AGAACTTCTA 
CACGCACAAG 
TAATTCCGGA 
TTTAAACAAC 
CGACGAGTAC 
TAAGCGAGCC 
CGCAGCGGCC 
GTCTAATAGT 
CTATGAAAGA 
GCACGTATAT 
TGCTTTTGGT 
GGTTCAGGGC 
AACGCTTGTG 
TGTTTTTCAC 
AAAGTTTTTT 
ACCAGGAAAC 
AAACGTCTCG 
TAGAGTT CCA 
GGTTTCCCAT 
CCCCGTTGCA 
AGAATTTTTT 
TACCGGCGAG 
AGATTTTGAA 
CGGTCTGGTG 
TAAAATTGTA 
CGCGAGGCGT 
GGCACACGTG 
GAAACGTTAG 
CGGGATCCTT 
TTTGGTAGTC 
AGAGGTTTTT 
GATGCAGTTC 
AATTCAGATG 
GCTCTAAAAG 
GCGTAAAGGC 
GAAACATGCT 
GGATTCGTTC 
GTAAAAACGT 
AACCCGGTCC 
AAGGGTATAG 
TCTCTGTTTG 
TGGTGGCCAG 
GGGCGTTCGC 
TAATGGAGGC 
TTCGTCAAAT 
GAAAAGACGC 
TAATTCAGTT 
TTAAGGACCT 
TCGCGTATGG 



AAACGCTGCA 
GGTAAAAACA 
ATCGGGCTCT 
AAACAGT CCG 
AGTTTTTTAT 
TCTTGCGTGG 
GGTGGGGTCT 
CTCGCAATCA 
ATTACTGGGT 
CAGCTGTTTA 
CGTTTTGGGG 
AGGGGAAGGG 
ACGACATGAC 
TTTAAATCTG 
TTGGCTTCGC 
GTGTCGAATT 
CATAAAATTG 
CTTTGGAATG 
GTGACTAATG 
TCCCAGATCC 
TAGAGTTTGT 
CTGGTCTATT 
AAGTAAGGGC 
GTTGCATCCG 
CCGCCCCCGT 
GTTCGCAATC 
AGCGAACATA 
CAGGTTGTCG 
CATTACAAAG 
AGACCAGCAC 
GTTGTGTAAA 
GTCGTGACGC 
ATCTCTATCT 
TTCGCCTGGC 
TTCCGCCACA 
ATAGTACGTT 
TTTGTTAATT 
TATGATTAAT 
TAAAAGTACC 
GGGATGCGAA 
CAGTTTTGCA 
TTTAAAGTAA 
AG C AAAG CAT 
TTGATAATTT 
GTCGCTTTGA 
ATGCGTCTCG 
CGCGTACTCG 
TCCTTCTCCC 
CAT AGC CCTG 
GCAGCAAAAA 
TGCCACCGTA 
GGCTCTGGAG 
GAAAGAGTCG 
CAGGGAGGGA 
TGTGAAGTTT 
CAAACGAATG 
GGACGGCAGG 



ATAGGACGTG 
TTATAGAGTT 
CG CCTATTTG 
ACTGTACATT 
TATCCGGCGG 
TGTTTGCCAG 
TGACGGTCGC 
CTAATTTTGA 
CTCTGAACAA 
GCGCGTAGGG 
CGTACAATGG 
GTACGGCCGG 
AGACGTACTC 
AGATTTTTAT 
GGTTATATTT 
ACCTGCGGTT 
GCGTTGCGTC 
AGGTTGCATT 
ATGCGTTTTG 
AAACGCAAAA 
ATTTGCGCTT 
CTAG CATATG 
CTACCGTATA 
TCTTTACTAG 
CTAACTATGT 
GAAACCGGGA 
AAGTGCAGCA 
GAGTAGAACA 
TTATTAGTGA 
GTGATTTTGG 
CTCATGTGCT 
GCTTGTTCAT 
TTTGGTGCGT 
CCCAGAAACT 
GACATGCACA 
G ACTTTGG CG 
TTCACCTGTA 
TCGCTCGCCA 
GGATTGGATT 
ATCCAATCCG 
CACGGGGGCG 
TTCTGATAAC 
TCCTCGAATT 
ACGCCACATA 
ACCAAAACCG 
ATATGGGAAA 
TTACCGGTCT 
AATAC CAG AC 
TCCAAGGCCT 
ATTGCCAGTT 
GCAGGACACG 
GTACGACCCT 
GCAGCCAGCG 
GGAGTTCAAT 
CTGGAAACGT 
ACGGATGGAA 
CGGCCCACAA 



CTTTAAAAAA 
GTTGGCGCGC 
TGGCTGGCGC 
TGACTCGCTG 
AAGCGCGCTA 
CTCGTTTATG 
CGGGACGGAA 
GTTTACGGTA 
TGATACGCCT 
ATGGGGTGAC 
CATCATGTTT 
TCCCGGAAGT 
GCCGTTGGAT 
CATTGATTTT 
TTGCTTTATG 
TTAAGTTTGA 
CTAACTCGTC 
TTTTATATCC 
AAAAGTCAAA 
ACGAACAACA 
TATTT CTTGT 
G AACG CTTAA 
TTTTAGAAAT 
AAACCACAGA 
ATTGGTTTTC 
AAATGTGCTT 
TGTCCCCGCT 
CACTCACGGT 
GCCTGGGGAT 
TATTAATTAC 
TAACTGGATC 
ATCGATCTGG 
ATTGAATTCG 
CGGTGTTAGG 
TTGCGACGCG 
TCACAACAAC 
GATTTACGTC 
CTTTGATGTC 
CGATGTTGTG 
AAGTAAAACC 
G AC AC CGAGC 
ACGTGCCGAT 
CCCATATTTG 
CCATTAATGG 
TGTCGGACAG 
ATGTACATCC 
TGCCAACGAT 
GCCGGAGCGC 
CTAGCGAGAC 
GAGTTTTAAA 
CGGGCAGAAG 
TTCCATATAT 
GACTTTTTAA 
TTGAGGGTCT 
CGCTAGCGGT 
AAATTCAGTT 
AACAAAAACA 
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3 9661 AT AC ATT AT C ATGAAGGCCT GCAATAAGCA TCACATTGGT GCCGAGATAG AGCTGTCGAC 
3 9721 TGATGACATC GAGCTGCTAT TCATTGACAG AGAAACCCCA CTCGATTACA CAGAATACGC 
3 9781 CGGGGCCGTA AAGACGATTA CCGCCTCTCT CCAGTTTGGC GTGGACGCGC TGGAGAGGGG 
39841 CCTGGTAGAT AC CGTATTG A ATGTTAAGCT TAGGTCCGCC CCGCCGATGT TTATTCTAAA 

3 9901 AACACTATCA GACCCGGTCT ACACCGAACG GGGTCTAAAG AAGGCTGTTA AG TC AG AC AT 
39961 GGTGTCCATG TTCAAAAGCT AC CTTATGG A TAACTCGTTT TTCCTCGACA AAT C AG AC AT 

4 0021 CGCCGTCAAG GGAAAGCAGT ACGTGCTGTC GGTTCTCTCC GACATGGTGG GGGCGGTGTG 
4 0081 TCACGAAACG GTTTTTAAGG GGACGAATAC GTATCTGTCT GCATCGGGAG AGCCAATTGC 
4 0141 CGGAGTCATG GAG AC C ACGG AAAACGTAAT GCGAAAACTG TTAAACATGC TAGGTCAGGT 
4 0201 TGACGGGGGC ATGTCCGGTC CGGCGTCTTA CGCCAATTAC GTTGTCAGGG GCGAAAATCT 
4 0261 CGTAACCGCC GTGACGTACG GTCGCGTCAT GCGAACGTTT GACCAGTTCA TG AAACG CAT 
40321 AGTGGACCGT CCCAACGCGC AGCCCAGCGT CGACGATGAT CGGGACGCGG TGGCGGACGG 
4 0381 GCAGGACTCC CTCGCCAAAA CACCGATCGC GGCGGCTGTT ATCCAGATTG GAGATAAACT 
4 0441 GGTGGCTTTG GAAAGTTTAC AG CG AATGT A CAACGAGACC CAGTTTCCCT TCCCGTTAAA 
4 0501 CAGGCGTATG CACTATACTT ATTTTTTCCC CATTGGGTTG CACATGCCCC GTCCTCAGTA 
4 0561 TTCCACGTCG GCCACAATTA AGGGGGTCGA GCACCCAGCG GAACAATCCG TCGAAACGTG 
4 0621 GATTGTAAAT AAAAACAACG TACTG CTG AG TTTCAATTAT CAAAACGCGC TCAAGTCCAT 
4 0681 CTGTCATCCG CGCATGCACA ACCCGATGCC GTGTGGGCAA GCTTTGGGTC AGGCGTTTCC 
4 0741 AGATCCCGGA CACGTTCACA GGTACGGACA AAGATCTGAG CACCCCCCGA ACATGAACCT 
4 0801 ATACGGGCTG GTGTACAACT ATTACCAGGG CAAAAACGTG GCACACGTTC CGGATGTCGC 
4 0861 CCTAAAGGCG ACGATGACCA CAGATGAGTT GCTGCACCCA ACCTCGCACG AGACGCTTCG 
4 0921 TTTGGAGGTT CACCCGATGT TTGATTTTTT CGTTCATCAG CAGCCTGGTG CGCAAGCCGC 
4 0981 GTATAGGGCG ACCCACAGGA CTATGGTGGG TAACATTCCA CAACCCCTGG CGCCCAACGA 
41041 GTTTCAAAAC AGCAGAGGCC TGCAGTTTGA CAG AG CGGCG GCCGTGGCTC ACGTGCTGGA 
41101 CCAGTCAACG ATGGAAATTA TCCAAGATAC GG CGTTTG AC ACGTCGTACC CACTACTCTG 
41161 TTATGTCATC GAATGCCTCA TTCACGGACA GGAAGACAAA TTTTTGATTA ATTCTCCTTT 
41221 AATTG C ATT A ACCATTGAAA CTTACTGGAA CAATGCCGGA AAACTGGCGT TTATTAACAG 
412 81 CTTCCCTATG CTGCGATTTA TCTGCGTTCA CCTGGGCAAC GGTAGTATTT CTAAGGACGT 
41341 GTACGCCCAT TACCGAAAAG TTTTTGGCGA ACTCGTGGTT TTGCAGCAGG CGCTCTCGAA 
414 01 AATCGCGGGC CACGAGGTGG TGGGGCGCAG GCCCGCGTCC GAGCTGATTA ACTGTCTTCA 

414 61 GGACCCCAAT CTTTTGCCGC CCTTTGCTTA CAATGACGTT TTTACCAACC TGCTCAGGCA 
41521 GTCCTCGCGG CACCCCATGG TACTCATAGG CGACGAGGGG TACGAAACGG AAAATGACAG 

415 81 GGATACGTAC AT C AACGT C A GAGGAAAAAT GGAGGACCTA GTCGGTGACA TGGTTAACAT 
41641 TTACGAGACC AGAAACAACG CGGATCATGA CGGCCGCCAC GTCCTTGACG TCGGTCCCTT 
417 01 TAATGAAAAC GAACAGCACA TGGCTGTGCT GGAAAAGCTT TTTTATTACG TGGTTCTGCC 

417 61 AGCCTGTACC AACGGTCACG TCTG CGGCAT GGGCGTCGAT TTTGACAACG TGGCCCTGGC 
41821 CCTGACGTAC AACGGCCCAG TGTTTGCTGA CGTCGTGAAC CCCGACGATG AGATTTTGGA 

418 81 C C AC CTG GAG AACGGGACGC TCCGCGAGAT GCTCGAGGCT TCGGATATAC ACCCCACCGT 
41941 TGACATGATT CGAACTCTTT GCACGTCGTT TCTCACCTGC CCGTTTGTTA CCCAGGCCTC 
42001 CCGTGTTGTG ACTCAGCGGG ACCCCGCGCA ACTGTTGACC ACTCACGACG ACGGGAGATA 
42 0 61 CGTGAGCCAG ACTGTCCTCG TTAACGGGTT CGCGGCGTTT GCTATCGCAG ATAGGTCTCG 
42121 TGACGTTGCC GAGACCATGT TTTACCCGGT GCCGTTCACC AAGCTGTACA GCGATCCCCT 
42181 GGTGGCGGCC ACGCTCCACC CGCTGGTCGC AAATTACGTG ACGCGCCTGC CGGCCCAGCG 
42241 CGTGCCGGTC GCGTTTAACG TCCCCCCGGC CCTCATGGCC GAGTACGAGG AGTGGCACAA 
423 01 GTCTCCAATG CTGGCCTACG CTAACACCTG CCCGATGACG CCCACGTCGT TGAGCACCCT 
42361 GGCGAGCATG CACATGAAGC TGTCCGCGCC GGGGTTCATC TGCCACGCAA AG C AC AAG AT 
42421 TCACCCGGGC TTTGCGATGA CCGCCGTCCG AACCGATGAG GTGTTGG CGG AG AACTTG CT 
4 2481 ATTTAGTGCC AGGGCCTCGA CGTCCATGTT TTTAGGGCAG CCATCGGTTA TGCGTCGGGA 
42541 AGTCAGGGCG GACGCAGTCA CGTTTGAGGT GAATCATGAG TTGGCATCGC TGGACATGGC 
42601 GCTCGGTTAT TCTTCCACCA TCACGCCCGC CCACGTTGCG GCGATTACCT CGGACATGGG 
42661 CGTTCACTGT CAGGACATGT TTCTCATGTT TCCCGGGGAC TCGTACCAGG ACAGGACCCT 
42721 CAACGACTAC GTT AAAC AAA AAGCCGGATG CCAACGATTC GGTGGTCCTG GCCAGATTCG 
4 27 81 TGAGCCCGTC GCTTACGTTG CGGGGGTGCC GCACTCGGAC AACATACCGG GTCTCAGCCA 
42841 CGGACAGCTG GCCACGTGTG AGATTGTTTT GACGCCCGTT ACTGCAGACG TT AC CTATTT 
4 2 901 TCAAACCCCC AACAGTCCCC GGGGACGGGC ATCCTGCGTG ATCTCGTGTG ACGCGTACAA 
4 2 961 CAACGAAAGC GCGGAACGTT TGCTCTTTGA CCACTCCATC CCGGATTCTG CCTACGAATA 
4 3 021 CCGCACTACG GTTAACCCAT GGGCGTCGCA GCAGGGCTCC CTCGGAGACG TGCTGTACAA 
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43 0 81 CTCAACCTCG CGCCAGGTCG CAGTGCCAGG GATGTACAGT CCGTGTCGCC AGTTTTTC C A 
4 3141 CAAGGACGCT ATTTTGCGTA ACAATCGGGG CCTGAACACA CTAGTCACGG AATACGCGGC 
4 3201 CCGCCTCACG GGAACGCCGG CGACCAGCGC GACGGACCTG CAGTACGTGG TGGTCAACGG 
4 3261 AACGGATGTG TTTCTAGAAC AACCGTGCCA GTTTCTACAA GAAGCGTTTC CCACGCTCGC 
4 3321 CGCCAGTCAC AGGTCTCTGC TGGACGAATA TATGTCGAAT AAGCTCACGC ACGCCCCTGT 
4 3381 GCACATGGGA CATTATATGA TTGAGGAAGT GGCCCCTATG AAAAGACTAT TAAAGATCGG 
4 3441 AAACAAGGTC - GCCTATTAGT TTAGCTCAGA CGGTCTGGAG CTAACGAGAG ATGGCCCTCG 

435 01 ATAAGAGCAT CGTTGTCTCG GTGACGTCTA GATTATTCGC CGACGAGATA GCAAATCTTC 
4 3561 AGTCAAAGAT AGGATGCATT TTGCCTCTCA GAGACGCCCA CCGTCTGCAG AATATACAGG 
4 3621 CGCTGGGTCT GGGGAACCTG TGCTCTAGGG ATTCCGCGGT GGATTTTATT CAGGCATATC 

436 81 ACTATTTGGA CAAATGCACT CTCGCCGTGT TGGAAGAGGT CGGTCCCAAC AGTTTACGGC 
43741 TAACGCGCAT TGATCCCATG GACAATTATC AAATAAAAAA CGCGTACCAA CCGGCCTTCC 
43 8 01 ATTGGGATAA CTACTCAGAA TTGGTAGTTA TACCACCGGT CTTTGGGCGC AAAGATGCGA 
43 8 61 CCGTCTCACT GGAGTCTAAC GGGTTTGATG TGGTTTTCCC TGCCGTGGTG CCAGAACCAC 
43921 TGGCTCAAAC AGTGCTTCAG AAGCTGCTGC TGTATAACAT ATACTACAGA GTGGCGGAGA 

43 981 CGACGCCCAC CGACGTCAAC CTAGCCGAGG TGACGCTGTA CACGACCAAT ATCACTTACA 
4 4 041 TGGGTCGCAA CTACGCCCTG GACGTGGACC CCGTTGGGTC GAGCTCAGCT ATGCGGATGC 
44101 TGGACGACCT GTCCATTTAC CTGTGCGTTT TGTCCGCGTT AATTCCGCGC GGGTGCGTAA 
44161 GGCTACTGAC CTCATTGGTG CGCCACAACA AACACGAATT AGTCGAGATT TTCGAGGGGG 
44221 TGGTGCCACC TGAGGTACAG GCCCTGGATC TCAACAACGT AAGCGTGGCC GACGACATAA 
442 81 CGCGCATGGG TGCCCTCATA ACCTATCTAC GAAGTCTCAG TTCTATATTT AATCTGGGCC 
44341 GCAGATTTCA CGTTTACGCG TTCTCATCGG ACACGAATAC CGCTTCCTGT TGGTGTGCAT 
444 01 ATAACTAGAA ACGGGCCTCC CTGTGCTTTG ACATGTCGAT CCCCAAAATT ATGACGGTGT 

444 61 CCAGAGACAA CGAGGGTACG GTGTGTGAAG TCGCGGTGGA CAACGGACGA C AC AG AG CG A 

44 521 TGATTTATTA CCCTAAGACC ACCAACTTAG CAAACGAGCG CGCGGACGTT GTTAAGGAAG 

445 81 CTTTTGATAC CGAAACCCCA GTGGACATTG TAAAG CAAAT TGTTAACGAG GGCCTAGCTA 
44 641 TATCCAAAAA AAATTG CGTC CGTTTGGCGT TGTATTTATA TTTTTATTTG CAGTACGTGT 
447 01 GCTTTGCTCT GCTCCTCACT TGGCAGTTAA AC C C G T AC AT GGACCCACCG GGTCTGGTGT 
44 761 TTGCGGTTAA CCCCATGGGT CCAAAACATG TCACGAAACT ACCGCACCCG GCTATTGTTG 
44 821 CGGTAGGTTG TGGGGCAGAC GCCATCTGTA AGAACTGTAG CGTCCCCGAT ATCAAAACGG 
44881 AGCTTGGAAT GGTTTACCAC AACGGGTCTA GCGATTCTGG TCAGCGCGCA CACTATGGGC 

44 941 TGGCCCTGTT AAAGGCGGCC TGGCTTGTCA TGGGAAATGT GTGTCCGGAA CCAGTAGTGC 

45 0 01 GGCAAGGCGC TGCATTACTT GGTCCATGGA ACCGGACGGA GTGGTCGGAT TTTAAATCGG 
45061 CAATGGCGGC AACCACGTTT TGCGGATCCA GAGGCGTTCT GTGGTCACCG ATTCATGAAA 
4 5121 AAAACCTCTG TCGCCCCACC TGGAATGATG TAATTAACAC ATCAGTTTTT ACAAATGAAT 
45181 CACTCTGTCC AAATATACCT GTGGTGCCCG AAAGTGTAAT AGTG CTTAAT GGTGATGCAT 
4 5241 GAGAACAATA AACGTATTCC CACGCACTTC ATGTACGTTG TTTTTATTGG TCTGGTAAGG 
4 5301 TATTAACAGT AATGGGAGGT TCCGCTGGTG CCTATAAAGC AAAACGGTCA TAGAGTAACA 
4 5361 ATATTGTGAA TGGGGAAAAT AGTCTGTAAA TAGTTTATCC GAACAGTATG ACTGCACACA 
4 5421 CGAATGGGGT TTTAACCACG ACGGGCTTTT CAACAAGTCA GCCGGAATCG GTTCAAGTTT 
454 81 CTCCATTTTA TCGCGTAATT ACAAAACCTC CCGTTATGGG CTTGTTTTTT TGCGTGGCTA 
4 5541 TGTGCGTTAT CGCGTTGGTA TGGTACGTGA TGCGGAGGGT GTGTTGTAAG GGGCGCGTTG 
4 5601 TTGCCGATTC GTGTCGCGAC CCGCGTCAAC CCGCGTATGA GATGTTGAAT GTTAGGTTGC 
456 61 GTCCCCACGG AACCAATCCA TAGAGAACTT TCACGTACAT AGCCAAAGCG CACGACGTGC 
4 5721 TCTCTAGCTC GTTTATTCTA GCGATTTAAA AACAAATTTA TCGTTGGTTG CCATGTAGTG 
4 5781 GGCCATGACG GCGGCGACAA GCACGTCGTC CGACATGGTT TTTTGTTTCG CGCAGTACGT 
45 841 GTGACCTCCG TCTTTTAGTG GAATGCAACG TATTG CCTTG ATTTGATCTA TTAGATACGC 
45901 TACGGGATCA AACGATAGCT TAATAGTGTG AGACACCACA GTTTGGCTAG CGCTAAACGT 
45961 TCCCGAGTTC AATGCGTAAA TAAAAGATTC AAACG CCTTG GCTTTCTCTG GCCCCAACAT 
4 6 021 GTAAATGGGC GAACGTATAA GAGTGTTCTT GTCCACGTGG TGTAAAAAAC TAAGAGGGAC 
4 6081 CGAGCAAATC TCGTTTAACA CGGTTGCTAT GGCCACTCCG GCATCTTGAC TGCTGTTGCC 
4 6141 TTCCACGGCG ACGTTAACGT GCGTGATCTG CGGGTGAAGG GTAACGATCG CTCGAATTAA 
4 6201 CGCGGCGGCA CATGACGCTA TCTGGTACGC GGCGGTGCCG GTTAGATCTC TTAGAAAAAA 
4 6261 ATGTTCCACG CCTAATAAAA TGCATTTAAT C AC CTTGTG A TTTACCGCAA TCACCGCACC 
4 6321 GATGCCGGTG CCGGACGCGT CTGTGTTGTT GGTGTACGCC GGGTCTATGT AAACGTGGAG 
4 6381 ACACGGCTTC ATCGCGCCCT GTATATCTTG TGACGTGCTC TTTACGCGGC ACAGATCCAA 
4 6441 TTGTGACAGC GAAGAGTCGC TAACGATTTT ATGCATGCTT TG AG CGGACG TGGCAGCGTC 
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4 6501 * CCCCATTAGC TCCGTGGAGA ATGCGCCTTC CAAAAAAAGA TTAGTCGTGC TCCGTACCGT 
4 6561 CTCATCAATC GTAATATAGG CAGGAATATG CAGGCGGTAG CACGGGCACG CCACGACAGT 
4 6621 GTCTTGCAAG TTAAAATCAT CTTTATGATC CGGACATACG TAGTTCACCA CGTTAAGCAT 
4 6681 CTTTTCGTGG GCGTCCTTCA AATTAAGTAA AAAACTAGTT GATTTGTCAG ATGAGTTACT 
4 674 1 TGAGGATATA AAAATCAGTT TTGCGTCCTT TTGAAG CAT A AAACCCAGAA TCGCGGGGAG 
4 6801 GGCATCTTTT TTAATGAAGT TCGCCTCGTC GATATATAGG AGGTGGAACG TCTGTCCCCG 
4 6861 GATGCTCTAA AGGCAAAGAA AAACACAAAA CGAGTTACGC GGGTATGGAG AACGATACGC 
4 6921 CTAAGGACAA AATCTCGGAA GCTGACTTTC AACAGTGTCA GGCGTTCTTT CACCGTCCCA 
4 6 981 TTAGAGATCT AATTTCATCT GGAGCTGACG CTTTAAACCA CTTTAGCCTA TCTGAATCAG 
4 7 041 ACGGACATAA ATTGGAACGG ATTGTTCTTC TGCTTGACCT GGTGGGGACA GAATGTCTCT 
4 7101 CTTATACCAC GATCGCTGCA AAGAATGTCA AATGACGCGC GTCAACAGCC CAATATGTCG 
4 7161 ATTTCATAAC GTCTCTAACT TATACCAGTG TTTGGATTGT AAGCGCTATC ACGTATGCGA 
4 7221 CGGGGGACGC AACTGCGTGA TCGTGTACAC TCGCGAAAAT CTAGTGTGTG ATTTAACGGG 
4 7281 AAACTGCGTT TTGGATAATG TGCAGGACGT ATGTTCGTAC GGTCCTCCAG AACGCCGCGT 
4 7341 ACCCGACGCC TTCATCGATC CGCTCGTGTC ACACGGCACG AGGGAATGTC TT AAAAG CG A 
4 74 01 TATACTGAGG TACTTTGAGA CGGTCGGTGT GAAATCTGAG GCATATTCTA CCGTTGTCAA 
4 74 61 GAATGGACAA TTGAATGGCA TCATAGGTAG ATTAATAGAC GCTACGTTTA ACGAGTGCCT 
4 7521 TCCGGTAATG AGCGACGGCG AAGGTGG C AG AGACCTCGCG GCGAGCATTT ACATC CACAT 
4 7581 AATTATCTCC ATATACTCCA CTAAAACGGT AT AT G AT AAT CTTCTATTTA AATGTACGAG 
4 7 641 AAATAAAAAA TACGACCACA TTGTAAAAAC TAT C AG AG CG CAATGGATGC GCATGGTCTC 
4 7701 AACCGGCGAT CCGTCGCGGG TCAGTGCGAC GGGTTGTTTC ACGTGATACT TCCGCGAGGG 
47761 TTTATCCTCG CGAACAATAT TACGTGCGGT GAACGGCAGC GGTTTTTTGC ACACACTTGG 
47 821 TTCGCTGCAT CTGGACGCAC GTCTAAGACT TTATACGTGT GGGGACGGGT ATTTCAAAAC 
4 7881 ACCGACCCGG GCCGCGGGGA CGGTCCGTCC GGGCCGTGGT CCGGACTGGC GATTAGTCTG 
4 7 941 CCTCTGTTTA CCACAAATGG AAAATTTCAT CCGTTTGATG TAGTTATACT CAAGG C CG AT 
4 8001 ACGCCTGACT CTGGAAGCTC GTGGACCGTG AAGTTCTTGT ATATGTCATT AATTGCGGCT 
4 8 061 TACAGAAACG CAATGCGAGG TTTAAAAGAT AAAGTTTCGC AATGTACCGA TGCCGCCGTT 
4 8121 GACGGTGAGG TTCATCCTCT AACCGTCTTA AAAGAAGCGT TGGTATCACC GGACACTGCT 
48181 ACGCGACCCG TGTCCGCGTG CAACCCTCTA CAGATGTTGA CCGGACTCTT ACAGTCTAGG 
4 8241 GTACGGGACG ACTACGTGAC ACACCACCGT GCGCTCGAAC GCCCAGGTAA TGTGAGGGGA 
483 01 CAAGTAATCG CCCCGACGCG CACCGAGATG CCAAACGGAT CGCCAAGTCG TGTAAGGCTT 
4 8361 GGATTCCGCC CTCCCAAACA AGCCAACTAT CCAAAGACGT GGGCGCAGGC GCGTCACGTT 
4 8421 TTCTCGTCTC GCGCATATTA CGTGTGCGTA TATGATAACG AAGAACTAGA TACCAAGTGG 
4 8481 CAGCGGCAAG ATCCGCGGCC GTTGCCGCTA GATTGGTCCG ATCCGGTCGC GTACCTGTTA 
4 8541 GAGGGCGATT TGTTTTTAGG AGCCAAGCAG AATGCGTTTG TCGATTCTAT AG AAAAG AC G 
48601 TGCAGGTGTC AGAACTATAC CATTAAGCAA TTTTTTCCGG TTTTGATAAA TAGGGACAAC 
4 8661 GAAACAGTCG ACTTAATTAA GGAGCATTTT ATAGAGGCGT GCTTCGTGAT TAGAAACCAG 
4 8721 GTGTCAGAGA GGAGCGCTTG GGTAAAGGCG GCGCTGTTTC GCAACGATAG TAACACGTAT 
4 8781 TGGAAGGATG TTTTGGGATT ATGGGAGCAT GGGCCTCATA AGCTGGGTAC GGCTATAAAA 
4 8 841 CTACCAACAT CGGAACCTTG CAATGCCGAC GTAAACTGGA GCTGG CTCCT GTGTGACGAG 
4 8 901 GATATAACTC GGTCAATTAG CGGGCAGTCT ACTGTTTGCT TAGTTGTCTC GCCTACCCTG 
4 89 61 ACCGCCTGGC TGGTGCTCCC GGGGGGCTTT GTTATTAAAG GCCGCTACGA CCTATCAAGC 
4 9021 GAGGATTTAA TGTTTGTGGC TTCGAGATAT GGCCACCCAG CGTCGTCACA TTCTTAAATC 
4 9081 GTTTTTAAAC AAGGAATGCA TATGGTTG CG ACACCCGGGT ACGTCCGCGT TTGTTCGGGT 
4 9141 ATACACCGCG ACCACTGCGC ATTCTGCCGT TTTTGACCCG CCGGTAACTA GCGAAAATGC 
4 9201 GATGTCACTT AACTTTTTAA ATGTTATGAT CGTAATCATG AAACCAAAAG AATTTGGCCC 
4 9261 GTGCGTTACC GTGTACATGA ATGGAGATAT TCTAGATTTT TGTGCCACGG AATCTGTCGC 
4 9321 CATAAGGGAC GTGCCTGGTA GGGCGGACCT GTGTTTAATT CGTTTTGGTA CCCTTTCTAA 
4 93 81 TGCGCCGAGG AGCGTTCCGA TACCCGGGCC GTTGAACCCA CATCCGCGAG AAACCGTGCC 
4 9441 CGGGCTAACA AAACAGGAAA TTATATACAC TTCGCAAACA GTGCCAAGAG G AC AG AT AC C 
4 9501 AGATGCCATA AAGGGGAAAG AGTTCCACCA AATAAATCCG TTTTTGTGGT TTGACGGAGG 
4 9561 GGCGTTTTGG CAACTGTTCC TCTCTGTGGA TTTTATGCTG CTCTGTCCCG CACTCGACAC 
4 9621 AGTTCCGTCC CTGGCCAGAA TCGTTGGGCT TCTTACACAG TGCGATAAGA GCACGTGTAA 
4 9681 AATTTGTACG GGGGCCCACG TACACGTTAA CCCGTATCGC GGATACACGC CACCTGACTC 
4 9741 GCAAGGGACC TCACCCTCGT GCCCCTGCCT TATCTCGTGC GGGGCCAGGC GCGCGGCGGA 
4 9801 TGTCCTGGTT ACCGGACACG TTAATCTTTT GGGCCTGCTC TTTGACCCCA AAGCCTCCCC 
4 9861 CAAAGTGACC AAGCTGCGTT TAAAAAGAAA CCCACGCCCG GTACCGATAG AGGACGCCAT 
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4 9921 GTCGGGCGTC ACGGCCGAAG GGACCGAGGT GCAACCCACT TCGCTACCGT GGGCCCTCAT 

4 9981 TCGCCTGCCG GATTTAGCCA GTCGCGTGAT GCTATACGGC TGCCAGAACT TAAAAAGCAT 
50041 CTGCTTACGT TCTTATTGAA GCACGTCGCA CACATCAGCG AACTCGGCCG CCGTCCGGGG 

5 0101 CTCGCGTACA CGATGGTTCC GTTTTCCTTC TTTATGTTCA GGTTCTTCGC CGGAAACCAC 
50161 CTGGATAGCG TGGCGATGAC CTCGGAGAAA ACGGCATTGG CGACGTGTTT TTGGTGGGCC 
50221 ACGTAACCCA CGTGAACGTT TTCAACCGAC GAGAGAAGAA CGCTGATAAT GGCGACCACG 
50281 ATCCATGTTT TCCCGTGACG ACGCGGGATA ACAAAAACGC TGGCTTTCTG CTTAAACGTC 
5 0341 TGTAACACGT CGTCCGTGGT TTCAAATAAA CCAAAGTGTT GCTTAAACGT GGCAAATAAC 
5 0401 TGGTGTGTCT TTTCGGGTGC TTTGATAGAC GCGATAAAAT AAAACGTGTG CATGATTAAT 
50461 TGTTGTTGAA AGGGCTCAAG GCAACAAACC CCGGGCACGT AACTCCCATT TAAAAATGAC 
50521 GACAGGTTGG CTAAAAACTG CCGCAGTTCT GCATAAACGG GGCACTCTAG GAAAGCTTCG 
5 05 81 TGGGTCCGTT GGGCCGATTG GTATTCCATG TACTGGTCCT TGGCGTTTGC CATTTGGACC 
5 0641 CGACAACATG TTTGTTTAAG GTAGTTTGTG AGTTCCTCGG TTAAC CGGGG CAACGTAGCG 
50701 TGGCTAGAAA TTCGGGGCTT TCCCAACGTA GGGGCCGGCA TGGGTTCTTG AACTCTGCGG 
50761 TACGTTTGAA GATAATCGTC TAAAACAGAA CTATATG CAT TAACCGCGTG AATAACTCCC 
50 821 AAGCATGGGT GGGCCATTCG TTCGGTCTTG TCGTTGCCCG ATATAATAAC GGGCGGTTGC 
508 81 CGAAACCAGT TTTCGCAACC ACCGTCCGTG ACCACGCGCA AGTTATTTTG AAGACGTTCG 
50 941 CGATAGCTGG TTAACAACAT GTTCC CTTCA AGTTTTTTGA ATAACGGGCA CCCCGAAACG 
51001 GAACGCCGCT TCGTTAAAGG CGTTCAGTTA GCTTTAGACC TGTGTGACAA CACTCCGGGA 
51061 CAGTTTAAAC TAGTTGAAAC ACCTCTTAAT AGTTTTCTTC TGGTATCCAA CGTTCTGCCG 
51121 GAATCGCGCC CGGTTAGAGA CTGTCCGCAG CCGGAAGGGT TTGACTTTGA ACACATTCAC 
51181 CTCCCAAAAC TAACACGCAT GCAGCGTGTC CTGGGGCGAT ACTGCGACCA TGTTAACAAC 
51241 GACGACACGT GCGTTAACGT AAAGGCAAGT TCCTCGAATT CACAGGGTGC CTTGTTTTAT 
513 01 CTGCCGTATG GACAGGACGA ATGGAATTGG GCGCTCACGT TAAGGAAAGA CAAGTTGGTT 

513 61 AAAATGG CTG TAGAGGGCTT GTCAAATCCC ACGACCTGGA AAGGTTTAGA GCCCGTGGAT 
51421 CCTTTACCGC TCATATGGCT TCTGTTTTAC GGTTCCCGGT CGTTCTGT CG GGAACCAGAG 

514 81 TGGCTATATG AACGCAATTT TGGTATGAAG GGACCCATAC TCTTACCGCC ACATATGTAT 
51541 GCCCCCCAAA AGGACGTAAT GACTTTTGTC CATCATGTAA TTAAGTACGT TAAATTTTTA 
516 01 TACGTGAACG CCGGTGGGGG TCTTGAAACT GAACCGTCCC CGCCGTTCGA GGCCTCGCGG 

516 61 TTGCGCGCAG CCATCGCTCG TCTCGGGGAC GTGGAAGCGG ATGACGCATA CCTGTCCGCA 
51721 AAGTGCATGT TGTGTCACCT GTACAAGCAA AACGATACGA TTTCGATTCA TGAAACACAC 

517 81 GTGGGCGGAG TCATCGCATT AGGCGGAGAC GGTG CG AG AT ATATAACGTC TAGTGTTCGG 
51841 GCTCAACGGT GCACGAGTCG GGGAGATTTT GTTTTAATCC CCCTGTACAA CATTGAAGGG 
51901 CTCGTAAGCA TGATAAGGGA ACATGGCCTC GGCAGCAGCT AAAAAAATGT TAATTAAGTC 
51961 GGAGCTCGAG TCGGAAATCA ACAAAAAACT GTCCATCTCC GTATTTGACA GGTTTGGGGC 
52 021 CGACAGTGCC GTGTTTAACG CG C AGTATAA GGGAACCAGG GAATCGCTGC GGTCGTACAA 
520 81 CAGCCTAAAA AAGAAGGACG ATCTGGCGAC CGTTGTCGGA ACGCTAGAAA CGTCGCTGCG 
52141 TGAAAAACAA AG CG AATTGG GATTACTAAA GGGGTTTAAC AGGAAAAAAA TTGAAGAGTT 
522 01 TGACGCTGTG GCGGACGCGG TTCGCGACCT CAAGGACGAG CTGTACGGAG AACTGGAGAT 

522 61 TCTAGGTACG CTTGACAATG AATCTGTTCC CGTGGAAGAA GAGTCCCCCA AGGACGACAT 
52321 TATTAGGTGG AAATTGGAGC GTCTGCCCAG AGTGTGCCCC AAAAGCCCTT GATCCCATTC 

523 81 CCAAGGTTCA GACTGACGTC GACAGAACAG CATCGTCCCA TATAACCGTC ATTAAAACAC 
52441 GTAAGACGAT CGCCCAACTG AAGATAC CTA ACAACTGGGG CCAGTGTAGT CACCAGGCGA 
52 501 CGGACTGGAC CGCCGTGCTC GGACGCGGCT CGTATGGTGT GGTGAGGTCC ATGTCTCTCG 
52561 GCCGCTGCGT TAAGCATTTT GGCAGCCGGC GTGAGTTTTT TTACGAGTGC ATTTTTAACG 
52 621 ATATAGTACG CGCCTGCCGG GAGAAACATC CCCTGAACCG CGGGGGTGAC CGTATACTAT 
52681 GTTTCCTAGA GCCGTGCGTA CCATGTCGCG CCCTGATATT CCCGCAGTTA ACAGGGAATC 
52741 TGCTAAACGC GGATCTTAAA CACGTGAACC CTGAACGGCT GGCCGTTGAA TTCTCTGAGC 
52 801 TGAGGGAAGG CGTTAGTTTT CTAAATAATA TATGTGGCAT CGTTCACTGT GACATCAGTC 
52 8 61 CAGAAAATAT ACTGATAAAG GGGGAACTGA CAACTGCGTA CGGG AG ACTT ATGATCGGAG 

52 921 ATCTAGGGTC CGCCTCTTTA CACACGGGAA CCCCTTGGAC CGGAGTGATG GTGACCTCCA 
52981 AACTCGGGTT CGTGCAGCAC ACGTAC C ATT TTAAGGCACC GGCCAGATTT ATCTGTAAGC 

53 041 ACATTTACCG GCCGTCGTGT CTCCTCTACC GGTGTCTGCT GTCGTGCGCC GGGGGCCCGC 
53101 AGGCGCATAT GCTAAATCAG CCGTTCCAAA TCACTCCACA ACTCGGTCTC ACAATTGACA 
53161 TATCGTCCCT GGGTTATAGT TTGCTAGCAT GCCTAGAGAA ATATCTTCAG CCAGCTGACC 
53221 CATTTCCCCA GCAGGGAGCG TTGGCGGACG CTTCCTCCGA ATCCGCCCAC CCATTGTTCT 
532 81 ATTTGCGTTG CATGGTGCCA AGAGTAGTCA TCGCCGAGAT TTTTTCTGTT GCCTGGGACG 
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53 341 TTCCACTCGA TTTAGGCATT GACTCATCTG GCCACGCGCC AGCTATTCCC CTGAGAGAAG 
53401 CGTACAGGCG GTTTTTTGCC AACCAGTGTA GTTTATATAG GGCGCAATAC AAAGAGGATG 
53461 CGTTAGAAAA CGCATCCTCG CGGCTGTGTA ACTCAAAACT TAAACTAGTT CTCCAGAAGC 
53 521 TGTTGGTCAG GGACTACTTT AGTCATTGCG GAAACTGCGG AGATCATGGA TTTTTTCTCA 
53581 GATGAGCCGA TGGTTCAGGA GATGGCGCTT CTCGACATCG ATGAGCAGCA GCGGCTCCTC 
53641 TCGAAAATGA GCCTGGCCAA CTTTTTAAAA CATGAGCGAG TAAGGGCGTT TTTTAGCGAT 
53701 AACAAAAAGG AAAT AAG CAT GCCGGCTATA CGGTTCGTGT ATAATTTTTA TCTATTCGCC 
53761 AAGGTGGGAG ATTTTATCGG CAACACCGAC GTGTACGATT TTTACGTCAC TTGCGTGTTC 
53 821 AGGGGGAGGC GCCTGACGCG CCTGTCGGAA GTGTACGACG CGTGCCTAAA CATGCACCCG 
53 881 CACGATCGAC ACCACGTGTG TGCATTGATA GAACAGGTCA CGCGCGGCCA AAACATCAAT 

53 941 CCTCTGTGGG ACGCTCTGAG GGACGGCATA ATTTCGTCGT CAAAATTTCA CTGGGCCATA 

54 0 01 AAACAACAGA ATTCGTCCAA AAAAATTTTT AACCCGTGGC CTATAGTCAA CAATCACTTT 
54 061 GTAGCGGGCC CGCTCGCGTT TGGACTGCGT TGCGAGGAAG TGGTTAAAAA GATACTGGCG 
54121 ACGTTGCTGC ATCCAGG CGA GGCGCACTGT GAAAACTACG GATTCATGCA GAGTCCTCTC 
54181 AACGGGGTTT TTGGCGTCTC CTTGGATTTT GGAATTAACG TCAGGTCTGA CCCAAAAGAC 
54241 GGTTTGGAGT TTCACCCAGA CTGCAAAATC TATGAAATAA AATGCCGGTT TAAGTACACT 
54 301 TTTTCCAAGA TGGAGTGTGA CCCGATTTAC GCTGCGTATG CTAAACTTTA TCAGAAGCCC 
54361 AGCATGCAGA CGCTTAAGGG GTTTTTGTAC TCCATATCTA AACCGGCGAT CGAGTTTGTC 
54421 GGAGAGGACA GGCTCCCCAG TGAATCGGAC TATCTTGTGG CATATGACAA AGAATGGGAG 
544 81 GTGTGTCCGC GGAAAAAGAG ACG CTTAACT GCAGTACACC ATCTAGTTAA AAAGTGCATG 
54 541 ATTCACAACT CTACGGCGCC TTCTGATGTG TATATATTGT CAGATCCGCA GGAAACCGGA 
54601 GGCCAAATTA ATATTAAAGC TCATCTGAGC GCCAACCTAT TTATAAACGT CAGGCATCCG 
54661 TATTATTATC AAGTGTTGCT CCAGTCTCTC GTTGTACAGG AGTACATCAG TCTCTCCAAG 
54 721 GGAACTAAAA ATTTGGGAAC CCAGAAAAAC TTTATAGCGA CTGG CTTTTT TAGAAAACGG 
54 781 CAGTTTCAAG ACCCGAGCTG CTGCACGATC GGTGAATTTG CCCCGTTGGA TCCACACGTA 
54 841 GAG ATAC CGA CCCTTTTAAT CGTGACACCG GTGTATTTTC CCAGCGTGGC CAAACACCAA 
54 901 CTGGTGAAGC AGGCGACCGA ATTCTGGGCG GCTAGTGCTC GTGAGGCATT TCCCGAGCTA 

54 961 CCATGGGATT TATCCTCTCT GTGTGCAAAC GCCCCACCAA CACCGTAGAT GTGAAGGGGG 
55021 AG C C CAT AG A TGTATCCAAA GAATTCGATC CTATTATAGG AG AAGAAAG C ATTGTCTTGT 

55 081 TAACGG C AG A TGGGACTGCC CCCGCGGCGC TGTACAAACC CAAAACCAAG CCATCCAAAC 
55141 ATAAAAACAA TAAATTGTCA GATTTTGTTT AAGCATTCGT ATCTTTATTG GAGTGGGGTG 
55201 GATGGTGTGG GGTTGGGAAG GGAATGGGAT TGAGGGGAGG ATGAAATGCT AGAATCATAT 
55261 GTATTTTGGA TATGCATCCT CGTAATCGCT CACGTCTTCT TCTGTTTCAT ATAACGCGTT 
55321 CAGTTCCGGG GAGTACTTAA CTTTTGTCGT CTTCTCTTTA GGGCCTTGAG GAAGGGGTGC 
553 81 GTATTCGGGC GACTTTGCGC ACCATTTCCA ATCGTTTCTG ATAATTCTTA AGATGATGGC 
55441 TAGGATGCAA GTGATCGGAA TCATAGAGAT GTTCACGGCT ATGACTTTGT GCAGGCTGGC 
555 01 AGATATGAAT ACCTTGTCGT ATCGAACGAC GGGAAGAAGC AGAATCAGGT AACTGACAAA 
55561 AACGC CAACG TAGAATCCGA TCTGGTGCTT TAGGTACTTT ACTAGAAATA GTTCTGTGTT 
55621 GATATACCAA GCTATTGTAA GGACAAAAAA CAAGTTTATT GAACCAAAGA CAATATCGGA 
55681 AACGAGCATA TAG AAACTG T TTCCAATAGC CATCATCGTT CCGAGGGAGA ATACTAACAT 
55741 CTCCATGGCT ATTAAGGACA GATAAAGATT GGCGCCAATC GGTTTTCCGT AACGC AACAG 
55801 GGTGTCTAGC AAGCTATTCT CAGGAATCTG CTGCTCTAAG ACG CGT AG AG ACCAGGTGGC 
55861 CGAATTGCAC GATATCATAG CCTGGACGTG CGTAAACGAT AGGCAAAAAT GTATACAGTA 
55921 TACAAACGAG GCTAAAAGTA TATGTTTGTA CGACAGAACG TGAATAAATA ACTGGATGGT 
55981 CCACAGCCTC AAAATAGACA TAAAAACTAC ACTGTGGGAG CCTATGAGCA CGATCCATGT 
5 6041 CTGGAGGCTG GTCATTGTTG TCGCGTGCAC GCGCTTGGCC TTGAATATGG CCAGGATGGC 
5 6101 CCACGCATAG TAACACAGCA GATATCCGTC CACCAAAAAC GCCCACGAGA TGTAGACAAA 
56161 CATTTCTGGT GCTTCCAAAA ACAAAGTCGG CGTCAGATGT TTAGCAGAAC TTCTGACCGT 
56221 TAGGTTTAAC GCGCTGTAGT TGACCAACGT GTTAAAGTAG CACGGAAATC CAAGTCCCGG 
5 6281 GAATGTGGCG GTCAGTGGCA CTACCGCTGA CATTATAAAC ATAAGTCCAA GAATAACCAG 
56341 TAGTTTTACC CATGACGATA AGATAAACGA GTCGCTCCGT GAAATTTTCA TAGTTCAAAG 
56401 AATCTGCTCG CGGGTG CGTT CGG CAAACT A GCAGTAAAAT CAGCAAGTGG CTTACATGTT 
56461 ACGGATTTAA ATAGG CTCCG CTTTTAAATA TCACATTCTG TGCAAAGGAG TTGAGCCACG 
56521 GCACCATGAA CGCCCGGGAG GTGGCACTCA CGGGACATGT TTTG CACAT A TCGCTGCATA 
56581 GCACGCACGA G CGCG AG AAA TTAATAATCT GGCAGGTTCA TTTACTTGTA TGTCAACAAT 
56641 GCGGAATTCA GGGAGATGCC GCATATCTAT TTGTCACCGA AACATTAAGC AATACTGACT 
5 6701 GGGGAAATAT AC CG G C GAT A AACCGTCACG CACCGTCCAT AAATGAGCAC GGCCGTAATT 
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5 6761 ATATGCAGTG GGAACTCCGT ACTCGTTTAC GGAATCCCAT CATTCAATTG TTAAGTCGCC 
56821 AGCCCGGTGC GGTTAACGTA AGGGTCAGCG AG CCG AATAT GGTAATAGTG GGCTGCGAAC 

56 881 GAGCGTTGGA TCACTCGTGT TCGGTGCGCG TGACTGGAGC CTATCTTCAT TGCGATACCA 
5 6 941 CTATGGACTT TAGTTTGGAT TCTGTTGTGT CCCCAACCCG CGAATTTTGG TTCTCAGAGA 
570 01 TGTTTTCTCA CTGTTTAGTT TCCAACATTG AAGTTTACCT TAAAACAACG GGCGGGTTAT 
5 7 061 ACTATAGGGC ATCGAGTGCC ACGCAATGCC GAAAAAGGGC GAAAGATGGC GCATTGGGTA 
57121 TTCTTGATAT CTTTAATTGC GAATCTCGTG AAATACAAGT TGCCGGGCAG AAGTACACCT 
57181 TGAGTATCGC CACCGCAACA TTTCACGTTC TCTGGGTGGA CGAGGCGTGT ATGTGGAACG 
57241 GGGCCCTGGC CGAATTTTTT AGGGCGCTGC ACAATAAGTT GTTCGGCGAC CGGGAAGGCG 
573 01 TAGCGCCAAC GTTAACGTAC GTGTGTCCGG GGGCCACTCC GGAGGGAACC CCCTTCCCCC 
5 7361 CCTACTTTTC CGCGTTTCCA CACCTCCCGC TCGTGTTTGG AAGACCGCGA AGGCTCGACG 
57421 TAACCGCGGT CCAAGAACTC CCAAAAGCAC AAATTG CGGT ACACTGGCCC CCGTTTAAAG 
57481 ATTCAATCTT AGGGGATCAG CTTCTCATAC CTGGCATTTC AC CT AAAAAG CCAGGTACCG 
57541 TACCCGTTCG TTGGCCGCTT TGGGTGGAGG ATGTTAACTT GAGTCTCTGC GAGACGACAG 
57601 AAAGCGTTGC CCGCATAGTC GACCCACATT CTATAGTAAT CATAAAATTT TCATCACTGT 
57661 TGTGCCAGCA CCTAAAATGC CACCGTGCGT TTGTTAAAAA TGAGTTAGAA TACATAGCAA 
57721 C CAT CTG TTC CAGCGACCTT CGCCTCTTCA TCCAAGAGGA ATACAACCGG TTACTTG C C A 
57781 CCATTTTTAC GTGGGCCGCG GCGAGCGGGT AT AC CTGGG C GGCCATTGAT AAAACAACAG 

57 841 TATTCATCAA GGCTCCCCAG CTCAG CGCAG CTGTAAGTGG TTTCTG CCCA TCACTAAATA 
57 901 GTTGCCGTAG GAAACAATGT TACGAAGGTT AAAAATAACA GTTCATTTCC TTTCACAGGA 

57 961 AC AG C AAAAG GTCGTGACCC GTCTTGAGGC GCATTTGGGA CTTCCCGTAC AGGAAACTTC 
58021 CCACCCGCCT GACTGGCTCA AGTGTGAGGT CTGCTCCGCG TCCGTGTTTT T AAAAAT AC C 

58 081 AG CCGGGGTT TTGTATGCCG GACTCGCAAG AGACCCCACC AGGGAAGCAA AACGGGACTC 
58141 GTGG CTGGAC TGTCTAGTAG AAGGCGCGAC GTTGTTG CTT AACAACTCAG TGTTACCGAT 
582 01 TGGGGCGCTG GCGGGTATCT TACCCACCCT TTTTGCCAAC AGG CGGTGTG TTAATTTTTG 
5 82 61 GCTGCTGCCA CGCGCGTGGG TAAAATCGGC GCCCATATGC CCTCCCCTAC CGATTGACTG 
5 8321 TGTTACGCCT CCACAGTTTG TCGTGACAAA GCGTGGACCA ATCTGCTGGT ACAAGGAATG 
58381 GCCGTTACCG GTTGACGTTG ATTTTATGTA CTACCTACAG GAGGCACTAT GTGTTTTTAG 
5 8441 TGTTGTGTCC AACGGGGAGG GTACGGAGAG TCACGCGGAC AATATACGAC AATTAGAGAA 
58501 GTTTGAAAAG GTACTATGTT TATTTTAAAA CAGAGTCTGT CTGGGGATGC TGTGTAGGCT 

585 61 GGTTATTTGA AAATATTTTA ATACCGATGC GTTATGAAGG TGTCTGGTAA AAATTGTGCC 
5 8621 GCAATAAAAT ATGTTAAAAG TCTCCGCGTG GCCCTCTTCT TCTCCAACGT TCTCTGTTAG 

586 81 ACGAAAGGAT TCCAAATAAA TGGGTTTCAG TAAGCCTTTC ATGTCGTCTG CGTTTTCGTC 
58741 CAACAGCTTA TGGAGTTTGA AC CGCACTAG GGGCGGAACG CGCAAAATGC CTTGGGGTGG 
58 8 01 GTTTAACTCG TGGTATGCCG CCAAGGTCGA CGCTATCACT TCGATCCCAT ACAATACAAA 
58 8 61 CGCGGAATCA TTACACGTAT ACCT CTCGTG TAGGTACCTG GCCCGGTCAA GCGCGGACAG 
5 8921 GATAAATGAT AGGGGGGGCT TATCGTAGTT TTC CAGCATA GAAAGAATGC AGCAGAGTTC 
58981 TGTTACGCTC GCGCTCGCCG GCTTAAACAG CTGCACGCGG TCAGAAAAAA CGTGCGGATA 
59041 TATGTCCAGT GCGTCGATCA TGTCCTCGTG TTCGACCACG CACGCAAAGA CAAACCCAGT 
59101 AAACGTTGCC GACTGAAGCC TGGTGCGCAC GTGCACGCCT AAATCAGGAA CGTTTAGTCG 
59161 TCCTCGAACA C AC AG CTT AG GCGTATTGGA AGCCATGCAA ATGCTATTTT TTTTATGTAC 
5 9221 TTCCAACATC AGTCGCGGAA AAACTGGATA ATTATAGGGC CCTCTGGACG AGCATTCTGT 
5 9281 TTCAGGCATT ACCGCCTCAG ATCGGTGCTG CTCCCCCATT AGACG CTTCA GTATTTGGTC 
59341 CATCGGTTCC AGGCTGGCTA ATTTGTTCGC AATATTTGTG GCCTATGTCA TTTATATACA 
5 9401 C AG AC AG C CT GCTGTCCACA TACAGCTGGT CTGCAATATC GTGTAAACTT AAGGAAATGA 
5 9461 GAGAGTGATT TATGAGGTCC ATGTGAGTCA GTTTGG CG AT TATGTACGTA AACGGGCCTA 
59521 AGAGAATGGA AACCGTGTCC TGCGAGTAGG CGATGCTCGT TCGCTGCCCC TGATTGTCGA 
59581 CCACGGGTGT GAGGTTGTAC GTTTGAAACA TTTCCCCTTC CCACAGGCTA GTTAGCTCTT 
59641 TGAGCATCTC CATAAACGGA GGG ACGTATT GTGAAAAAAA GCTGTTTACT ACGGAGCTGC 
59701 CTTTAGGGAT GTGACATTCG GATAGGTTAA TGGTTGGGTC GCGCAGACCG G ATAAAGC CT 
59761 GCAACGTATT CTGAGTGAGT ATATTAATAT CGTGTCTGGG CCTGCCTCCC TCTGGTAAAC 
5 9821 TCTGAAGCAA CTGCGTTTCA ATGCATTGAA TCTTCTTTAC GTAGTCGTCT CGTTCTTTCT 
59881 CAAGGTTAGA AATTGTATCG AACTGCTCGT TGATCTGGTT CGTCAGGCAT TTGAACACGG 
59941 TATTTGAAAC CTGTTTCCGG AGACCCTGGG TCGTACTTTC CGTACCGGTC CCAAAGAGCT 
60 001 TGTTTTTGTC CACAATG TTT TGAGAGACGT CTGTGACGAA ATCCTCAACA ACGTCTGTTA 
60061 TGCCGCTCAC TGTTTTGTTC TCCGCGAGCT TTATCAGCAA GTTTAGGAGC TCCCTCTTGG 
60121 GGTCGGTTTT TTGAGACGAT TGCTCTAAGC GAGCCAATAT GTCTTTATAC ATGTCGTTGC 
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60181 ACGTGCTACC GACCAGCGCT TTTAGAGGCG CGGTATTTAG TAACTGACAT AACTTTGCGT 
60241 GCTCCGCGGT CCTGTGACAT GTCATAATCG CGGTAAATAA CCGGTACAGA GGG CTATCAA 
60301 AGGCGACGCG TCCATTGTAA ATTATAGGCC GTGAAACAAC TTCAACTGTT ATATCTTTTT 
60361 GTCTGTAGGT GCCGAGATAA GCAG CCTTAT CTC CGTTAAA CTCCACAGAC ACCTCGGCGT 
6 0421 AGTCTGGAAT ATAAATAGAC GTCACATCGG TCATTATCGA GGTAATCTCC TGTGCCAGTT 
60481 TGTCGACAAT CCGCGATATC GTAACGTCTC CGGAAGCGTA CGTTTTCCTC ATCTGTTGAC 
60541 AGTGTGCTTG TATCCTGGCG ATCGTGTCGC CGTCGGGGGT GCGCCTGACA ATAGGCACCA 
60601 ACb&AAGGGC GACCACCCAG TCGACGTATC GTTCGTACGA TAAAACAGTC CTGGTGTGTA 
60661 GTACCTGATT AATAGTGTTC AGTAACAGTT GATCGACTGT TAGACGAACG GTACGTGCCC 
60721 ACGATTCAAA CGTAGACCGT TTGAATTCCG GCGTCAGGTG CTCGACACCG CGCCAACGGG 
60781 TCTTCAGGGT GGTCTCGAAG TTGGTCCAGT CCTTTAAGAC GTTGTCGTAT GTGCAGCTCC 
60841 CCAATGCCGT CTTGTACAGG TGTATGAACA GCTGTCTTCC AAAGATCCCT GGATTTCGAA 
60901 GACTGGAGTG TAGCGTTTGA CCGCGAACGT ACGCGTACTT GCCCTGTAGG ATCTCGAAAA 
60961 GCGAGATGGA GAGTTCCGTC GGATGGACCA AACACGTCGA ACCAAATCCG GGGTTCATCT 
61021 TGAACATGAC GTCCGATGCC AAAGTCAGGG GTGTCGTGGA TCACGTCAGT CGCCTGTCAA 
610 81 ATATAACTAC CAGCCCACCG GAAATGGGTT GGTACGACCT GGCCTTCGAT CCGGCTGAAG 
61141 ACTCCGGGCC GTTCTTGCCG TTTACCGTTT ATCTAATTAC GGGAACTGCT GGTGCTGGGA 
612 01 AAAGTACCAG CATATCGGCC CTGTACCAAA ATTTAAACTG CCTGATCACG GGCGCGACCA 

612 61 CCATAGCCGC ACAGAACCTA TCGCGTCGCC TAAAGACGTT CTGTCCCACG ATCTTCAGCG 

613 21 CTTTTGG CTT TAAGAGCCGA CACATCAATA TAGCCGTCAG AAAAG CTCAT CAGACCGGAG 

613 81 CCGTATCCAT AG AG C AAATT CAGCAACAGG AGCTATCGAA GTATTGGCCG GTTATAGTGG 

614 41 ACATTATGAA AGAGGTTATG GCGAAAAAAC CCAATGGCAT GTACGGGACT ATATCCAACG 

615 01 CGAATTTTGA AACCCTCTCG AGAATGACCG GACCGTGTTT ATGGACTTCC AATATTATTG 
615 61 TAATCGACGA GGCCGGAACC CTGTCCTCTT ACATACTCAC CACCGTCGTG TTCTTTTACT 
61621 GGTTCCTAAA CAGCTGGCTA AATACCCCTC TTTACCGCCA GGGGGCGGTT CCGTGCATAG 
61681 TATGCGTCGG TTCGCCAACG CAGACAAACG CGTTCCAGTC AACTTACAAC CACGGGACGC 
61741 AAAAGACGGA GATATCGTCG TGCGAGAACA TCCTAACATT CATGATAGGG AAGAAGGTCG 
61801 TATCTGAGTA CGTACACTTG GAGAGAAACT GGGCGCTGTT TATAAACAAC AAGCGCTGCA 
61861 CCGATCTGCA GTTCGGACAC CTTCTAAAAA TTTTAGAGTA TAATCTTCCC ATTCCTGACG 
61921 AAGTCATGAG T TAGGT AG AC AGATTTGTCG TTCCTAAAAG TAAGATTATG GATCCTTTAG 
619 81 AATACATTGG CTGGACCCGA CTCTTTTTGT CACATAGCGA GGTAAAGGCG TATCTAACAA 
62 041 ACCTACACAC ATGTCTAACG CTAGGGGGCG ATACCAGGGA CACGAAGCTC TTTACCTGTC 
62101 CCGTGGTGTG CGAGGTGTTT GTGAAGCCGT TTGAGGAATA CAAGCGGGCC GTCAACCTCA 
62161 CCAACCTCAC CGTGACCGAG TGGGTGACAA AAAACCTCTT TAAGTTAAGT AATTATTCGC 
62221 AGTTTGTGGA CCAGGACATG TCCATAGTCG CCACGGAATC CACTGAACGT TCGACCCAGG 
622 81 TTACCTTTAT CACCAAGTTT GTTAAAAACA GCCACGTGTC TTTAAATGGA AAAACAAAAA 
62 341 AATGCATATG TGGGTTTCAG GGTACGTACT TTGAGTTCAA AAGAATCCTA GACAGCGAAC 
624 01 TCTTCGTGGA GACGCATTCG CAAGACCGTC CGGAATACGT ATACGGTTTC TTAAACACGC 
624 61 TACTGTACAA CGCCATGTAC TCGTTTCACG CGTACGGCGT GACTAGGTCA CACGAGAAAT 
62521 ATCTG C AAGA CCTAAAATTT GCACCCCTCC CGGCCGCTCT GGCAACCGGG CGTGTAGA CC 
62 581 TTCAAACGGT TCGTGAAGAG TTAAACCTGG AAGACGACAT CTTCTACCAC GTGTGCAGTC 
62 641 CCCCGCCACC CGCGGGTATC ACCTCCCTCC AGGTTTTGGT CGACACGTAC TGCGCCCTAA 
62701 AGGACGTGTT CGCCTCCAGA ATAAAGGTGG CGTGTCGCTG GTTTGGCGGG GAGTTTGAGA 
627 61 AGGAAACGTT TTCCGCGTTT ACGGTTAACA TGGTCGTAAG GGACGGAGTT GACTTTGTCT 
62 821 CCCCTTCAGA ACGTCTCAAC GGGCTGTTGG CGTTTGCATC GACCGTTGAA TCGTATAAAA 
62 881 TTAAGGGGTA CACGTTTTTA CCGGTAGCGT TCGGTCGCTG TCAGGGTTTG CCACTCAGTG 

62 941 ATGACCTCAG GAAGAAGATG CCCTCCCTGG TCGTGCAGGA CTCTAGCGGT TTTATCGCGT 

63 001 GCCTAGAGAA TAACATAACC AAATTGACCG AAACCATGGA GGACGGGAGC GTTTTCCAAG 
63 061 TGTGCTGTGC GGGGGACTAT GGGGTCAGCT CAAATTTAG C CATGACCATC GTAAAGGCAC 
63121 AGGGAATGTC GTTGGAGCGA GTAG CCGTAG TATTTGGGTC CCACAAGAAC GTCCAGACAA 
63181 GCCACGTGTA TGTAGCAATA TCAAGGGCTG TTAACTCAAA TTATTTGGTC ATGGACAGCA 
63 241 ACCCCCTTAA AACCCTCCTC AGAGAACCAG TCGATAACAC CTCCGCCAAG CATATAGTCC 
63301 GGGCCCTCCA CAACCCAAAC ACAACCCTCA TCTACTAAAT AAATAAAAAC AAGCAAATGG 
63361 AAAACATTGT GTTTTTATTC AGTCCAACCA CGGCCACGGA TAGTTGTCAT TTCCACACAC 
634 21 CGGGGGGGTG GCCGCCATAG TTTGACGACC AGAAACGCCC GGGCCCTCTG GGTTAGAAGT 
634 81 AGCCCCGCCC CCCGTGGTCG AGGTCATACG TCTCTTAGGG CCCTGGGTTG TACCAACATA 
63 541 AATCACGTCG TCATCACCGT TGCAGTCCCT GCGTGACGCG CGCCTCTTGC AGG CGGCACG 
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636 01 AACACCGCCG CTTGTAACGT GAT CTG CATC GTCGTCGCTG TCTGAAGATG AACACAGGTC 
636 61 AATCACATCA GTGGCGCCCG TGCCCTCATC GGCGTCTGAT TCGTCCCATG TCTCAACAGA 
63 721 GCCTCGGTCG TCCACTTCGT CGTCATCTGA AATAATAATG GTCTCTTGTC CTGCAGTTGA 
63781 TCCAGCAACA ACAGGTTCTT CTTCATCGGA AGACCATCCG CCCTCTGCCG GTGATCTAGA 
63 841 TATATCACCT GACCAGGCGG GCGAGGCCCG TGGCGAAAAG CCGTCAGCTT CTCCGCCACT 

63 901 TTCATCAGTG TCCTCGTAGT CAT CTATAAA AAAATCGTCT TCGCTGTCTG ATTCAGATGA 
63961 CTGGGGTGCG CGGTGTTGGC CAGTAACCAC CGCAACGGGC CGCGGACCAC CAAACGCGGT 

64 021 CAGTGGCGCG GGCGCGCTCG GTGGTGTGGT GGCACCTGTC GAAGTAATGA CGGTAGGTAG 
64 081 CCGGCCGCTC TGGTTAAAGG ACTCACCGTC CCGTGCGTGC GGATCTATTC TAGGTGGGGA 
64141 GGAGACCGGC GGTTCGTCTT CTTCATCCAC ATCACTGTAG TCGCTGTTCT CCGAATGAGA 

642 01 GTCGGTTTCG TCTGGCGTTT TTGGAGTAGT GGGAGGGGAA TCTTGGTCAA AAACATCCTC 
64 2 61 AGGTGCAGTC ATGTCAAT C A CTGGGGGGCC TCCGTACCCG GGCGGTACCG GCAACCCCGT 
64321 ACGGGACCCG GCGAACAAAA AACGTCTTGT TCTCTTTTTC CTAGGTGCCC CGGGAATCGG 

643 81 CAGCATCCTG GGAGTGGCCG GCGGGGTCCT TGACGGGTCG GATAAGAACA TAGCCATGGC 
64441 CGAAACGTTC ACCTGTAAAA CGCACTGCCT ATCCCCGATC AGTTAATATT CCAGTCAACG 
64 5 01 GCACCCCTCC GATTCTGAAC TAGATAGTCA TTCGCAAGTT TAAAATGGTT GCAG CCTAGA 
64 561 AACTGCGGCC AGGGAGAACC GGTTGCCGCG TGGGCGGTAG CCAGGGGAGA CGGATGGCGC 
64 621 GCCTTGAGGA CGAGATGTCT CTGTCCGTTT ATCAGCGTCG CCTTGTCTAT AGCCTTTCTT 
64 6 81 CCCCACAGCA TAAAAACGCA CCTCTGAAGC TTG G C AG AAA GGCAGCTTAT TATGTAGCTG 
64 741 GTGAACCAAT CCCAGCCAAG GTTGGCGTGT GACCCCGCCT TCCCTCTTTC CACCGTCAGA 
64 8 01 ATGGTGTTTA AAAGCAGAAC TCCCCGTTTT GCCCAACAGT CTAAGCACCC GTGAGAAGGA 
64 861 GCGGTGAACC CAGGTACAGT ATTGGCTATC TCTTTAAAAA TATTTTTGAG GCTTGGAGGT 
64 921 ATACTGTAGT CCGGAGCAAC GCTAAAGGCT AGTCCGGTGG CTTGACCGCG ATGGTAAGGA 

64 981 TCCTGGCCAA GGATCACGAC TTTAATATCC TCCGGTTCGC AACAATACGA CCACCACATA 
65041 ATTCTATCTA TCGGTGGATA AATCACGGTT ACGTTGCTCA TATCCATAAC GCGCTTCAGC 
65101 AGAGCCGCGA GCTTCTGTTT TAAAAATGGA GAAAGATTTA AAAATTCAAG CCATGAGTCG 
65161 CTAAGTAGCA ACGTTTGTGT GGAGGGTTCT TCTAACACTT CTGGTGACAT TTTACTCCAA 
65221 ACTATTGTTT TAAGCCAACC CTCCATAAAT CCCCGCGTCT GGTAATTACT TTACGTATAC 
652 81 GTGAAACTTT TGTACTTCTG TCTGAGAAAT CCAACAGAGG TGGCCGCGTA TAAGCAGTTC 
65341 CG CTTTC ACT AC CGTTTATG TGAAAGTCTT CAAACTCGGC TAAAAACTCA TCAAGACTAT 
65401 ATTG CAT AT A ATTCAACAAT TGAAGCTCTT CTCTGGGGAC ACTGTAATTT AACTTGTACA 
654 61 AAATCCTTAT AAAAAACGCC CTAAGGTGAA ATCCATTTAC ACATATTTCT GTGGTAGAGC 
65521 GATCAGCTTT ATATCGCAAT GTTGCTGTAC ACAGCCCTGG ACATAAAGCT AAATTTGTTA 
65581 AATGAAAAGA AGCAATCTCA AACGGCTCCG GAAATATAAA GTCACTAAGT GGTATTATTT 
65641 TACAACATGG ATTTACTACA AGCCCGGTAC AACATGTAAT TAAAAAAAAT CCACATGTAA 
65701 TAAATAAACT TAAGGTATAC ATACTTCTCA TGTTTTATAC ATAATATGAA CGCTAATGCT 
657 61 GTACTTATAT AATATGTATA GAATTATGCC AATACAGTCA TGGTAAGTAT ATTTTAAGGC 

65 821 ATAATGCAAA CATCAAATAA TATAACACAA AATGCACGCT TCCGGATATC CGCCCAGCTG 

65 8 81 TTAACCCGAA AATACAGAAA TGACTACACA AACACACCTG AAACCAATTT TATTCTCAAC 
65941 ATATGCAACA ATATTTCAGG GT AAC AC CAT GTTAATAAAA TACGCAGCAT GCACATTTTT 
66001 AGCTAAGCTC CTCAAAGACA ATTTCCTCTT CAATTGATGA CTCATCAACC TCGGTTTCCG 
660 61 TGTTAATGTC GGAAATATAT GAGTCCAGGA TACCCTCGTC ATCGCCACAA ATTAATTCCA 
66121 GGGCTTGTAA AATATCATCC AGTGAATCAG CTGCTAATGT CAAGGATGTT GTTTCTAATT 
66181 TTAGCCCATC CAAATGGTGA GTGGGTGACA CTTTGGGACA AGACCCCGTA AAATTGGCAC 

662 41 GCTCCACACA AACGGAAGGA TGCATTTGAT CGTTCACGAG GGGCAGCATA TTTATTTTTA 

663 01 TAACATCAGG AACCGGACCC GTCAATGACG CACATATGTC CATTAAGATT GGAGTCGTGA 

663 61 CACTGGTTGA ATTGACGGTG AAAAAAAATG TATTCGTGTC ACATGCGTAA CTATTTTTTA 

664 21 ACCACACGAG GGCGGAATAA AACGGGTCCC CGGCGCTAAG GCCGTGAATG CGGCAGGCAC 
66481 GG CTCATTGC CATTTCCAGA TTTGTTAAAT CTGTTCGAAC ATACGAGTCT AAAATCAAGT 
66541 ATAACCACGA CAAGTTCAAA CAGGAGGCAA CGCGGCTGGT TTCCACCCGG ACATCCCCGC 
66601 TCACAAACCC GAACTCTCTA GGAACGCACA TCAAAGCGTT CAGACAAAAT TCCGAAACCG 
66661 AAACCCTAGT TCTG AG CACC ATCTGGTTAT AAACTGCGGT TAAGAGACGA GCGGCCACAC 
66721 ACTG CTGCTT GTGAGGATTC AGCTTAGGTG GCCTGCAGGC TTG CTG AC AG GCCCGTAGCT 
667 81 GCCTGGCGGC GCTCGCGCAC CTTTG AC AG C CGCACGCCAG TTCCAGAGCG AGGCAGAAGC 

66 841 GCTCTTTGCA GTCGCGCCAT ATCTCGGCCA TAGGAGATTC GCTCGCGTGA TCTTTTAGGT 
66901 G CAT AT AT AT CTGTTGGGTA ACAAACCTCA CTCCCTGTAA AAGGGGAATA AGGT CCGTTC 
66 961 TTTCTATCGT TTTCTCTGAA TCTATAGTTA CATCTTTAAG TAT AAC C AAA GACGCTAAAA 
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67 021 ACCCAGGATG GTCGACACCT TTAAAAAATC 
670 81 TTCCATGTCG TTCAAAGGTA GTCACGATGG 
67141 CACCCTTAAC TGGAATGGAA ACAGCCATGA 
6 7201 TAACCCCGAA ACAGAGCTAA ATACCAATGA 
67261 TACTATTAGA CCAGGGTGAG TAACCACGCT 
67321 AAGGTAAACA GCTCGTGTAT ATTTCAGACG 
6 73 81 AGGTAATTAA ACTTTTTTTA TTCAAGATTC 
6 7441 GAG CATATCT CGCCAATGGG AATGGCTGAT 
67501 TCTGTGTACT TTACTAGTGC CGATGAACCT 
67561 AAAAACTGCC TGGCCTCGCC TTTTAGAAGC 
67621 TTGTCTTTAA AAATGCAACT CTGGTACATC 
67681 TTTAAAAACA CGTTAAACTG CGTGTCCTGG 
67741 AAAAATTTTT TAACCAAACT CGCCAGGTAC 
67801 GTGTCTATCA TCTCCTTCAC CTCCCCTTTG 
67861 AGGCACCAGC TCAGGGGGGA GGTCGGATCG 
67921 TACTC CAGTT TTAGGATCTC CGGCAGGACG 

67 981 CCCGCATCCA CCAGGGGCAG TACTGAGTTA 
6 8 041 TTGTGGAGCT CCTTTAGACA GTAAACCATG 
6 8101 TCTCCTTGTC TGTGTTGCAT GAG C ACCTGG 
6 8161 ACTCTAAGGG TTGCCGCTTG G TAG AGCGG A 
6 8221 TCACCGACTA TCATGCCGAG TGGCGGATAG 
68281 TCTTTCCCGT AATGGCGACT CATGGTCGTG 
68341 AGAAGGTCCT GCGTGGACAA CTTTTGCGGA 
6 8401 CACGCGGTGC TTCTTATAAA CGAAGCATAC 
68461 AAGCTCACAG AAGAAGTCTG TCGATGTATT 
68521 TCCGGGCTCA TCGCGGATCT GAATCTCTTT 
685 81 GTTAAAACCC GCGGGGCCGC CACCTGTAAC 

68 641 GTGAGAATTC TCACCGAGAG GGCCCTTTGT 
68 701 TCCAGCGGGG TCGTTATACC GCCC CAGTTG 
68761 ATGAAGGCCA AGTGTTTGGG GGCGTGGCGT 
6 8 8 2 1 x GCTATTGCCG ACTCGGTCCT CGTCACTTAT 
688 81 CTTAGACTCA AAGCACTGTG CAAACTTGTT 
68941 GCCCCTATGC AGCTACTGGC CAACGGGAAG 
69001 GGTACGGCCG AGAAGAGGCG CCCTGTCGTG 
69061 GGAAGCAGCC TTGTCCCCGA TTCCTTAATC 
69121 GCACCCCTGG TTGACCTCAG TAATGTCTTA 
69181 CCCCTGAGTC AATTTGTCAT CACAAACACG 
69241 ATTACGCCAA CCCAGGGCGT TGTTCCCGGC 
69301 TCACAGAGCA TTCACCACCA GGCACAGACG 
6 9361 ACATCTGCGG CCCCTGGCCC GCATATCGCA 
69421 GTGGAAGGGT CCGCGGGCGT CGATTGCGGG 
69481 CCGCCCAATG TCCCCGCCAA GAGGATGGAA 
69541 AAAAACGTCC ACATCGGAGG CCG CGTATAC 
69601 TTAACGTCCC CGTCAGGGTC CGGCCAGAGT 
69661 CGCATGGCGG CCGGCTCTCC GCCCTTCGCC 
69721 AAGCAGCCAC GCGAGGACGT GGCAGACAAA 
69781 ACAGTAAACC ACCCTTTCCC GGGACCGTCC 
6 9841 GATTTAATCG AAAGCTC CAC GGATGTAACC 

6 9901 GACATGCTAG CGGCTATCCT ACAGGACCTG 
69961 TCCCCCTCCA GCAACTCGGA CAATGAGGAG 

7 0021 GGCCACGGAT CGCCTTGAAA GATGACCCCA 
7 0 081 CACGCAACCC GGCGGATGTC TAACCGTAAA 
7 0141 GACAAAAATG AGCTTCAAAC TTACCCATTT 
7 0201 TCGCCAGTGG CGACAATTTT TAAGTCTGTG 
70261 TAGAATTTTC ACATCCAGTT TCATATCATA 
70321 GGCGTGTTTT TATCTTGAGA GGCCCCCTAA 
7 0381 GAGACCCCAA GTGCTTGAAT TTATGCCATT 



GCAAAAGCGA CCGGATGGCT CTATCCGCGT 
ATCTCCAGTT AG ACTCTG TC TCCCTGTTAA 
CACCGTGAAC TTCCTGATGT CTCCTAAAAC 
CTGTCACCCC TACCCCAAGC CACGCCCCCG 
ATCCTTTAAA AACCCATACG TGGAGTTTGT 
CCTGTCATAA AATGGATACC GACGACAATC 
AGGAGGGCGT GTTTACAGTG GTGTAGGTGG 
GAATCCACAC TTAGTGCTCG GATCAGGTGC 
AAATTTGGCG TGTTTAGGTC GTACCAACCA 
TCTATGCAAG CTTTAATCGT GTCCAATAGC 
TTTATGACAG TGGTCCAAAA AAAACAAAGA 
TAGTCCTCGT ATATAACCTC CTCAACAAGA 
TGAAACGATG GTACGGACAG GTCGTGAAAA 
AAGGTTTTGG TTACACCGAC CATGTGAGAT 
TGAGGGGGTA ACAATTCGTG GGCGTGGGGG 
CGTACGAGTT CTTCGTCAAA TCCTACGCAG 
ATAAGTCCCC GAATTCTATC GTTCGTAATT 
TTGCTCCGAC ACTGCGGTAC GATGAACCGC 
AGTATCTTAG ACAGGTATAG GGCCCGTTCC 
TTGCGGGCAT TGACGGTCAG TGTGTTCATT 
TGGAACTTGT ACATATGATT TAAAAGATGG 
TTTTAACGTT CACAGGGTGT TAAAAAACTC 
CTCTCCGAAC CGCTCAGGGA TAAGCTGTAC 
GTCAAAAAAC ATGACCCACG TGACCTAAAT 
GTAATGGAAT GTGCCTCCCT GGGTCCCATC 
AACCTGTTCT GCCTCTACCG GGGTTCCCGG 
GTCCCGTGCG CAGAATGCGC CCAGGGCATC 
TGTACCGAAA AGATGTTCAT AGCGTCTGCC 
GCCAGAGTTC TGCACGACGT CTACGCGGAA 
CGTCTCATAT GTTGCAGGCG GCCTATCATG 
AATACCCTGG ATGCCGAGGG AAAACTGGAA 
TTTCAACCCA TCTTTCTTCA AAGAATCTTA 
ATGGTTCCTG ACAATTATTT TACCATCACC 
ACTGGAAGTA CTAGCGGGAT GACGTGTCCG 
CTGCCAGTAT GCGAACCGGG GCTTCTCCCG 
GAAAATCCAG AAATCATCCT CAGCGCCCCA 
CACCCCAGTC TGCCTCAGTC AGTCAGCATT 
CAATGTTTTA TGGACACGTG GAAAGCGGTG 
CCTATTTTGG CCGCCGCACT AACCGGTTCG 
TGTTCCCCAG TTGCCGGCAC GTCTCGGCAG 
AAACCAG CAT GCGTTCCGCA GCCCGCGTTA 
ACGGTAGCAC AGTTGGGAAA CGCTCCCGTA 
GCTCCACTGG TTAATATACC AATAATAGAC 
CCGGCCGATA TCGCCAACAC TCCAGAGTCC 
GAAACCGCCG CAACGGTCCC CGCTAAGAGA 
AGACTGAAGG GCGACGTTCG GGGCGCCGCA 
GGGATGCGCG TTCGCGAG C A GGGCTTATTC 
G CG AACG CAT CTGGACCCAA AAACGACGAC 
TATGGACTAC AGTCCCCCCC GGCCATCGAT 
ATATTTCCAG AGGTTAGTCC GCCATCTAGC 
AGACGCAAGA TCACACCACC GAGACAGTCC 
AATATCACCG GTTGGAAAGT CCCTTGGTGC 
TCTGTGCTAT TTCGGGACAC TAAAAAATTG 
TAAATGTCAC TCTTTCTCAA GTTTCTGTCC 
TAAAGACACC TGTTTTGTTT TCGTGAATCT 
AAACATGCCT CGTGTGAAAA CACAACCCAA 
AGATCTCCAC GGTGGAACAC ACACGGAGAT 
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70441 GGATTCTCAA AACCTGTGTC CTGACGGCCA GGATCTG CTC GGGTCTTATA TCTATACGGA 
705 01 GAATAACGGC CCGTTTTCCC AAATAATGCA CAATGGACAG AG CAATACCG GGACAGGTGA 
70 5 61 AAGCTTCGGC AGCTACGCTG CCGGCGACGG TTTTCTGGGC GGTTCTGTGT CAGGGATGTA 
7 0 621 TGGAAACAAC ACCGGAGAGG GCGCGTGTTC TAAAAGACCG TCCGCGTGCC GTAAACGCTC 
7 0681 GGCTGCACTA ATTCACGCGG CGTCCGAGGC GTCTGTGGCC GAG CAAGGCA CCTCACAGGG 
70 741 GGCACATGCC GTATCTGACC GGATAGG CAG AGACGGTGGC GCTGACAATA GACTACTCAA 
70 8 01 GGTGAGTGCG CGGCTGTCGG ACAAAACAAA GAGCGCCCTT CGCAGCCATC CTTGCTTGCG 
70 8 61 TTGCTATTCT TTGATGTTTA ACACGTAATG TCTGTTTATA TTAACCCTGC CATAGCCCGA 
70921 ACCGAACACT GTGCGATGCG ATTATAAAAC GCAACACTCG GACACGTCAG ATGGACGCGC 
709 81 AAAACGCGCG CGGCAGTGCA AGACCTCAGG AAAAAATAAA GCCCAACAGC TGCAACAGGT 
71041 ACAATAAGCA AATTTAGTAA TTTGTATTTT TTAAGTTTCC GAGGTTTTCC CGCTGGCCTG 
71101 CGGCACCGTG TCCGTGCTTT AGTTGGAATA CCTCTCCTAA CAAGCACGTA TTCCAACAGG 
71161 AGCTGGAAAG TAAAACAACG CAACTGACCG TCCTCACCGC TCAACACGAA GCCCTTCAAA 
71221 AG C ACG CATC ATTTTTACAG AAGATGATGA TATTGATGTG CAAAAACGGG AACAGTAAAA 

712 81 AAAGCACGTC ATGATTTAAA GCCACCTCGG GTTACTAGTT TAAAACTGCC TAATTGATTC 

713 41 TATTCACCCT TCAATAAAAT AAAATTTTAA TACGTTTGCG GTTTGTCCAT TTCCTGTCTT 

714 01 AAAACTATTA AACAATTCAT TTGGGAATAA GGATTTATGG GATTAAGGGA TTAAATTTTT 

714 61 TTGGATCATG GGATTTGGGA ACATACGTCT GGGATGGAGG TTATGCTTCA TGGTCTGGGT 
71521 GGCGTGGATT GCACGGGGAC GGTCGGTGTG CCCAACCTGG CACCTGACAG ATGGGAAATA 

715 81 CGAGGCGGTA TACAGG CACT ACCTCGAAGA GTGCCGCAAA CATGAAGGCT CGGGGAGCCT 
71641 GGACGGTTCC GGACAGACAA AGGGGTCTGG AACCAAAGCA ACCACCGAAG CTAATATATC 
717 01 GATAAGACCT AACGTTGTCA CATCAGGTCA AAATAAAGAG CCGCCTGGGA CAGCACCGAG 

717 61 GGCCGAATCA TCACACGACC TGCCACGCAT CAAGCAGGTT AACGCTCTCC GATTATCAAC 
71821 CCCGGAATTG GCGCAACCAC TCCCGGTAGT AAAATCGACT CCGCGCGAGT CACAGTCAGG 

718 81 TGGGACACCC TGGAACGCGC GCCCCCACGC GTTCATTATG CACACAAACG ACATGCTCAA 
71941 CCCATCTGTG GTCCTGTCTT TCAG AG C CAT CCGTGCGCGG TCCACACGCG ATACCGAGCA 
72 0 01 GTCCGTTCGC GATCGGAACA CGGTCACGAC CAG CTATCGT ACCCCTGGCC GCCCTTCCCT 
72 0 61 CTTTCAAGCC AGACCCTCGT CTCACGGTGC GCGTCTACCC CCTTCGCCCC GAACGATGGC 
72121 AAGATACGCC GAGTCGCGAA CAATATGCGA CCAAAATTGA CCGCAAAGAA TACCACGCCG 
72181 TCCGTTCGAC GCAGTGCCCG TATGGACG CT GCACGGGACC TCAAGCGTCC GTGTCAGACC 
72 241 CATTCCTCCC TTCGCCGACG CCGTGGCACC TGGTTCACCC CCTGTGGGCT ATGGCCGGAA 

723 01 TATTGGGCCT CACGATCGCC TTGTACACGA TATATCAAAT ATATGCACAT AGAAACTACA 
72 361 CGACGCTTCA CTGCGAATAA AGGTTTATTT ATTTTGCACA CTAGTCCGCG TCGTTATTTC 
72421 TGGTCCTGGA CTGGGCGCGC CTCCTCCTGA GCGCCCCGTC CGCGCTGGCA GCG ATTCCGC 

724 81 CTCTGGTGGT CTCGTCCATT GACACGTCCA CGCGGATACT AATTCCGGCG AGGGCGTCTT 
72541 CCATCTGCCC GCGCGTCACG GCCTTGGCCG TGGCCGCCCT CACCTTAGCT TCAATCTTAC 
72 601 GGGTTGCCAC GTTACATAAG GCCGAAACGG CAGCTGCGAT TTTGG CCTCC TTTTCGGTGT 
72 661 TAGAGATGAC GGGGTCGGAG CCGGACCTGG TGGGATCCCC AGAATTTATC AGCTTTTTGA 
72721 GCTGCTTGTT TTCAACGGAC AATTTTTGCA CCTGAGCGGC TAACTCCTCC ATTGTCAGCT 
727 81 CTTTTTTGGG GGCGCGCGTC TTGGGACGCG TGGAAGACAT GGCAGCTACT ACGCATTACA 
72 841 GAGTACAGCG GGACGTTTAA ATACTTTCAG GCGGCCCCTG AGGTCACTCG TGCGCCACGG 
72 901 TTT C AAC AAA TTTAAAAAAG CACATATATG TTAGGTACAG AAATGTTG C A ACCGCGACCA 

72 961 GAACAGAGTT GATCACCGCC CATATACTCG AAAAACTAGA CAGCACGGGC TGGTACGTGT 

73 021 CCGCCGAGCA ATCATAGTCA TAAAAAGAGG GCGTTACGGA GGGAGCCGCG GAGGGGTTTG 
73 081 TAGTCACAGG AGGCGTGCCC GTTCCTGTAG TCGAGGAGGT GGTGACGGTG GCGCACACCG 
73141 GCAGACACAG ATATAAGCAC GCCAACAGGG CTAACGCGAG CACAATTGAC CCTGTCATTT 
73201 TCAAGCCGGC GTTCGGCAGG CAACCCTCCA CTTTGGCACG CGCGCTCTCC TTATATACTA 
732 61 CCAGTTAAAC ATTATGGCAG AAGTGACCGC CCATACGGTT CCGTACGCCT TTGACTCCTG 
73 321 TAAGTTTGAA ATAATACCAA AAAACAACTC ATCTCGTATA GCGTTACGCA ACAAATTTCC 
73 3 81 CGTCGTGGTC AAACCGGGAG AACCTCTAGT CGTGCCCCTG GGACTAAAAA TTATCCGCGC 
73441 GCCCCAGTGC GCATTCTTTC TCAGCGGAGC GCCGACGGAT GAGGTGTATT ACCACACCGG 
73 501 ACTTATAGAT CAAGGGTATC GTGGAGAAAT AAAACTCATC GTTCTCAATA AGACTAAACA 
73 561 GGTCGTCACA CTTTACCGGG GAGAAGTCAA CGTCTCACTC ATCGCGTTTA TGTACGCCTC 
7 3 621 GCCCGGGCCC CTAAAATGCC CGATACTGAA CCTACCACAT TACTCTCTGG ACGCTGGATT 
73 681 TGATGTTACG TCGCCACATG CCATGACCAT ACCCCCAACC GACAGAACCC CGTTTACGTT 
73 741 GAGTTTGTAT TATAAGAGTC CACAGCTCAG CACCCCCCAC GTCCCACTAA TCGTGGGGCG 
73 801 CTCTGGACTG GCGACAAAGG GGCTTACCGT CGACGCGACC AAATGGACCC AGTCGTTGGT 
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73 8 61 ACATCTGAGG TTTTATAACT TTACCAAAGA GCCAATAGAT ATACCAGCAA ACAGCCGCAT 
7 3 921 ATGTCAGGTA GTGTTTATCC ACGAAGACCA CGTCCCAAGC GGGTGGAACA TTCTTAGATC 

73 981 CCGCGTACAA CTCGGCAGCA CCCTCCAGAT ATCGTGGGCC AAAATCAGGT TTACCGACGT 

74 041 GGCCACGCTC CCCAAAACCC ACCCGCTCAA CTCCCGCCAC ACTCAAAGCC AAACCGAACC 
74101 GGAGACCGCC CGCGGCGCAA AGGGGTTGGG GTCGTCAGGG TTATAATTTT AAAACATGGC 
74161 ATTATTTTAA AGTTGTTTAT TTTTAATAAA ACAATCCAAA TTTACGTTAA CCATCAGCTT 
74221 GCGCATCCTG TATTATTTTG TCTGGCACGA TATTATCCGC CACCGCCAAC TGGGGCGTCG 
742 81 GTTGTCGAAT CACCGGATGG GAGGGGTTGG AGGACGCTTC GGTGTCTACC AG C AC ATTAT 
74 341 TAATTTCCGC AGCC CACGTG ACAGAATCAT CGGACGCGAT CGTCGTAGCC AGTTTTTCCA 
744 01 TTTCCAGGAG TGACACGGTC TCGTCGGCCC GTTTATTCAA AAGCGCGTGA AAGGCGCTCT 
744 61 CGTCCACCGT AATATCCCCT AT AG CC AGGG ACAACATGGC GTTCCAGATA ACGTTCTGTA 
74 521 TAGACTTTAG CTTAGATATT TGAATCCTAA CCGTCTCCTG GCACCCCGCG GGTACCCTAG 
74 5 81 CCGCGTCAAA ACGACGAAGA TACTCGGTTA TTTTATTGGA TTGAACGGCC AACAGATACG 
74 641 TCTGCCGAAG GGCGGCGGTG CACATCTCTG CCTGCTTCAC ATCTCCCATA TCCACTCCCG 
74 7 01 GGGGCAGACC CATGGCTATT TCGGTCCGCC AGTGAGTAGG ACACTCGTTA AGAGGATCTG 
74 761 GCAGGGGACG ATATTGACAT CCCACGCGTC CAAACGGACT AAACCCGCAA CACGTCCAAA 
74 821 ACGAACCAGG CGCGGCCATT GCCAAACAAA ATGGTAGATG AAATTAGGGC AATTTTCTCT 
74 8 81 ACTAGTGGAG ATATGGC CG A AGTAATTACG GATATACTGA CTGAAACGCA AGCAACGGCG 

74 941 TCCTTCTTCT GCGTGCTCCA CGATCGGGGC GACGCGCCTA TAAATACTCC ACATGCCGTA 
750 01 ATTAAACTCT GCCTGCCCGC CAAGCGCCCA CGCGGCGGGC CAAGGTGTTT ACCGTTGATG 
7 5 061 GTGCTGAACC TACCGGCGTG GCAGGTTAAT CTATTCCTAA CAGGTGACGC ACCATTGACC 
75121 TCGGATAACA TTAAAGACCG CATTGACCTG GCTCAGACCG AGGAAATACT CGAACCCATA 
75181 TTAAGCGTAC TGGCATGCAA ACGGTCCGCG CAGCAGACCA AACATGACTC GTTTAAATCT 
75241 AAGGTGGCCT GGTTCAGGGC AAAGTTCGTC TCGGCCCTAA GAAAAGTGTA CAAAATGACC 
753 01 CCATCCCCCT ATTGGATGAT AACGCTG CTG GGCTCGTTCG AGGCATCGTT TGTACTGGCC 
75361 GGCACGTTTT ATTTTTTCCA GTCTCATATA TGCACGGCGG AGACGCTGGT GCATCTAACG 
75421 AGGTTATTCA GCTCCAGTCA GGGCCAGAGC TTGGTCACCG TAAACACCTA TGACGAATTA 
7 5481 GGGCGCGTAT TCGGG CGGTC AGATTTCCTT GGAATTGTTC CAAATTTTTG GGCGTATCTA 
75541 AAATACAAAA TGCAACAGGA CGACGTGGAG TCCAGGGCCA TCGATCAAAC AATCAACTCC 
75601 ATACGTGGGG GGTTGATGCT GTCACCTCAG GACCTCGTAC ACTTCATCTA CCTGTCATTT 
75661 TACGAGTGCA TGAACGCACA GACGTTCCTG TCGTACTCTC GTACCACGTC CAGTTTACCG 
7 5721 ACCCCCGCAA CGGTTAACCC TCCGCAGCTG TGTCGGCGCT TAGAAGCGGA TTTTAAGGAG 
75781 CACGTGATGG CATATTACAA TAAAG CAAGC TACCTGAGTA CTTACATAAC CATTTTAACC 
7 5 841 GTACCCGCTC CTCTCCCGGA CGGATACGAA AACTTTCAGG AACTAGCATG TCAATACTGG 
75901 TGTGGACAAT CGAGAGACGT GGCGGAAATC ATGACTAGAA TTAATGACCA GTATCCACAG 

75 961 CTGAATCTAA CAAAAGATTT ATCCGGTCTC CTTGACCTAG CGGCCCTAGA TCAGTATTCC 
76021 GGAGGCCCCA AGGAAAACCT TTTTACGGTG GCTTCGAGAA TTCCCACATA C AGGTG CG AG 
76081 TTTTTGAATA AACAGTATTT TGTTCTAATG CACGCAGACT GTATAGACGC GTACTGGAAA 
7 6141 CAAAACATTA TCGTGCCCGA AGACGCCCAA TTGCAAGGCC TGACGGATCA AGATCTCACC 
762 01 TCCAGGATAT TTTACTGCGA CCTCGGCCTA TCTCTACCTA CATTTAAACA ACAAATTCTA 
76261 GTTTCACGCC ACGAGTATTT CAACCCACGA CTCCCAGTTT ATAGATGGGT GTTGGATTTT 
76321 GATCTAAAGG TAAC CGAAGG TAGACGGACT CTAAACGATA TCTACAACAT ATGCGTAACG 
76381 CTGCGGCAGG TCATATTAGA AACGTTG C AG TTAATAGGTC C ACT AAAAC C GAACCACCCC 
76441 GTGTATTTTT TTAAGTCGGC CTGTCCAGCC GTAACCTGGC CGGATGATAT CTCAGACACC 
76501 GCGTTCTGTC ACTGTGACGC AAAAATAGGG ATGAGAATTG TAACCCCGTT TCCGAGCGGA 
76561 TACTGTTTGG TGGGGTCCGC TCCACTCGTG TCCCTGACCG ACATTTTAAA CCGCGTGGTT 
76621 AAACTGGACA CGCGGCTAGC CTCAGAGTAT CCTGGGATCT TAGAAGATAA AGGACCATTT 
76681 GACTCTGGCA TTTACGCCAA GGGGCGATGC GTGCGCGTAC CTCACTGCTA TAAGGTTGGC 
76741 CCGGGGGGCG AGCTGTCACG GCTCCTAAAA ATTATTATCT GTCACCCCGA AGAGTCAGAT 

76 801 AAATCGGCGT ATTTGAAAAA CGCGTTTAAG GTTTCTAATC TGCTACATCA CGCCCCTGGA 
76861 GACTCTGTCA CCAAAAACGG CCACCTGGTG TACGCGATCA CTGACGAAAA CGAGGGCTTT 
76921 CTAGAAAGCA AAACCAAGAA TAATCTTCCC AAAACAATCA CGGACCTGGC CGAAAAAATT 
76981 GAGCGAACCA CAGAAAAACC ACTAATTGAT TGGGCAGCGA CCGCAGTGTG GCCAAAACTA 
77041 CACGATACCA TACAGCGTTT CTTTCCGGAT GACCGCATCG GCCAATTTGC ATCTGTGAGC 
77101 TTCATGCACT CCGGAGACAA CATTATACAA GTTAAACCAC AAAAAGGAAA CAACTTTTTC 
77161 TGTATTAATC ATAAGCACCG AAAC C AC ACT CAGACAGTCC GTGTATTTTT AACCCTACAT 
77221 TCCACAAAAG AAAGCGAGGT CACGGTGACC TTTATGAGTC AGTGCTTTGC TGCAAAATGT 
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7 7281 AATCATAATA GCCCAACTGC ACATTTTTCA TTTATGGTAC CCATAACCGG CACGTAAGAG 
77 341 ACACGAATAA AATCATCCGT GATGGGC CAG CTCTCCGGCT ACTTTGTTAA TTTTAACACC 
7 74 01 GCGTAACCCC AAAAACAACA TCTGGACGAG TTTTGTTCGC GAACGTGAAC ATGGCACAGG 
7 7461 CAATGGTGAG CCTGGAATAC ATGAAGGATA TTTTGGACGG TAAGAAGACC CCTTGTGGCT 
77 521 CTTACGACTC CCATTTAAAA CCGCAACTTA TCAAACGGCT CTTTTTATAC GATTTACATG 
7 7 581 CGGTATGTGT TTCACGCACT AATTTGTTTT AT AGGGGG C A TATCATCGTC GGACTTCGAC 
7 7 641 GACTCGTCCT CGGACGAAAT GGACGACCTG TCCCCAACGC CGGAGCCAGA ACCGTCCACA 
7 7 701 ACGCCGAACA GCTTTCCGGA GGGGCCCAAA TCACAAGTGG TGGCGTTACC TAAAATCCGT 
7 7 761 AAAAGATCTC GATCTGAGAC GCCGGTAAAA ATTGAGCACA GATCTCCACT TAACCGCTCG 
77 821 CGATCTCGAT CCAGAACGCG GTCCGGGTCC GGTCAACGAT CAAAGCAGTC AGGCAGATAC 
77 881 GTCAAGAG AT . TCAAACCAAC GGTTGATGCA CCCCGTCATC GAGAACCGTG GCACAGGGGC 
7 7 941 GGTAAGGGAA AGGCCCCGTT TATCCGCAGA GACGCAATGG CTGGCCGCGG TCGACGCACA 
7 8001 TACGGCCACG ACTATCGCGG AAAAGCCGCT TTAACGCGGA G C ATTAAAG A GTCTATTAAA 
7 8 061 AAGATGCACC TTCCATCCAC CATGCTCTCT CGTGCGCACG ATAAAAAGGT ATTCGAGGGA 
7 8121 CTGTTG C C AC GACACCTGGG ACAGTGCTTT CAGGTGTGCC TGCCGGCGCC GCCACCGCTG 
7 8181 CAACCCGAGG TGTTCACGGA TCGACAGCTT ACCGCTATAG TCAAGTCTGG CGGGCGCAGA 
7 8241 GACGCGCTGG TGGCCAAAAA AGTTAGCCTG GCTAAACTAA CAAGCCTATA CAAACCCCTG 
7 83 01 CTCACGTTTG TGACGGGGAG AAACAACCAG GCCCACTGGT TGGCGACGCG CAAAAACACG 
7 8361 CTAGCGTCTG CGGGACTCGA GGCTCTCGCG GCCTTTATCG AGGAAGGTCT GGCGTGGGCC 
78421 CAGGTGTGCG TATCTCAAAA CAGATCGTTG AACG AC AG C A AC CTAGATAT CATTCTAGAC 
7 8481 AGTAGTCAGA GCGTGTGCAC CTGGTTCATC TCCAAAATCA GGCATTTGCA CATACAGTGC 
7 8541 TTCTTGGAGA ATCAGGGTGA GGTTAGCCTG GTGAAGCAGC TAACTTACCT CGTGTGTATA 
7 8601 AATAACCGCC TGGCGGAGGC GGCGAACCTG GCGGGAGAGG TGAAACTGAA TTTTAAACTC 
7 8 661 GGAATGCTGA TTGGGTTCGC CCTAACCTTG CCGGCGCTCC TTG CAGAGC A TAAGTTGTCT 
7 8721 GGCGAGAGCC TGTACCTGTT TAGGT CTTTC CTGGAGAAAT ATAGACCGGG AGATGTCATG 
78781 GGACTCTTAA ATTCAATAGT GGTGGAGCAC TACACCAAAT GTCGTAGCGC GGAGTGTGTC 
7 8 841 ATTACAACCC ACGCCATGGT AGGATCTGGC GAAAACAACA AGGGACTTTT TTTCTTTCCA 
7 8 901 GTGTAATATA TAACCCATGT GTGTAAGTAG TTACGGTATA TTATTCACGG GCGTTTAAAT 
78961 GCAATAACCC ACATAACAAA ATAAAAATGT GTAAAACCAA ACACGCGTCA AGCTTTTTCT 
7 9021 GCAAGGGTTC TGTGCTCGGA ACAAATAGGC AGAGTTCGCC ACCTAGCGGC ACGTTCTCTA 
7 9081 TAGCCTTGGT ATACAGCGGA TGTTATTTAC GGAAGTGCCA CATGAAACCG TGGCCGGGTG 
79141 CGTAACAGTA AACCGGGTCC AATCGCGCAG TTTACACAAC ACATAAAATA GGCGACCGTT 
7 9201 TATAGCGGTT TATTTGAACA TACAACACAC ATTTCAGGAG TTTCGGTCTT TATAAAAATC 
7 9261 TTCATT C ATT CAAAGTGCCG ATATATTTCA CGCTGGTGGA CAACAATACA TATCGGCGCC 
7 9321 TGTGATTGAT CCTCCTCCTC GTATAGGGAC CAGGCAGACA CCCAAATCTG TGCAAACGGA 
7 9381 TCGGGTGGCC CGGGATCAGA GGGTCGTGGT GACTGGGCTA ACTTTAACAT GTATTCATCG 
7 9441 GTATCAAAAA CGCGATGTCC CACCCCAGAT GGCACGCGTC TTGGTTCAGA GGTGTTTGAG 
7 9501 GTATTGCCAC GGAACCATAA ATTAAGAAAT CC AT AG C CAT TCATCATAAT CCCATACTGA 
7 9561 CAACTGCACA AAACAATGCC GTCCTCCAGG GCGTGTAACG CGTTCACAAG TCTGCTCAAC 
79621 ATAAAATCGT CATCCATATA AGGCGCCGGG GGCAACCACA ACTGAAGCGG CCCGGGAGCC 
7 9681 ATGCAGACGT GGTGTTCATT TTTATCCGTT GGTCTCGTGG ATAAACGAAC GCCCATCCCA 
7 9741 GAATGGGTAA AAGTCTGTGC TTGAAGGACC CCATAATAAT AAACCTGAAT ATG C ATAAAT 
79801 CCCCACGGGT CGTCAACAGA TGGCGCCAAA CACGCAGAAA GACCGCTCGA TGCAGATAGC 
7 9861 GGCAGGGCGT GTTCCCCGGG CAACGCAACC GCGCCCCCTT GACTATTGTC CTGCCTACCG 

7 9921 TCTTCACCGG AGGCGGTTGC GGTTG CAG AC GCCTCGGTGG CCGGGGCGGT CATGGTGAAG 
79981 GTACCGGCGG GACGGGTACG AAGTAGCCCG AGCAGCTGCT TTACCGTGTA TTCAGATACC 

8 0041 CTGCCCGCCT CGTCGGCGCC CTTGCGCTTA AACATCCGAA ACCTCAAGCC CCGCAGTTCT 
80101 GCCGCTACCC CAGAGGCCCG AGGGCACAAC CTGCATAACA TCTCTTCATC GGAGCGCAAC 
8 0161 CTGAAGACCA CGTTTG CAAA TGAAAAACTC CGTCTCAAAA CGTCCTTTTC TTCTATGAAA 
8 0221 TAGGCACTCT TTCTTATCGC ACCCAATAGG CGCGCCTTGG CCTGGCCCAG CCTAGACCTC 
8 0281 CCGGACGGAA GGGGGATGTT TCTCTGGTTG CAGTAGTCCT CGTACATCTT CCCATCCCGA 
80341 AGGTAATCAA ATCCCACAGA CCACGGGGTC GTAGTCGCCA GCACGACCCT GGTTTTTTCG 
8 0401 TCGTCTGCCC AAAACAGTCC CGAATACGTT GCAAGATTTG CGTGAGTAAC GATCCATTCT 
8 04 61 CGAAGTCCCA TGGCGGACGG ACCACCGCTC TGAGGTGGGC GCCAGGTAGC CATGGCTGTA 
80521 TATGCCAGGT TGATACTCAC CGCTTTCAGT TCGTGTTAAA ATAACCAAAC TCCGCCCATG 
8 0581 CGGCTACCGC CGTGGGCGCA GATGTGTGCA GCCGTCACAA AAGAATCCCA CCTGCGGCAA 
8 0641 CTAATTCCCT CAGCGCCCAG TCAGCTGGCT ACCGGTTCAT TTCACTCACG AGCACGCCAC 



24 



WO 00/28040 



PCT/US99/26260 



8 07 01 CCCATCATCT TCACCGCTCG TTCCCGCGTC CGGATCGTCC ATTGAATCTG GTCCGGAGTC 
8 0761 GCCCAGCTCT ACTGCCGGGT CATGCGATTG GCTCGTCCCC TCTCCTGGAT CTTC CATCTG 
8 0 821 GAATGAACAG TCGGTGCGAC CCTCTGATGT TACAGTCGTC CCGGGGTAAA AACAACCCTC 
8 0 881 TGTACCGCAT ACAGACGAGG TTTCG C AC AC GTAACGCAGT CTCATGGCGA TGGGCGCATC 
80 941 GACGGGATTC GGCGACTGGA CCGACAGACA CAGGGCGATC TGTGCATAAT CTCTAGGTGG 
810 01 TTCGTCTCCA GGGTTCGGGG ACCTTG CAAG AGCTCTAAGA TAATCCACCA TATCAAAAAC 
810 61 CTGAACGGGT TGCCTGGATG GCAGGTCCTT ATACTCCTCC TCATTGTTAG TAAGCACCTT 
81121 CA*TJ&rCCCGG TTCTCCAGGT TCTTCACGCA AATGCCAGTA TCCGTAAGGG TAATCAACAC 
81181 ACCCTTCTCG CACTTTTTGT GCATAACCCT TAGATCTTCT CTGGCAAACT GACAGTCAAC 
81241 GACCCCCGGT GACGGTAAAA ACAAAGTCCT CTCATCTTGG AAACAACACG CGTGGCCAGC 
81301 GAGGACGGGA CATGGTCGAC TGCAGACCCG TATTCCAGCA TGAACACGTT CGAGGCCAAC 

813 61 GGTCTCTCCA AAATATAGAA ACGTTAATTG GATGGCCCCT GGCAGCCGAT GCTCGCGCAC 
81421 ACGCTTGTTT TTGTCCATCC GACTCCGAGC CAATACCGTT GCGGTAAAAA CCCGTCTTCT 

814 81 TTCACGTCCA GACCCACCCC CCTGGTCGTA TTGGAACTCA CTGATTAAAC CAAGGCACTG 
81541 AGTTTCGGTG GAAGACATCA GATTACACAT TGCACAACTT ACCACCGGTT CCGGCAACAG 
81601 GCGAAACAGT CTATATCGAG CACCCGGTCG CCCATAGGCC TTATTAATAA ATTTAAGCTC 
81661 TCGTATACTT TTGTGGTGTC TCATGTCGTA CAAGAATCTC ATTTTCCTGA ATCTGCCATA 
81721 GTGGGACGGT TTTTCGCCCT GACATATGCC TCGCATGTTG CAGTAGTCAT CAAAAAACTT 
817 81 CTCGCTATTC CATACGGACC GATCTGCAGT TACTTTGTTC CACGGGACCC GGACAACGGT 
81841 CTTGTCCTCA TTCTCCCAAA CCAATCCGCG GTACTCGCCC GATTCGACCG CGGCAACGAG 
81901 CCAGGCCCTG ATATCGACTC CACGGCCCGC CATTCCGCGT ATTGGGAGCT TGACGGATCA 
81961 G CT CT CAAG A CAAATGAAAC CCTTGCGTTC ACGGCACCAG TTCATAAATG TATCTAAATG 
82021 ACGGAGTCAT GTGCCCTAAA CATATGGCTA AGACTACTCA AACTGTG CTT AGGGGAAGAG 
82 081 TGACAAACAC CTGGCACTCA ATAAAATTTT GAGGTCGGGC ATTGACCCGT GCCTTCCTCT 
82141 GGTTAGTATT AACCCCATCG CCACCATATT CAAGGAACAA AACACTCCAG ATACAAAACA 

822 01 CTTTATTACA GATAGAAGGC GCTCACCGGC CTCACAAAAC CGGTCGCGCG CACAGGTAAC 
82261 ATTAAACATT AGCGGCGGAC AACGCCCGCC ACAGACACTC GTGCCACAGC TGAATAATTA 
82321 GCGGTACAGT GCTTTGCACT CCGCCCTCGG GTGTAGGTAC GCCCCCAAAA TATAGGTAAA 

823 81 CACAAGCTTG TGGAAGCGCC GCTCCGTCCG GAGATGTTCT GGCCATGGCT GTCATATAGT 
82441 CAACCAAGTT AAATGCACGA CACGCGTTGG CGCACGACAG GACCGCAACC GCCCCCGTAG 

825 01 ATTCGACGTT CCCTTGAACG AACGCCAGGG AATGCCCCAA CCATCTAATG TAAATGCCGG 
82561 AGGGGTCGGC CCAAAACGCC AACCCGCGCA AAAGATCTTT ATCCAGGAAG CGCAACGCCT 
82621 CGGCCTGAAA CTCTGCTAAG TGGCCTGGAA TCTCCGGAAC CAGTGCCTGC TCGGCGATCG 

826 81 GTGCGGCGCA TATGTGCCCT TGAGGGCGAC GTTCGGACAA TGGGAGCACA CGGATG CCGT 
82741 TGGACGATTC AGTCACCACT TCACCAACTC TGTCCCCATA ATAGTAAAAC ATTATACGTA 
82801 GCATCCAACA CTGTTCACCC CCGGCTCCCA CCGCACCAGT GTACCGAACA CCGGCGCCAA 
828 61 CCGGACCAAA CTTATTCACG GCCTCCTCGC GCAACCCCCG TAACATAG C A GTAGTGCCGG 

82 921 AGGCCTGATC GCAACGAGCA CACGACCTCA CGGTAATTGG TAAAAGGCGA ATGATTCGGC 
82981 ATCTGTCCGG ATATGGAGTC GAAAGGTTGT CCTTAGTTGA CACGTCCGAC ACGGTTTGAC 
8 3 041 TATGCCGTAT GGCGCTGGAT AGCCAGTTTT TGCATTCCTT AGCGGTTAAT TCCCTGCCAG 
8 3101 CCGCGTGCAA GATGCCCCTG GATCGACAGT AGTCAATAAA AATGTTTTTT TCCGCCTCCT 
83161 CGACTCCTCT GCTCCCTCTG TCGTGATTCC ACGGTATGCG GATTAAAGTT CTTTCCTCGT 
83221 CAACCCACCG CATTCCGGGA TGACGACCTG TCTCGCAGCA TTCCACCAGC CATC CT CG AA 
832 81 GGCTGGATGG TTTGGTCACT CGGACCGGCC GCTCCATATC CTCGCCGCGT GCACGGCAAT 
83341 GCCAAACCTT GTCCAAGTTA CCTCCTGACA AGCACCTGCG ACCCTCACAG ACCTACGCGT 
8 3401 GGCAAACGGG AACTTGTGGT CTAGTCTGCT GGGACAGTAC CTT AG CGTT A TTTTTATCTG 
83461 TCGTGCACTC CTGCGGCGGA AGTTTGTGGT TTATGAGCAC CACCGGGAAG GATTCATGTG 
83521 TCCCTGCACC AGCTCGTCCT GACAGACCAC CACAGAAATA GGAACTTGTG GGCTCTCTTG 
8 3581 GGCACCAGGC TGCTGTGAGA GATACAGTGT TGCCCCGTTG CACGGCGCAT GGCTCTCTGG 
83641 TCCCGGAGAG CGGGCAAGCT TTCTCATGAA ACCCATG AAG TTAAAAATTT T CTGTTT AG A 
837 01 GAGGAACATG ACCTCTCGTT GGATAATATC ACCATTACAG TCGTTACCCG CCGCGTACAG 
83761 CGTTTCCCTG TTCCTATTAT CGACGTACAC TCCGGTTTCG TTGCCGTGTA CGTAGATGCC 
83821 CTCGTTATAC GCGACCAGCA TCGTTTTTAT TTGCGTTATC TGTTCTGCGG TCAGTTTTTC 

83 881 ACGATCTGGA GATGGAAACC AGACAGTGAA CCCGGTGCCA TAGAAACACA GATGACCGGG 

83 941 ATGGGGAACC GGGCGCGGAT GCAAACGAAC ACCTCTCCGA TCTTGGGACG TGTGATCCAG 

84 001 TGCCAGAACG CCAAAATAAA ACACCTGAAT TCTCAACCGC C AAAACGG C A CAATTTGATG 
84 0 61 AGGCAGCGGC GCTTCTGGGA CCCCGAGACG ATGTGCTCGG GCGTACGCTG ATCTCGTCGG 
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84121 
84181 
84241 
84301 
84361 
84421 
84481 
84541 
84601 
84661 
84721 
84781 
84841 
84901 
84961 
85021 
85081 
85141 
85201 
85261 
85321 
85381 
85441 
85501 
85561 
85621 
85681 
85741 
85801 
85861 
85921 
85981 
86041 
86101 
86161 
86221 
86281 
86341 
86401 
86461 
86521 
86581 
86641 
86701 
86761 
86821 
86881 
86941 
87001 
87061 
87121 
87181 
87241 
87301 
87361 
87421 
87481 



AGGGACCAAC 
CACCTGCTCC 
TCCAGAATCC 
GACCCCATGC 
CTTACACTCT 
GACAACGGCT 
TCATCGCACC 
CG TAG ATGG C 
CGCGATACGA 
GCACCTGGCT 
TTCTGGTCTC 
TCCTCCAGTG 
TTCGATTTCA 
AGCTGTGCAC 
ACAGAAAAAA 
CGGCGAATCC 
GATAAACAAT 
CCGAAACCCA 
TGGCCATATC 
CACGATGTGG 
CGGTAATCGT 
ATTTCAAAGC 
GAAACCACGC 
GTTTTGAAGA 
CATAGTAATA 
ACGCCATAAG 
CGTGTCCATC 
AAAGGGCGTG 
GGAGCGCGCA 
AG AC CAG CG C 
TGCTCTTTCT 
ACGGCAGTGG 
AATCATAATT 
CTGCCCAAAA 
GGGCCATGCC 
CAACGTTTAA 
CTGTGACAAA 
CCCGCCGGTA 
GTTGTTCCCG 
GCGGCACCGT 
GCAATGGGAG 
CAGTCACGCG 
AAGAGATCGA 
TTGGTCAATG 
ATAACACAGA 
AACTCGCAGG 
CACAGGTGTC 
CGCAAAATTT 
AGCACAGGTT 
CTCAGTGGCG 
CTGCCCATGG 
ACCGTAACCT 
TTG CTCGGCG 
ATTTTCTTAA 
TCGTCAAATA 
ACACGTATAA 
ACGG CCTCGA 



GTGGGCAAGG 
AAATAATTTA 
ATCAAAGGTT 
TCCACTTCTT 
AGCAGGTTTC 
GGAGTTCCAG 
AAGTTAGGTC 
TGTAGTTAGA 
GTGTACTTTT 
AAG CAAGCAG 
ACGTTTTGCA 
CCTTGACAAA 
GTCACCGGCA 
TCAAGCCCGA 
CGAAACCACA 
ATCTAACAAA 
AATAGATAGT 
GAGTTTAACA 
AAGCAAATAA 
CAAACTGTAC 
GAACATAATT 
GTCCTCGAGT 
CTGAAGGGGT 
TAG C CTG ATT 
AATCTTTATA 
TCTGGCGCGC 
TTGATGACCT 
AAGCATTCCG 
CACGCGCGGA 
CGTGAACGGA 
TACGGCTCCC 
GATGTGTCTT 
GGGAAACGAC 
CAGGCCCGGG 
ATAACGAGAT 
AACCCATGTA 
CATAAAGAAA 
ACATTTACTT 
AAGCACCGAC 
TGCTGACATC 
CATCCCATGT 
GCGGGACATC 
ACGCCTGGAG 
CCGTCATGTT 
TCAAACCTCT 
GAATCAACTG 
CGGGACGTTT 
GAACTCCGAC 
GCGCAGAAGA 
CCTTAGCCCT 
AACAGCTATG 
CAGGAAGCAA 
GTGTCAGCTC 
ATTTCTGGG C 
TCTTGTTATA 
TGGTCTTGTC 
TGAACCAAGC 



GGGCGGGCAG 
TAATCCCTTC 
TTACCCGGAA 
GAAATCCAGC 
CATCTAGCCC 
CTGCCCCCTT 
TGAATAGGTT 
ATCTCGATTC 
CCCCTGATGA 
CTCTATTTCT 
GCTCACCTGC 
CACTGGCCCA 
GTGAGTGGTG 
CCAATATGGC 
CGCGAGGTGA 
AC AG C AAATG 
GGAGCCCTCG 
AAGGCCGCGG 
CGATCAGTGT 
GGTTCCACGG 
CCCCTGCTGG 
CTCTGCACCA 
CCATCCAAAA 
CCCTGCCCGG 
TGCATGTGAC 
AGAAGGCCCG 
TCCCGATACC 
CGCCCGAGCA 
CACACCGGAC 
AATTGATCGG 
AGCAGGCGCC 
AACTCACAGT 
GGTGGTGTAG 
TAC AG AC C AC 
GGGCCCCTTC 
AAGTTTCAGT 
GTGGCACACG 
CCTCCCATAC 
ACCCGCTCCA 
ATCTGACACA 
ATTTTCGGAC 
TCCTGGAGTT 
CGGCTGCGAA 
GCGTTCTTCC 
CTCGCATGTT 
GGACGACGGC 
GGGATCAGGA 
GTTCTCTCCA 
GTCCTGGAGT 
CGGCCGCCGT 
CAACGATGTG 
TTGAAAGAGT 
CCTCAAATAC 
ATGTGAACCG 
ATCTTCATCG 
CTCGTCATCC 
TTTTAAATCC 



CGGCATTCCC 
AGAGTTCTCA 
GGCGGTCCAG 
ATTCTCCCTC 
AAC CTGCAG A 
CTTCCAAGAC 
TTCTCGTTGA 
GCCATGACAG 
TATTTGCAGG 
GCCACTTCCT 
AACGACACGG 
GAGTTAAAAA 
GACATTAGAA 
ACGTGACCAT 
CATTTTCCAG 
TTTATTCAAA 
AAGAGTTTCG 
GGGGACCCGT 
CAAATGCCCA 
TATTTCCCTG 
TGCTGCAGAA 
CAGACGACTC 
CACACACGTG 
TGGCTGTCAA 
CCCATGGATC 
AAG AATACG C 
GTCTCTCTTG 
TGGCAAACCG 
AAGACATCTC 
TAGGGAATTC 
CCCTGGCTTG 
AGGCATCATA 
CTGCCAACAC 
TATCGG CTTT 
TCGATTCCCC 
TCAATTTAAA 
TGTGCGCTTT 
GGTGCGAGGG 
GACGGTCCCT 
TCATCATTGC 
TGAGTAGACG 
GGCGATCTCG 
GGCCGCAGCT 
AGGTTTTTCA 
TTCTGCAGGG 
AAGAAACACG 
AGGCCACATA 
AAGTAAAAAA 
CGTTTGGATT 
GTCCTCTCGA 
GTCGCGATGA 
CTGTATCTTT 
CTGTGAGATC 
GTTTGACACA 
ACGCGACTGT 
CATTCCACTC 
ATATCGCGTT 



GGCTCTAACC 
TCAAGGATAC 
CCGCCACCGT 
AAACCCCTCA 
CCTCTCTCCA 
AGCTTAAAGC 
GATGGTAAAG 
CGTACACCTA 
GCTCTATCTC 
CATTTATAAA 
TGAATGATCC 
GTGAAGCAAA 
AGTGTTTGGC 
TCCCCTATCA 
GTTAGAGAAA 
GTGCCTATAG 
CTCCTCGCCC 
GTCGCGCGGG 
CACGTGCATC 
AAACCACACG 
AATTATTCCA 
AGTCGTCTGT 
GTGTCCCGCC 
CAGCTCAGCC 
GCCGGCAGAT 
TGCATCGTCG 
ATCACTTACG 
CATATTGCGC 
TTCGGATGAA 
CTTATCTTCC 
ACACAGCCTG 
GTGTTGACCG 
CAGCCTCGTC 
GAACGTCAGC 
AGCGGCCATG 
ATGACACAAC 
GGCCGTCCTG 
TAGATACAGA 
CGTCCCCGCT 
ACACGTAGCG 
GGCACATGAA 
CCAGCTCCCT 
CATAGTAATT 
CACAGATTCC 
CCTGCAAATC 
TCAATGGATT 
TACGCACACC 
AGGATACCTC 
TCCGTTTATA 
AAACATCCTC 
G GTT AC AC AG 
CCCCAGAGAC 
TCACTGCATA 
CGCCACGCGC 
CGGTGCACCG 
CGGGGTATCG 
CCGCCATCTC 



CGATGCCACA 
ACGCATAGCA 
GCGTCCGCAC 
AAAATCGTCT 
CGCAGTAGGC 
TTTTTTTCCC 
GGTCCACGAT 
TACAATAAGG 
CTCAAAACGT 
AC C ATATTGT 
CG AGTTTCG C 
AGGCATAGCT 
CCACACATTC 
CAACACTAGT 
TTTATTAAGC 
ATTTCTGTTT 
AGGGAACAAC 
GAGGGGGACC 
CCGGTGTAGG 
TTAGGACCCC 
TCCACGAGCC 
GGTATCGGAG 
TTATTTGTTG 
TGCAATTGCC 
GGCGCAGCAC 
TCCTGCGCGT 
GACGGTCCCG 
AACTCAAGGC 
CGCAACCTAA 
TCAAAGTACT 
CTCCTTCCAG 
TCGCGTTGAT 
TTTTGCTCGT 
CATTCCTTGA 
ATTCAGTCGG 
TCCGCCCATG 
TCAGATGAAC 
TGTGCTTCCC 
GTCATCTCCC 
AAGTTTCAGG 
AATCCACGCA 
CAAATAATGC 
TTCAGAATAG 
AGACTCCGTC 
TGCCCTGGCA 
ATCGGCACAA 
CGAGCCGGCT 
AACAGCCCCT 
AACGGAGACT 
TCGAACGCAA 
ATCGCAGCCC 
GCCCCCGGCC 
TAACATCCTA 
TGAGCAAAAA 
GTTCCACGGG 
CTTAGACTCC 
GCGTATTCAA 
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87 541 TACCTCTGAC 
87601 CAAAAAGGAT 
87 661 CAAATGAAAA 
8 7 721 CAAACCAGAA 
87781 TTTCTCACAT 
87 841 AACTTTATTG 
87 901 GCCGCATCCC 
8 7961 GTGGGACACG 
8 8 021 GCTGGAAATC 
8 8081 GAAAACACGC 
8 8141 TT AC C C AG C A 
8 8201 GCGTAAATCA 
8 8261 TGAGGAAACA 
8 8321 CAAACGTGTC 
8 8381 CCGACCTCCA 
8 8441 CACTGCTCCC 
8 8501 AATCTGGCGG 
8 8561 TT AC AG AG C G 
8 8621 GGCGCCGGCG 
8 8681 ATAGCGCTGG 
8 8741 CGTATGCCTT 
8 8801 CCCGCGCCCT 
8 8861 TGCATTCCAG 
8 8 921 GGCCGGTTTA 
8 8981 TAAACTGGCT 
8 9041 GCACACGCCG 
8 9101 CTACTAGCGG 
8 9161 CAATTCATCC 
8 9221 GCGACCCCCA 
89281 CCGGGCGAGT 
89341 CACAGCCCGT 
8 9401 GGTTCTGTTA 
8 9461 CCCGTTTATA 
89521 AGGTTTTGGA 
8 9581 GTCTTGGCGC 
8 9641 CGACCCAAAA 
8 9701 CGGTGGTGCA 
8 9761 AGCCTCCGCC 
8 9821 CGCCTCTAAG 
8 9881 CGGATCCACG 
8 9941 CCGGGCATCG 
90001 GCACTCCGAC 
90061 CACAGGTTGC 
90121 GTCGCACCAA 
90181 TCGCAGGTGA 
90241 ACTCAAAAGC 
90301 TGCACGTGTA 
90361 CGTTTCGGGT 
904 21 TCGCCACTCC 
90481 ACTTTATTTA 
90541 AAGACTATTC 
90 601 CATCCAATCG 
90661 AAATTATGCG 
90721 ACATAAACCA 
907 81 ACGTGGAAGA 
90 841 GACAGCGTAC 
90 901 CACACCTCAA 



CGAGCGGCTC 
CTAGATCTGA 
AAATGTTCTC 
CCTAAAGGCC 
GCCGTGAGCA 
CAGGGACAGG 
CGCACGCCCG 
GCTCGCCTTC 
GCGTTCCGTG 
G C AG AACTCG 
GGAAGGCTGG 
TCAACCCGCG 
TATCCTGTGG 
CTTGCAGTGG 
TGTTCCCAAC 
CGTCGCCTCC 
TTACCTCGTT 
GGCAAGATCG 
CCGACAGGTT 
CCAACCAGTT 
TAAATTGACA 
TTAAATGATT 
GATGCTTATC 
CGCGGATCGA 
TAAAAATCTA 
TAGGGCTGGC 
AAATCTGCGG 
TGACAGATCA 
AGGTAAAGAG 
CTGCGCAGTA 
TGTGCGTGGT 
TCCACATACA 
ATCTCACACA 
AGCCACATCT 
TTATGCAATC 
TAAAAAACTC 
ACCGTCGCCA 
GGAGCGGGAG 
TAATTTAGGA 
GCGGGTCTCA 
TCCTGTTCAA 
ACGTTGTGCT 
ATGCCACTCA 
CACAAGTGTT 
TTGTGAGTGA 
CCTCACCACT 
CGTATATGCC 
CGCTCACCTA 
CCCACCGCGA 
AGGAGAAAAA 
TTGGCGAGTT 
CCCCAAACAT 
TTCCTGGCCC 
GGGCTAAAAC 
CAAAAAAATA 
CCGCGGCCGC 
GCGCTTGCTT 



AGTTAAAATA 

AAGAAG CCTA 

AGAAAACGAA 

GTCACTGACC 

AACGAAAACG 

GCAAAAGCAA 

CCACAGACAC 

CGGAGTGGGA 

TGGCGGGGTG 

CGCGTTGTTA 

AACGTG AC CC 

TATGATCTCC 

GGTATGCGGC 

TCGCTTTGGC 

CAACCGCCCA 

CACGGCGCCT 

GCGTACGTCC 

TACAAAGATC 

CTCCTCAGTG 

CTTACATTCC 

ATAGTCCAGA 

CCAAGGTAAA . 

ATTATCGCAA 

CCCTGCCCGT 

TCCCCCCCCG 

AGGGACGGCA 

TCTACAAAAT 

C CAAAGTG AC 

TGACTGCGTT 

GACCCATAAC 

TCCCCTCTGC 

TCCCCAGCTC 

CGGTTTCTCT 

TATACG CTGT 

GTACCCCTCT 

GTATACGTAG 

ACCTGGCGGC 

GAGGCAGGGG 

CGGTCTCCGA 

AGTAAAAAAA 

ACCCAGCGTT 

GGCGTCCACA 

TACCTCGGTG 

CGGGAAACGT 

TTTCTATCTC 

CCCCTAAACT 

CCGTGAAAAC 

AACTAACTCG 

CAGAGAAACG 

AACCACCGCC 

CGCGGCACGC 

AGTCAATAAA 

CAAGACGTGC 

GTTTGACCGA 

GTGTACAAGT 

CCCTCCAATC 

TATAGAGTAT 



CCGAGCCCCC 
ACTATATACG 
GAGAGGAAAA 
CATATCTTTC 
CTACACATAA 
GCTGTGCACG 
TCGTGCCAAA 
ACGCCTCCTA 
GCTGACACCG 
ATTATATCCA 
ATATAC CGAA 
TTCCCCAGCC 
AACAGGGCCT 
GCCGGCAACA 
TAGTAATAAA 
TCGTACTCCA 
AGGAGCATGC 
GGCAACAGGC 
GATACATCTT 
CTGACGCTCA 
TACATGTTTC 
CGAACCAGAG 
CAATCGAGCA 
CCTTCCGCCA 
TGCACCTGTA 
CCCAGCAGCT 
CCACCGAGCC 
AGGGACCTGG 
GCTGGGAACA 
ATAGAAGATT 
GTCGTTCCCT 
ATTGCCATGC 
CTGCTCCGGC 
C C C ATAG AAA 
ACGGTCCTGG 
CCGCCAAAAT 
CCTGGCATAT 
CGTCCCTGGC 
ATGACTATCT 
TCTTTCTCCG 
TTCCCGAAGC 
TTCCAGATCC 
CCATGATATC 
GTTCGCTTCC 
GGTCATTGTG 
GTAGCTGAAT 
CTCGCACTTC 
AGCATTACAC 
AAACGTTCAA 
ATCAGAAGGT 
TGTAGTTTTT 
AAAACCGTAA 
GTGAGCGTCA 
AACACGAGTC 
CCGAGGAGCA 
AAAACAGTCA 
GTGATGTACC 



AGAACCCCAG 
CCCACGGGAA 
CTGACGACAC 
CCCTTCCCGA 
TAAGACACGT 
GTAACAGTAT 
TTTGAATGGA 
GGTGGAGAGA 
CACTCAGATA 
CCGGCTCCAG 
TGTAAATCCC 
ACTTTAGCAT 
GTTCCGGCCG 
GGCGAACGCC 
CAACCAGCCG 
TCCCGGCACC 
CGCCGGTGGC 
GAATAACGCG 
CGACGGTCTG 
ATCGTCTGTT 
TCTCATCGTC 
TCTTTTCCTC 
ACCATGCCCT 
TCTCTGAGGG 
CTTTACAGTC 
ATTTTCATCT 
GCCCGAATGT 
GGTCTTTTAC 
GTGTCGCCGG 
TGAGATTTGA 
GCGCAGCGAA 
AAAAAGACGC 
GTTAGCGGCC 
CACTCGTGAC 
GACGTGTGTT 
GGAGTGATCT 
ACGGACCGCG 
TCAAGCCCAT 
AAGATGCAGG 
CCAAAACGCG 
ACCCTTAGCA 
CTATCCAGGC 
CTAAAGCTCC 
AAATTGGAAA 
AACTTGGTCC 
TTTATGT CCA 
CTCATAAATT 
GGTTCATAGT 
TTTAAAGGGC 
TTCCGCAGGC 
TTCTGATTAT 
ACACACACAC 
GCAGCACAGC 
CGAGGCAACA 
GAGGCGTCGT 
CGAGATAAAA 
GCCTAGAGGT 



GCCTCCGTAA 
AACT CTGT AA 
GCAATCTAGT 
GCGAGTGCGG 
GTGAGAGGAA 
GTGTCACTGG 
CAGGGGGATT 
CGCATAGGCG 
TTGGGCGAGC 
CGAACCTCCG 
AGACCCGTCT 
GCTCGTCTGG 
AATGCCCGCA 
GTTGGGCGAC 
TAG CATCCAA 
GAGATACCGG 
ATCCGCTTCG 
GCACCTGTTA 
ACTGTGGCGT 
ACCGGTCTGC 
AGAGACGCCA 
ATCCAGCCAA 
CAGGCCAGAC 
TCCGCTCGCA 
ACACCTTCGG 
CTAGCGCCGC 
CACCATGAGT 
TGGAGCCAGG 
GAACGGGTGA 
CAGAGGATCG 
CCGTGTGCCT 
CCTCCTCACA 
CATCCAACTG 
CCGGTTTGGG 
CCGCTACCAG 
GGTCTGGATG 
CCGCACCGTC 
GGACGCAGGC 
CATAACACAG 
TTGTTGTTGC 
GTCTTTTCCG 
AGTACGCAAC 
TCTTTTCCTC 
TAATCCATCT 
ACGAAAACTA 
CCTTATAGGG 
ATCCCCAAAG 
GTCCCGGGTT 
ACCAAGCTCA 
ACACACCCCC 
TTGTAATTCG 
TCTGGTGAAC 
GGCTGCTAAA 
AAAAAGCCCA 
CAGGCCAACA 
GTCCCATCCA 
GAGCGAAAAA 
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90961 
91021 
91081 
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91321 
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ATTGCATGGC 
GAACAAACGC 
AGGTTCTTAA 
TATC TGTGGG 
AGCAGGCACA 
AGGGTGGATG 
ACCTGCCGCG 
AAGCCAACCG 
TACAGCGAGT 
AAAGATAGCC 
CAACGGGTTG 
GCTGTGTTTT 
GGCGCCCCCG 
GTCCAGGTCA 
TTCCGAGAGA 
GACGCCCAAC 
CTCATTGGCG 
TGCGGCCACC 
GTCCAGCTTT 
AATAATTTTA 
GACGGTGACC 
CAGCGTCTTA 
GAACGAGGGT 
CCCGTAGGCG 
GTGGTGGTAG 
TGTGAACGTG 
AGAATATACC 
CCCCACGCTG 
GCCTTTCTTA 
GACGTCCACC 
CTAAAAGACG 
AATAGAGCCT 
TGAACGGATG 
GTATACGACA 
GACTTTAGAA 
ATGGTCACGT 
AAGAGGCCGT 
AGCAAGCTGG 
CACACCCCGC 
AAGTATATCC 
CGCACTCGGG 
TTGGCATAAA 
TACACTTTGC 
TGATTGCTGA 
CCCAAAAATG 
TCATCGGAGT 
TGGGTGAGTG 
ATGGCGGCGG 
GTACGCAGTT 
ACTGCAGGGC 
ACCCTAGCCT 
CCTCCTTTAA 
CCCGCTCCAA 
CCTTCAGGGG 
CACCAGGCTT 
TGACCTTCGA 
GAGAGGTTCC 



GTCGCTGCAG 
CACCTACTAT 
GAACTAAAAA 
CAATCGTGGG 
GGCACGACCC 
C CAT AG AAC C 
AG AG C CAGGC 
CGCCCCAGGA 
GATAAATAAA 
AGGCGAGGGA 
AACGTCGCGC 
CGCTTGCCGT 
TCTGTCGGCT 
GGCGTTCTCG 
TTGCCCGGGA 
CTGACGTCAG 
TGCCAGCGGA 
AGAGACTCTA 
CTTCCTCTAG 
CTAACCCCGT 
ACCTGCTTGG 
ACCGTAAGGA 
AGGTTGGGTT 
ATGGTCGTCC 
AACGCCACGT 
TTCCCAAGGG 
ATCAGCGGCG 
GAGAGCACGC 
ACGATGCCCT 
CTGGCGCCGT 
AGAAGAGAAT 
CTGGCCCGGA 
TGACATATTC 
AGGGACACGC 
TCCTGTCGGC 
CTCGGCTCTT 
GGATGTATGC 
CGGCCCTGGT 
GCATGATTCC 
CTTCGATAAC 
AATGCAGCCA 
TCATTAAGTC 
CGTGGATGTT 
ACTCTTTAAC 
TGAAGAGAAA 
CCTCGTGCAG 
CTAAAAAGCC 
TTACTATTGA 
AGACTCCGCC 
ATTAAGTTTT 
GTATCCGGTC 
AAAGAAATTA 
CTGCTTAATT 
TTCTGGAAAT 
CACTCTCTTA 
TGCGATATTG 
GGCCGTTGGC 



GCCCGCCTCA 
AAAGGCCACC 
ATAATACACC 
GCACAAATAT 
CACAAAAATG 
GGCGGTAATA 
CGGTAGCCCA 
GAATATAATG 
ACACAGAATA 
CGCCTCCGAC 
GAGGCGTCTT 
GTTTTTTTTC 
CGGCACCGTC 
GAGGGGACGG 
CGGCATTATT 
CAAGGCTATC 
AACCAGGCAA 
AACTCACATG 
CGCAAGTCCA 
CGACCGTAAA 
CCCTGGTTTT 
ACAGGCGATT 
CGTCCACGAC 
TAACAAACTG 
TCATTTCCGC 
AGGTGTTTCG 
CCAGTGCATT 
TAAGGACGGG 
TAATGTGGTC 
AATGAAACGA 
GGCAAACACG 
AAATGAGTAA 
AGAATTATCA 
GTTTGGGGGG 
CGTGGCCTGC 
GGAACGAATA 
GACGGAGGGG 
GTGTAGCAGC 
GCGAAGGCGA 
CAGGAACAGC 
GTCAATCTTC 
CCCCCGATTA 
TTCCATGGCT 
TAGGTCTTCG 
CTGGTAAAAC 
GGAAATTTGA 
AGGGTGGTCG 
CAGGC CAAAC 
CCCTCTGAAG 
GTCTGTTTCT 
AGGAGCAAGT 
AGCGACTGAC 
GGGCAGTAAT 
GAAGCCACGT 
AAAAAAGTCA 
GCAAAGAACG 
ATCAGCGCCA 



TAAAGCGGAT 
AGGGGCAGCA 
GAGAACGCTG 
ACCACGTCAT 
GAAACGTGAA 
AGAAACGCCG 
AATATACAGA 
TTCTCAACGC 
AACGGGGAAG 
GTGTACGTGC 
GGTTTTTCCT 
CGGTCGAGAG 
CTCGGCAATT 
GAGCTCAGGA 
CCGGTCCGTC 
TGTCAGCGGC 
AAAAAATCCC 
AGTCGTCAGG 
CTTGGCGTCC 
CGTGGCAACT 
CTGCTGCCTT 
CCCCGTCCCG 
GCTCTTGTGC 
CGGCTTTCCT 
AATGTTAACT 
AAAG CTAAAG 
TTTAAGCCTG 
GGCCCTGCCC 
GTAAACCCTG 
GACAGGCATG 
ACAGACACAG 
CATGGGACAC 
CGCTCAAAAA 
CGGGCACCGA 
AAAAACTGCT 
AACAGAGTCT 
GATTCGTCTC 
TCGTCCCTGG 
AAAAGTCCTA 
AGCACCTTCT 
TTCGCCAGGG 
CCGCCAAAAA 
ACCTGTTCGG 
ATGTTAAAGT 
TCCCTATCGG 
CTGGGGAACC 
CTCGCGTACA 
ATACCTTGGT 
AGATGTTGTC 
GAATTCGACA 
CTCGAATATA 
TCTGGTCTAC 
CAAAGGCCGT 
CACCTCCGTA 
CGTTTGGCTT 
GATAAAATGC 
AAAACTGGCT 



GTGACCGCAG 
AAAACACCGG 
CTAAAAATCC 
TCGCCGCAAC 
CCCATGGAAG 
AGATGGTGCA 
AT AG C AAAAG 
TAATCAGGGG 
CCGAGACCGT 
CCATGGCGCG 
TTCCCGCGAC 
CCGCTCGTCC 
GGGACGCACT 
ACCCCCTCGG 
GGATCTACCT 
AGGCCCTCAA 
GGAATCTTAC 
TCGCTAGAAA 
GGGGTGTCGG 
GCCAGCGTTT 
AGCCACTTTA 
GCCTGCTCTA 
ACGGTGCTCG 
TCCGCGTCCG 
CCAAACATCT 
CTAACCTCGT 
AGACCGAGAA 
TGG CCGCTGA 
CTAATGCTGC 
GCTAAGGCGA 
CGTTCAATGT 
ATGGGGACTA 
AGTTCACACT 
ACAGGGGCGG 
CGATGTCTCC 
CTATCTCCAC 
GGGAGATCAT 
AGATGTAGTC 
TAGAGTAAAA 
CGGCCCGCGT 
TAGCGTCCTC 
ATAAGTTCAG 
CGTAATAGTG 
TTAC C AG ACT 
TGGGACTCAA 
ATCTGTTCTG 
GAAACCGATC 
CCGCTCACCG 
ACCCACGACC 
ATAATACAGC 
ACTCGCCCGC 
AAATGGAGCC 
CAAAAAAATC 
ACGGCGCACC 
CAAGATCTCT 
TTCGGTGTAC 
GTTGAAAATT 



CGCTCTGAGT 
GGCTAAAAAA 
AAGTTCGTAG 
AAC AC AG AG C 
GTCAACGCCG 
CGGTACATCC 
CAGCCCATGT 
CTCTACAAAA 
CCCGCTCATA 
GCGTTTAAAA 
TGGGTGAGCT 
GCAGTGATTT 
GCTCTTTTAA 
GATCGGCACC 
TTAATAGCAT 
CTTCCAGAAT 
ATGCGTTCAA 
CCACACCCAC 
GTTTAAAATC 
CGCTTAACGA 
ACAACAAAGT 
GCCTGTCGTG 
TGTGATTGTC 
ACTCGTCCCC 
CACGGGTGTG 
CCGTCATGTC 
CGCCCGCGTC 
TCTGTATAAC 
CAAGCGCATC 
AATTAACACC 
TCAACCTTTT 
CAGATCATTA 
CTTGGCACTC 
TATGTTTATT 
GACGTTCACC 
GGCTTCGCGA 
GGTGTTGTAC 
GTTAGCCAGG 
TGAACTTATA 
GGTGGCCTTG 
GACTATTTTT 
TATGTTGGCA 
GGTCACGTCG 
CTCTGCCATA 
TAGTCGCACC 
CCATGTCTCC 
TGGGGACTCC 
TCGGCCTTTT 
TGATCTAAGC 
ATGGTTTTTA 
GTGGCCTGCT 
CGCGCACCGG 
TTGTATCTGT 
GTTCGCAGGT 
TCTTTGCTGG 
CCCGTAAGCT 
CCATGCTGGG 
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94381 
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96901 
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97021 
97081 
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97321 
97381 
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97501 
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97621 
97681 
97741 



CAATGCTGCG 
CCCACGTTTG 
GAGTCCCCTC 
CCTGAAAAAT 
CGGCAAACAC 
GCCTAACGCT 
CGGCCTCCAC 
CCACACCAGA 
TACACACGGA 
TTATGGCGCT 
CGTAGGGACT 
GCAACGAGCT 
CACCGTAGAG 
CTGGGTTTGT 
TGTGACACCG 
GCCATAGTTC 
CAACGTGGGC 
AGACATCTAT 
CTTTGAGGGT 
GACCTCCGCG 
AGATGTAATG 
CCAGGTGTCT 
CAGCCACCCT 
TCAGCACGTA 
TGTATGCCTT 
ACATAAACGC 
CCGTAACCAG 
CCTCTATGTC 
CAGGGTCAAG 
TGGCAGATTT 
AGACCCCGGA 
GTGAAGGCTT 
GCCGAACGCA 
CACGCCGCTG 
AAAAAACCCA 
AATAGACGAG 
ATCCACAAAC 
GACTACGTTG 
AACCAACGGC 
AAAATTGGGT 
GTTGGTCTGG 
GAATAACATG 
CCGAGGTTCC 
TGTAGGCGAG 
CACATATTTA 
AACCCGGTGA 
TGCGTCCCTG 
GAACACCGGC 
ACAGCTTAAA 
CATGCCCACC 
ATTTATTCAA 
CATTACAGAC 
TCTTAATACA 
CGACAACCGT 
GCCCTCGGGG 
AATTTTACAC 
CGACATCGGC 



CCCCAGCTGC 
CCAGTGAAAC 
GCCGAGTAAA 
CTCCACGTCC 
GTCGGCCAGT 
CTCGGTAGGC 
GGCATCATCG 
CTCTCCCCTG 
AGCCTGACCC 
GCCTTGCGTC 
GCCGGTCTTA 
AGAGTGTTTG 
TTTTAGGAGA 
TTCGTACAGT 
CTCCTGATTC 
CATGTAGGCG 
GTTTATTAAT 
CCCCACGCCT 
TTTGTCTTCG 
AAGCCCGGCT 
GAACG CGTAA 
GAGGGTCTCC 
CATAAACATC 
CGTATCGTAA 
GCTCTGCTTA 
GCGAACCTCG 
GCACCTGTCC 
CCATCCAAGT 
GCTTGAGCTA 
TGCCGCAGAG 
TTGGAAACAA 
TGAAAAGATG 
AG CCTATTTA 
TAAAAATC CC 
AGTCCTAGCG 
TGGGCGGGCA 
ATGTCACTAG 
CCCTTTAACG 
CCGACCGGCG 
ATGGCGTCAA 
GGGGCAGAGT 
TCAATGGAGG 
TGTGAACTCA 
ACAGATAAAA 
GAGAGCAGAT 
GGAGGCGGCG 
GTCTTCATGG 
GGGCTTTACA 
CTCCTGATGG 
GTGACCCGGT 
AAACACCCTG 
CACCAAAGCG 
GCTATTCGAA 
GCCATATTGG 
ATCCCACCGA 
GAAACCCACC 
GAAGTCGCGT 



TCCCATCTCT 
ACACCCTGAG 
ACAATCTCAT 
AGCTTGGCGC 
CCTTGGACGC 
GCCGTCCCGC 
AGCAGTTCAA 
TGGGGTCTGA 
TCGGGTTGCT 
TCACACCAGT 
ATGACAGTGT 
CCAGCGGCCA 
TTTGGCGCCA 
CGGAAAAAAA 
TCCGGCATCT 
CTGG CACCAA 
TTTAAACAGC 
GACTTGCTAG 
GTCGCCATGG 
GACCTTAATA 
CCAACCAGGT 
CGCAAACACG 
TGGGCCACGC 
AACCTAACGG 
AAGTC CTGTA 
TCGCTTAGCC 
TCCATTATTC 
GTTGCCTTTA 
CACCCCCCAG 
AAAGAAGTTT 
ACTCCGTAAA 
GTTGTAAACC 
TATACAGGTC 
TCAATTGGGC 
CCGGTCTATT 
AAGGCTCGTC 
GAAGCGTAAG 
CAGACGTCAC 
CAAGAAACTG 
GTAGACCGCT 
CCGCGTCCAT 
ACCCAGTGTC 
TGGTCTGAGA 
ATCCATCTTT 
CACGAACACC 
GCAGTAGACG 
CGTCGTCAAT 
AATTAACTGA 
AGTTCCAACT 
TTTTAAATAC 
TGTTTTTTTT 
C AG AAACGT C 
AACCACACGA 
CCATGATTAC 
CACCACCAAT 
GGCAATACTG 
CTCCGCTTCA 



CCAAGGGAGG 
CCAATCGACT 
GACTGGTTTG 
TCTCGGCGTC 
CAATGCCCAT 
CCAAAATGCA 
AACCAAATGT 
GGCACTTTGG 
GCACGATTTC 
GGTGTTTGTT 
TAATAAGGGC 
CTAGTCTGTT 
CCTCTGGCGC 
GCTCCGGCAC 
TCGCGTTCAA 
CAGGACGGAT 
TGGTGATGTT 
CGAGAAGGGG 
ACGGCTTTAA 
CAGGCGTTGC 
ACATCTCATC 
GAAATTTTAT 
TTTCAAAAAT 
CAGACAGGTA 
AAAGACCGTC 
TTTCCCCGAA 
TGTGCCAAAG 
AAGAATTCAC 
TTCCCGCGTC 
CGGTGTTCAT 
TTTTAACACC 
GGAAGGCACG 
ATTCTGCAGC 
AGCTATTTCA 
ATTTACCCTA 
GCGCAGGCTG 
GCCAATATGT 
CTTGGTGACA 
ATGAAGCGAG 
CTCGTGGCCG 
CCTAGACAGT 
AGTCTGTTTA 
ATAAATCAAA 
GATGGCCTCC 
CTTAGAAAAG 
CATCAAACTA 
TCCTGCCGCA 
CAACCTCCTA 
AAAACCACTA 
TGCATTCAAA 
AATGAGAGTC 
AACAGGCATA 
GTCCCCAGCG 
AGAATACATC 
GGGTATCAGC 
GAACCTAACC 
GACGTGGCTA 



GTGGGACGGC 
CCGCTCCCAT 
CACCGCCGTA 
AAGGTAGCCA 
TGATCGTTCC 
CGCATTGACA 
TACATCCCCC 
TAGGCTAATG 
CGCACAGAGA 
GAGCGCCTCC 
ATACATCATC 
GTATTCAATC 
AAACAGGTGC 
ACACACGCCC 
AAAATCACAA 
ATTGTTATCG 
CTTGTGTTCC 
GGACATTTCC 
AATAAAACAG 
GCAGCACACA 
CAGCTCCGTT 
ACACTGACAG 
GGGTGAAGAA 
GCCGCAGTTA 
AAGGGCCGCT 
TACCGCGAGA 
GAGACCGGTT 
GAGTGATTCG 
GGTAACTAGG 
TGTTAAAGAA 
GGTAAAG CAG 
CTTCCAAGTC 
TGAATGTATT 
CAATATCCTT 
ATGTAGGTTA 
GACGGGCATG 
GTAACAACGG 
AACGTACTGT 
CCGGTTTCCA 
TACCAAACAC 
CGCGCCAGCG 
AAGGAAAACA 
AAATCTCCAT 
ACCCCAGTGG 
TCGCGACCGC 
TCATTCAGCT 
CGGGCTGATA 
ACGTGCACCG 
CCAACCGCAC 
ATAGACAACC 
GCCAGGCTTC 
CTATCCGAGG 
GCTAAAGACA 
CATCACGTAA 
CATCTACCGT 
CTCCCGGAGT 
ATCCTATCGT 



TTAACGCCGT 
CCACGAAACG 
AAATACATTG 
AAGCCCAGTT 
TCCTGGCCGC 
ATGATAACTG 
TTCCCAGGCT 
TTTGCCAAGT 
TTAGAACAGT 
TTTAGCAGGA 
GATTTTAACG 
TCAAATTCGG 
CACTGGCCAT 
TGAAACAGGT 
ATCTGATGAT 
TTGAAATACC 
GCAAAAGACG 
TCATGAAGTG 
CTAGAAAGCT 
ATCTGTGACG 
TCGCTCTCCA 
GCCACAAAAA 
CCCTGGGTTC 
ACGAAATTTG 
TCGTGTTTTG 
TAGTCTCGCA 
AGTGAGTTGG 
GCGCATCGTT 
GTTAAAGGTT 
ATAGCTTCCA 
CGCCTTTAAA 
TGCAAACTAC 
TGGTGCGAAT 
TACCAGACTT 
AGCAAACATA 
AGTCATCTAA 
GCTCTCTGGC 
GGACCGTTTG 
ACAATTCTTC 
ACG CTATTCT 
ACGTAGGCGT 
GGTAGGTGCC 
ACGTTTGACA 
TGGTCGACAC 
GAGATAACGA 
TGTTCACGTT 
ACGGTGACGA 
GATCGCTACA 
ACCTTTTAAG 
CCCTGGTTTC 
CGGAGCCAGT 
TTGTGAATGT 
ACGATTATCT 
CTTCACGTAC 
GCGTAGAGCA 
CG CTATTTAT 
ATTGTAAAAA 
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97 8 01 ACTGCAACTG GCACCACCGC CCCTATTCCC ACCTGTCGAC GAACTGGCCC GTCGTCTGGT 
97 8 61 TACCGGCCAC CACGAATTGT TTGTCCCCCT ATCGACGTCA CTGGAAACGT ACATCACCAT 
97921 GCCAGTATCA AAACAGCGAG CGTTTGAGAT ATACAGCGTC TTTGCCAAAT CAAAAAACAT 

97 981 AGTTGACGGC ACACCAATTT TAGCGTTCAC CGACACAGAA CTTACAACGT TCACCCCCGA 

98 041 ACTGTTATTC CTGTACGACT TCGTAATCGA GTCCCTGTGC AAAAA CCAGG CATACGGGTG 
98101 TTCGCGCAAC GCAATTGAGC ATTTTATTAA GAAAGGTATC GATTTTATGG CGGAGTTGGG 
98161 GGCGTTCATT GAAAAAACGT GTGGCTATCG ATCAACCGTA AGCCTGTCCA ACGTGAGAGC 
98221 CGTTAAGGCC AGGCTCG CG A GCTGTGGCCT ATCCAAAGAG GCGTGTGAAG ATTTCCGCGC 
98281 AATGATACTG ATGACACCGC ACGAGACGAC GCCAAAGTGG GAAAATTTTA CGGACTTTTT 
98341 AGAAATGGTA AACCAATTAA CACTATACGG GTTCTATTTT TACGAGTGTC TTAACCAGTA 

984 01 CAGCCCCACT AGCATATCGC TGGCCAAAAT TCAAAATATC TTAAACCGAG TCGATGCCGA 
98461 ACAGAGCGAT CGCGCCCTGT GGCGCACTCC ATTAATCGGG TCTTTCCCGT TCCCCTGGAA 
98521 ATTGAACAAC GTGCTGGCGT TTTTTAAGCC TAGCACCCCC GTGGCAACGC TACAAAAAAT 

985 81 CTACAAGGCA ATACCGTCGT ACCTAATGAG GTCTTTGTTC GAAATAGCGG CCAACAAATC 
98641 GTGGGGTAAC ATCGCGCTCG CAGAGAGCGC CCCGCTAACC GACATACAGA CCGCCGAACC 
9 8701 AGACCAGGGT CCCGTGTCCG CCCAGGTTAT CGCAAAATAT TGCAGCCGTC TCCAAATAAG 
9 87 61 CGCGACAGAT TATGACGCAG CCATCGTCTC CAGCCCGGGA TTCGCCGCCG AGTTCATTAA 
9 8 821 AACAAAACTG TATCCGATCC TTAGCGAGGT GCTCCGAAAC ACGTCCAAAA AAAATCGATC 
988 81 GCTATTCCAA ATTAGATGGC TCATAGTCTT CGCGGCAGAG GACGCCAGAG ACCTAGCCCC 
98 941 TATCAGACGC TCGCTGGCCC TGGCGTATTT TCAAATAATG GACATTTTGG AAGAAAAGCA 
99001 TTCTCCGGAG TCGTTTTACA ACCTGCTGGA CTATCTTCAG GAAACATTTA GGTGCATACG 
99061 ACAGGTGATA CCGGAAGCCA CGTGCCCACA AGAATTTCTA CAATACTTGT TTACTTTTCA 
99121 AAACATTCCA ATAGCAGCGA GCTTCATTCA AACATCCATG AC CTTTGTAG ACGACCTGAA 
99181 AAACGGCATC CCCGGTATAC TGGACCTTGT CTCCTTAGGT GCCGCGTTTT ATAACATGAA 
99241 ACTACTGTAC GATTCAACGC TAGACACCGT CGAGATTCCA ACGGAAGAAG GGCAACCCAT 
99301 CGTCGTGAGC ATGTTCGTAT TCAAATCCAC GATTCGCGTC CTCGAGAAGC TCTTACAGGA 
99361 AGCCGTTATC GCGTTAACTC AAACGTCCGA AC CGATGTAC GCAGCGCACA TCCGTCTGAT 
99421 GCAACACCTC ACGTACATGC AAAAGATCGC CGGACACGAA ATAATGACCA CACAACTTCC 
99481 ATCCGTGTTT CACGAAATAC ACGAGGGATA TTTGCAGTGT TTTAAGCGTT TTAAACGTCT 
99541 CATGTTACAC GTTACGGGAA GCTGCTGCTA CTCACTGACG CGATACTTTG GATTCCTATA 
99601 TCAACCCCCC CTAATACCCG ATACCATCGT ACAAAAAATT TTAAACTTTA ACGACAAAAC 
99661 GGACACAACC GACGACATCT TAAAGAGCCT GTCACAGCCC GTGAGACAAG GACCTCTATC 
99721 GGCTGAAAAC GAAAGTAGCA GTCGACTCTC AAAAAACAAC GTTGAGCTGC TTCAAAAACT 
99781 GTACGACGAC TTTCGGACCG CCTCCACAAA CAATAACCCC ACCTCTATTA AACTTGAATA 
99841 TTCGGGTAAT TATAACGAAA CACAAGTGTC CGTAGATTGG AGCACATATA ACCTGGTGAC 
99901 ATACACCGCA CCCGACGATA CGTTAAAATT CACCCCGGTT AACACGGAGG CACTAGATCG 
99961 CATGTTTGCA GAATAAACGA TGGAATTGCC GCCAATTTTT TCGAAATTTA AAATAGAGGG 

100021 CGTGGCAACG ACGCACCAGG CCGACTGCAG GTTCGGACAA TACGCCGGCT CGCAGTGCTT 
1000 81 AAGCAACTGC GTTATCTACC TCGCACAAAG TTATTTCAAT AGAGAGTCCC CCGTGACGGA 
100141 CACTAACGAC CTCGACGACG TTTTACGGCA AGGGGCGACG TTGGACTTTA TACTGAGACG 
100201 GTCCGGGACG CTCGGCTATA ACCAATATGC CCAACTGCAC CACATACCCA GCTTTATCAA 
1002 61 G AC C AACG AG TGGACCGCGG CCATCTTTCA ATCTCAGGAG TACTTTGGAC TAATTGGACT 
10 0321 TGACG CGGCC ATCCGCGAAC CTTTCATCGA ATCCCTAAAA TCGATCCTAA CGCGAAATTA 
10 0381 CGCCGGCACC GTTCAATACT TTCTGTTTAT ATGCGGCGAC AAAGCCGGGG CCGTAATTAT 
100441 CAAAAACAAA ACGTTTTACC TGTTCGATCC CCATTGCGTA CCCCACGTAC CAAATAGTCC 
100501 GGCACACGTA ATCAG TTCGT CAGACCCCAC CGCCATACTA GAGTACGTGT CACCGCCAGA 
100561 CAGAGAATAC ACTGGCAGTT TTCTATACAT TATGCCCAGC GAATATGTCA ATCCAGAGCA 
10 0621 CTACATCACC AAC CACTATA GAACTATAAC GTTCG CCAAA GTACACGGCC CGCACATAGA 
10 0681 TATATCCACC GGCATAGAGC CGTGCACCAT CGAAGACATC CCAAGCCCGC CGCGATCGCC 
100741 GGATGTGACG TCAAAATCAT CCAACCTCGC ACGCGTGCCG AGAACCACCA CCGACACGTC 
10 0 801 GAGCGCCAAA CCCCCACCGG CGACGCTATC CGGTTTACGG GGCGCGGAAC CACCGACAAG 
100861 CTACCCAGAC CCGGCAACCA ACGACGCGGA CACAAAACTC TTAACTCCCG CTCCAGCGCA 
100 921 AACCGCCGTC GACCACCCCG AATTCCAAAC AACACCTGGA GCCACGCTAC TTCTTTCAGA 
10 0981 ACTGTCGGCA T CC AGGGGTC GGAAACGCAA GCTTTCC AG C CTTCAG CG AT ATTCGGATTC 
101041 CGACGAAGCG TCGTCTGACG ACGAAGGGGC CCCACGTAGG CGCGTACACG ACGATGCGAT 
101101 ATCCGCCGAG GTGATCTGGA TGGACGACGA TATATCTCCT CTATATTCAC CGTCGGCGAC 
101161 TCCATCGTTT GACGACGTGT TCGACAGCCC CCCAATGAGC CCGGAGTTTA CATACGAAGA 
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1012 21 CGCGACAGAG GACACGGATG GCGCGTTTCT AG AAC AG AT C GCTCGAGACG CGGAAACGCC 

1012 81 GTTCTCTGCG TTCGACGACC TTATAACGGA CCACGACTTT TCTTCCCTTG ATAAAAAAAT 

1013 41 AGAACAGTTA ATCAAGTACG AAGCGCCCTC GCAGCACCTG CCAAACATCT CGGACAAACA 

1014 01 AAACGGGCGA GCCGTCCGAG AAGCGGCGGC CCTCCAGGCG ATGGACAAAA TTATGATTAA 

1014 61 TATCATACTT GAACACGGTC TAATTACAGA CGCGCAGGCC CGGGGACCGT CCGCGTGCAA 
101521 AAACGTTCTT CAATTTTTCA TCCTGTGGGG AGAAAAACTC AACATACCAA TAAGCGACGC 

1015 81 CAAGCAGGTC CTAGAACTCG ATCTGCAACT GATACCGTTA CATACGGCTA TCAGCGAAGG 
101641 AAAATTCAAA CAGGGGGCGT TCAAAAAACA CCTAACAACT AAAATCAACC GGTGTCTGGC 

1017 01 GTCAATGAGG GCCACGCACG CAGATGCGCA AAAAAAACTG GCGTCGGCTT TCAACGTCGA 
101761 AGGCTCACAG ATTTCGTCCA GCGAAGCGAA AATATCAGTC CGGGGGCTGA AGGAACAGAT 
101821 CGCCAACCAC CTGAGTCCAG GCTTTTTAGC GGTCTACTCC GCGGACGAGG TAAAACATCT 

1018 81 ACGGGATAAA ATTCAGGACC TAAAAACAGG CATCGAGCAG CGCAACAAAG AAATCCAACA 
101941 GGAAGAACTG TTTTTTGATG CCATGCTCAC AGCCCTGGAC ACGTTCCAAC CCCCTCCGAA 
1020 01 AACGGCATTT CCAATGGAGA TCTTTCCGCA CCGTAAAACC GAAGTTATGC TCGACCACCT 
102 061 GGCGTCCATA ACCACCAGGT TAACCGAGGA CGCCACCGAA GCCCTCAACA ATTACCTGGA 
102121 GACCCCGCCC GACCAAGGAA CGCACATTAC CAACATTCCA AACTTTTCAT CCATCGTGGC 
102181 AAATATCATA TCCACGTTAA AAATCCTAAC GTACGCAGAA AACGACATGC AATTAAACGT 
102241 AACGCCCATG GCAACGTACA GGCGTCAGCT GTTGTACCTC GGAGGCGAGT TAGCAACCAT 
1023 01 CTTTAATTTA GAGTGGCCAT ACGAAACCGT GCCACCGGTT CAAGAACTGC CCCTCGTGGC 

1023 61 GCGGGCG AAA GCAAAAATGG AATCGGTAAC AAAAATGGAA AAGAACCAAC AGGCTCTCGA 
102421 CCAAATACTG GGAGACGCCG AAACGTTACT TGACACAATA ACCGCAACAT CCGGAGATGA 

1024 81 GAACCCGGTC CGCGCCATGT CCATACCGAT ACTGGAGACC TACATTACAA ACGCAGGCGC 
102541 CCTGATAGGC AGTTCTCGAA ACCAGCGGTT CGAAAAACTC AAGGCCGCCA TCCACGACCT 

1026 01 GGCATCATCG GAGTCGTTCA TAATAATGCT GCTAAACAAC ACGCGGCTCG ATAACATATC 
102661 AGACAATCTG GCCAAGATCG ACGGCATCCT GACCAACAAC ACACGTTTTC TTTCAAACGC 
102 721 CACTGTTAGC AAAACGCTCC AAACGCTGGG AGGCAGCCTA ATACGCGAAT GCGTAGAAGC 

1027 81 GCTAAATAAA AGGAGC CCCT CTTCCCTCAA CAACGCGCGT CTCCTCGCGG TTCAAACCAT 
102 841 ACTGGGGCAC GCGTCCGTTC CAGATCACGA GACGCTGACG CGAATCGTTT CCGGCGTCGC 
102 901 CAGCGCACAA AAGGAATCCG CTGGCGATGA TCCAGATAGG TGGACGCGAG TAACCGGTCA 

102 961 CCTAAACGAG CTGAAGCTCG TAACTACCCA ATCGCGTGTC GACAAAGCCA CCAGGCGCAA 

103 021 ACTGTTAATG ATAATAACCC GTGACCTCAA GGAGGCGGAG GTGTCTCAGG AAACGGTCCT 
103 081 GGAAACACGG TGGCAAGAAA ACGTGCTAAA GTTTCAACCG TCGACGTCCA AAGAAATCGA 
103141 AGACTTTTTA CAGTCGGCAC CGTCAGCAAA GGCCCGAAAA TTCG C AG AAA AACACCTACG 
1032 01 GACGCTGATC ACCCAATTCA ACGGCCACGA GCGACCGCCG TCCGAGGCCA CCGCCGTTCC 
103261 CATGGACTAC ACGCCGACGC CCATACCCAC GCCACAGGCC GTTTCTACGG CTACCGCGGA 
103321 AAAGGGAAAG GCCGCATGGA ATAAAATTCA ACAGGCCTTT CAGGATTTCA ACTTTCACCT 
103381 CATCGACGCT TCGGATTGGC AAGAGATGGC ATCAGAATAC TCCAGACACG GCTCGTCCCT 
1034 41 TCCTGGTACG GTTGGACCAA AGCTGGTGCG CTTCATGGAG AGCATCTCAA ACACCCTGGA 
103 501 CGACATCCTC ACGCAGAAGC TGGCATCTCT GCTTCCAAAC GGGCCCGCGT TCAGACCCCC 
103 561 AG CGTTTG AC TGGATCGCGC CCTATCAAAC ACGCGTAAAC GCGTTTCTAA AAACCATAGG 
103 621 CCTGCCCATG GTGCGCAACC TGGCGGACAA GATCCATCAC C AATG C C AAA CTGTCAGTCA 
103 6 81 CGCGGTGCAA TCCGCAGACC TTCAACAGGC CACGGTGGGA ACAAGTTTAG AACGACCCGC 
10 3 741 GGCCGAATAC TGTCGAATAC TCTCTGACAT GCAAGTCGCG TTCAACGACC ACGGAATCGC 
103 8 01 CGTAAGATCG GAGGCCGCGG CGTACACGGA CGCAATCAAC TCGCCGGCCA ACGTCGTGAC 
103 861 TCCCCCGAAA CCCAACCTAG AAGCCCCCAA GAAGCTAATA ACGGCAACTG ACGCCCTAAC 
103 921 CGTCGAGGAC TTTCCAGATT TCCTAAAAAC GTCAATCCTT C AAC AGGAGC AGCGACTCAT 

103 981 TGCGCTCCAG AGAGCGGAAT TTCAGCAACT AGAGGCCAGC ATCTCGGCGG CCGAACGGCT 

104 041 CCGCCAATCC ACCCGTGACG AGATCGCAGG CAAGATGGCA ACCGCTATCA CGCAACTCTT 
104101 ACCCCGCGCC CCCGTCGCAA TATCATCGAG ACCGTTGAAC TTATCAAAAC CTATAGACTT 
104161 TTTGAGTTCA ACGGTATACG ACAAAATCCT GGACAAGGAG C CTTACG AG A CAGCCATAGC 
104 221 GGGATTCGCG TGGCTGGAAA TCGCGACAAA ATCCGTAATG GTCTACAGTC AACAAAACGA 
104281 AACGCAACAG TTAAACGTAC TGCTGAGCGA GGTAGAAAAA CAGAGCACCG TCGCGCAGCG 
104341 TCTACACGAT TTGGAACTGT CGGCGAAAAA CACGGACGAC GTAAAGGTGC TGAAGCAGGC 
1044 01 GCTAGACGAA CTCGCGCCCC TCAGGGTAAA GGGCGGAAAA ACCACCGTAG ACGCGTGGAA 
1044 61 ACAAAAACTG GAAAGCATAG AATCCCTGCT TCGCGCCACG AGGACGGCAG GCGAAATATC 
104 521 ATCGGAGCTT GAACGCATCG GCACACAGGC GGTTGGCACC ATCACCGTCC GCGATTTAGG 
104 5 81 AACGCTCTCC GATCAATGCC GGGAAGCCGC AAATTTCCTC AGACAGG CCA GTCTACCCGA 
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104 641 AGGCTTCTCG GACATAGGCA CAAAACTCAG 
1047 01 ACAGTTTCTG GAGCATTTTG AAACAACCCA 
104 7 61 CCAAAACATA ACCGAAAACG TTCCGGCGCG 
104 821 CAATCACCTT CACGTGCGCG GCAGCGCGCC 
104 8 81 GACCGTCGAT CCGGAAAAAC CAACTCACGT 

104 941 CCAGATCACG TACTCAAACG TCCTAGAGGC 
105001 CCCGGTTCCG ACGGCCCCCG GTCTGGAAAT 

105 0 61 cdb&ACGTGG ATGGACAGAC AGTGGCCCGA 
105121 CACGTACGAA CACACCACCG CCCACGCAAA 
105181 GATGTGCGTT TTTACGCAAA TCATCAGGGG 
105241 ACCGAGCACC GCCGTCGATT TTCCAGAAGA 
10 5301 ACTGGTCACC GCCATGTGGC CCACCCTGGC 
1053 61 GGAAGCCCTA GGACTAATGA GTCGTTTCCT 
105421 AACGCTAGAG GCCCAGGTCA AAAACGGCCC 
105481 TGCCAAAACA GAGGCAATTC CGTATTTCCC 
105541 GAGCCTGTGG GGACAGACGG ACTTTTTG C A 
1056 01 GGTGGCGGCC GTCACCTGGG CGCTCACGAC 
105661 GTCCACATTT AAACCCATGA CAGCCGCGTC 
105721 CCTACACCTG ACCACCTTTG GCCCGCGCGG 
10 5781 GCACCCGCCG TACGAGTACG GACAGCCCAC 
105 841 GCCGGTCCAG GCTTCAAACA CACCGGTATC 
105 9 01 GTTTCACGTA CCGATCAGAA TATTTTTGGC 

105 961 CGGCGGCATG GGGCTCCTCA CACCCATCCT 

106 021 AAGCCTGTAT AACGCACCCC GCAAACCCGT 
1060 81 CCCGCACGAC GAGCGACAGG TCTTTCTGAG 
10 6141 ACCACACGAA GCCGCCCGGT CGTCGACTCC 
1062 01 CCTCGTAACG GCAACGTACT CCAGTGGCGG 
106261 CGTGATGCCC GGACCATACC CCCCAGACTG 
106321 GTCCGTGACG CACCTCAGCC ACGACGAGAT 
106381 ACACGGGACC GTGCAAGGCA GGGATATCTT 
106441 ACAAACCGCC AATCCTCCGG CATGGGAAAC 
10 65 01 CGCCAAAAAA C CT C AT AT AA TTCCTGCGTC 
106561 AACCACGACC CACCATTCAC AAGGGCAAGC 
10 6621 GCCCGGTCAA ATTACTTCAC ACGCGTCACG 
1066 81 ATCTTCACCG GAAAAATTCA ACACGCAAAC 
106741 AACGGCCCAT ATAAACCAGC CAGCCTCCGG 
106 8 01 GACGTATAAA CCCCGAGTGC TC AC CG AAC C 
106861 TCGCCAACCA GAGGCAACCA CCACGGTCCC 

106 921 GGTCTTTATA GGGACCGCGT CCAAACTCTC 
106981 ACCCGACGCA CATCAGTCGA AGATAGATCG 

107 041 CACCCCACAC CTCGAAGAGG GGCCACGGGA 
107101 CATAAACGTC CCCTCTAGCC AAGGTCAAAA 
107161 TCAAACAGCA ACTCCCAGCG CCCCCCAACC 
107221 GCTCCCCAGA ACGGACGGCC GGTTGCTTCA 
107281 CGAGGAGGTA AAACGATCGC CAAAAACACA 
107 341 CTCCAGGTGG ACCGCCCCGC TCGGTCCAAC 
107401 AATTCTCAAA AATATAACAT CACTCACCGT 
107461 CACTAACATC TGGATACCCC TATCCCACGT 
107521 CAAGAATGTG TTAATGCGAT TTATTCAAAA 
107581 CGCTCTATCC GAGGCTATTG CCAGAATAAA 
107 641 TTG CTTTTTT TATTTTGAGA CGAGCGTCTT 
107701 AACGAAGTGG ACCCTATGCC AGAAGACGCC 
107761 GGGCCCATCG AGCCTATCAA TGAATGGGGC 
107 821 CCGAGGGTGG AGGCCACCGG GGCAACGCTC 
107 881 CCCGAGATAC TAGAACTAGC GTCCTTTTTG 
107 941 AGAATTCCAT TCTTCTTTTG CATATACATG 
108001 AGATAATTAC GTTTCGCGAT GGCATACTGG 



CGAGCTTCAG GCGTACATTA AGTACAAAAA 
GCCTAACGTC TTTCAACGCT TCCCGCTATC 
CCCGGCGATG GACTCGGTGG CCAGACTGAC 
CCACTTTACA ACGTGGATAG AAACGCTACC 
CCCGGCGCAC GGAGGAGCCC CTCTGCACCG 
GTTGTTTTCA TTATGTTCCA CCACGCTAAC 
CGCAACCAGG GCACGCCGCG GGGCAGAGGC 
CATCGCTCAG ACGCTCCAAG ACGTTCTCGA 
CCGGGACGCG' GCATTTAACA CATTCTTGGC 
CGCTAGCAGA GCCGTGACGC TCCCGAAGTT 
GATCGTTCTA ACACCCAGGG AATGCACAAC 
GGCCGCAATC TTACGATTAA AATCCTACTC 
CCCGCTAATG TTCCAGGCGC TGCCGCACCT 
AC ATAAC AC C CCGCCTCAGT TGAGATGCTT 
GGCGCAATGG CAGTCAGCGA ACCTAGAGCA 
AATCTGCGAT AACAATCAAC GCAAGGCCAG 
G AT AG ACGG C GTGGTTTTGG ACCAACTGTG 
AGACGACACG TACGTCGACC TAGTCGAGAC 
TCCAACGCCG AGG CGAG AAA CGACCACCGA 
GGGCTACTGC ATCTCGGGTC AATCGACGAC 
CGCTTTCGAG GCGGTGCTCG GAGCAATGGT 
GGCCACGCCC AAGCGCCTTG GCCAGGCGCG 
GGAATGCGTC CCCGACGTCG AGCCCTTCAA 
GCCCATTGAA ACGCTACCCG CATCCCTCCA 
ACAGG CAC AG TGGCTATCCT ACCGATTCAC 
GCCGCTTCTG GTGGTCATAG ACCCTGAAAA 
GCCTGCAAAT TTCG AG AG C A GGCCGTTTTA 
GCCAAAAACG CTGTCGGTAA CATCAAACAC 
ATGTAACCTC TTTACTACGC TATCCCGAGA 
CGCAGCGGCT CCGACAAACG TCACACCGGA 
GGATAACCGA TTAATAACGC AAACAGAAAC 
TCCTAAAGCG CGGACAGATC CACCGGTGGA 
GTCGCAACAC GCAAACAGCA ACGTAAACCA 
TAACACACCG TCAACCGCAC CTCAGGCCTC 
GGTGCCTCGA CTAATATCTC AAACGTCGGA 
CCAGGTCACC GAACCAAAGG GAATCTTTGG 
CGCCAAACCC GCAAACGCCG GCGTAGCCTC 
CAAGTTACCG ATTAATCCAC CCACCGCTAG 
GCCAGCCGTC GAAGAGAGCC ACGGCGCCAC 
GGAAAAATAC GCCGAGAGTC GGCCTCGCCG 
GCCTCACGTC AACACTCCAA CCAGCGCACA 
AACAGTACAC GGGCGCGAAA ATCCCGGCCT 
AAC CG CATC A AACCCGCGCA TTCAATACAC 
CGACGAATCG GAGGTGGAAT CGACCCCAAC 
AGATGTGTCT CACGGGCCCG AACCGGACGA 
CATAGAGATT CATCGACTGG AACACCCCCA 
CCCCACTCCC AGAGTCACCC CAATCCCTCC 
C AAC AT C C AA CACGAAGAAA TCACACGAGC 
CGTACGAAGA AAACTTCAAG CGTCGTCTGA 
GTTTTTATAT CTGTAACGCG CCCATCTCAC 
GTGTCCAGAG TAGTTGCGTC GCTAGGTGAT 
ACGCTCCCGG GTCCGCCGCC ACCGGGCGCG 
CCGCTGGAGA TCGTAGTAAA GCTATTTGAC 
GGGGCGGCCG AGCCGGACCC GGACAAAACG 
CCGCGAAGGC CCCGGAGGTG GTCTTTGCGC 
TCGTAATGAT GTTTGG CCGT TAAAAACACC 
GCGGGAGACA TGTCACCCTG GGGAAGGTTG 
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108 061 TTCATCTCGG CAACCAGCGG GTGATTTGGG TAATCGTGCT CAAGGCGTCC CTGAACGATT 
108121 GGCTCCTTAA CCCGCAAGGA CGACATCTCT TTTAACTTCT ATTACACTTT CCACAGGACA 
108181 GGGACGATAT AGACGAGGTC AAATAAAACA GCTCGGCCAC ACGCAAATGC TTTAATAGGC 
108241 CGGTCGCGCA GTCCGGCGAC GCCAAACAGG GCACGACGAC GCTAACCAGG GAGTTCGCGT 
1083 01 CTCGTATGGC ATGTGCCGCG TTGTTGGCGA GCACGCACCG TAAATAGGGA TCTCCAACAC 
108 3 61 ACGTGATCTC GAATAGAGAT ATAACCCGCA TGTGCGATCC GCCACAATAA GAGCAATATA 
108421 CGCGCCCGGT GGTAG CAC AG ATCGAAAGCT GCTTCTCTTT TTGGTCGCGA CTGAAAAACA 
108481 CGTTGGTGGG TGGGAAATTT ACGG TTTCAA ATTTACCCCG TCCGAAATTC AAACAGTAAC 
108541 CGCACTCGAG GCACACCACC ACCCTCGGAG CTGGCACGGT CTTCTCCAGT ACGCTCCTGG 
108 6 01 CCACCACCTG GGACCAAACA GGTAGAGAGA TACACGGAAA CAGTACGTTA TACGCCAATA 
108661 CTTTTTGACC CAGGTCGCGG GATATCTCCG TCTCGGTCGA CTCCCCTATG GGCAACACAA 
108721 CACGGGACAT GCTCAGCAGG GCCCTAAACG TCAGGCTCCT CAGAAGGGCG TTAAACGGGT 
10 87 81 TGCCGCACGG GACGGTCGGC GCCAGTTCTC GCAGCGAGGC CAGAAGTCCC GCGTCCGAAG 

108 841 GGCCCGGGAC ACTCTCATTC AGGTTAGCTC CCAGACGTCT GGAAATGGAC GGAACGTTCA 
10 8 901 ACTGCATCGA GACACAACCG CCCCCGTTCC ATTTCTTCCG CAAAACGGGC AGAT CCAACG 
10 8961 CGTGCTGTGG CAAACAGGTA ACCAGGGGAA ACCGCTGGCG ACAGTTAAGG GTTTTGCACA 
109021 CGAGACAACA CGCCCTCTGA AACGACACAA CAAGATACCT GGACCACGCG CTCCTGGGAA 
10 90 81 CGGCCGTTAA ACTTAAACCT TCGTCGCTGG GACAGCCCAC GCCGGTGCAG ATAC AC CTC A 
10 9141 GCACCCACGC GTACGCCTCT AACAACGACC GGCCGATATC GTGCAGCCGC GATCTCACGT 
109201 CGCCGTTTTT AGGTGGGTTA TCCGGCCTCC AGCCGGTAGC AATCTCGTTC AGGGCGGTCT 
1092 61 GAAAGGATGG GGCAGAAATT AACGTGCAAG CCCATTTTGG GGGTCGTCCG TCCCAGGCAC 
10 9321 CGAGCCCGTA CGTCACAAAA CACACGTAGC ATTCGGGGCA TAGTCCGATT GAGCGTATAG 
109381 AGGCCGCGAG ATCTAAGCCC AGCCGAGAAC CATCCAGCCA ACGATGGGCA GGATAAGAAC 
10 9441 CGCGCCCATG ACAGGCAGCT TCGTCTTCAG CCAGGTTAGG CAAGCGGCCC GAGGCCATCC 
109501 CCCAATTGTA C CGATTG AAT TGGTTAATTG GTCATCGCCG CACGCTCTCC GCGCCCAATA 
1095 61 TCCTTCAACT CCGACCCCGA AGGGCGGGCC ACGGAGCCCG TTCGCCTCAA CGCACCGAGG 

109 621 CCGTCGGAGT GACCCGTCAG GGCAAAAACC GTTCTAAGAA GGGTTTTTAA CCGTTTAGCG 
10 9681 CTCTTTGGAG TCACGACCAA AAACTGTAAA ACCTGTCGGT GCTCCGTAAA GTAGGTGCGG 
109741 CATATGACCA TGGAGCTGTA AACGTTTAGG TCTCCGGAGA AAACCAGACG TGCCTTAAAT 
109801 TTCATAAAAT CGTCCTGGCC CAGGGACACG GACGAGTTCC TCTCAAGATA CACGTCCGAA 
109861 TTTATAGGCA GGTTTTTTCC AAACTGGGCA TCGGCGTCAC GTGGCTTACA CAAAAAACAT 
109921 TTCAGCGTGG TGGCCAAACC GTTGTTGATA ATTACAAAAC ACGGGGCAAA CGGGTAGGCC 
109981 AGTCTCTCTA GTTTGTGGAG CCAAAACTTA TACACAAACT CGAGATGATA GACGCAGCCG 
110041 TGCTGCAGGC GCACG GTGCA CACGGGGACC GCCCCGCCTT TAGCGTATAC GGGAGCCCCG 
110101 TCCTGACACC TCTCCAAGTC CAGGGAGATT CCAGAGGGTC CCAGGTAAGA GACAACTAAA 
110161 TCGCACAGCT CGTCAACTAA ACGTTTTCCG GAACTCATCG TTATAAAGAT CCTTTAGGTG 
110221 CTGTGCGTGG CTCCCGTAAA AACCGCGTCC GTGCTAACGA TTTTGTGAAT GACCTGTTTT 
110281 ACGG CGTTTA CCTTGGCGTC CAGGACCATG CAGTGCTCAC AGTGAGCTGA CCGCGTCTGA 
110341 GCACG ATGAC AGAGGAAAGT TTTTAAATAC TGACAGTAGT TAATGGCGTT GAGCCTGGAA 
110401 TATATGGTGG GAAACATAAT TTTCATGTCA TCGGGCAGCA GGGACTCGAA CGCCAATAAA 
110461 TCGTCACCGA ACATCACGTG AGACAGAGGT AAAAGATGCT CACCGCCGGT ACCGCGTAAC 
110521 ACGCGACCAG CCACCCCTTC AAATATTTTA GCCTTAAAAA GCGGGCCCCC TAAAGTCGTC 
1105 81 CAACTCAGCT TAAAAACTCC TACCCATTAT GTTCGTCCCG TGG C AACTGG AGACCCTCAT 
110641 GAGACACTGG CCCTCTCTGC GGGGACTCGT AGAACAATCC TTCCTCCCCG GTACCCCGGA 

110 7 01 CGGAGCTTTT AACAGCCCGG TATTAATCCA CACTCAGGAC TCTCTACAAC CCGCCTCATC 
110761 GTGCAGGGTG TGTAGCCTCC TGTTCACTCT GGTCCGGACA TTCCCACCCC CCGACTCTTT 
110 821 CTTCGAAGAC T ACGG CTGGT TGTGCCTCAC CTGCCTATAC GCCCCCCGAT CATGGACGGC 
110 8 81 TACCCTCATG GTGGCTGCCG ACCTTTTGGA ACTAACGCAC GTGTACTTCC CGCAATGCGT 
110941 GAAAGATGGG CCAGTATACA CCGCCCAAAG CATCCTCGGA ATCGACGTCC AGCTGCACTT 
111001 CTTCGCAACC CGCTGCTTCC GACCCATCGA CAGAGAACAA ATACTCCACA CATCTCATTT 
111061 AAATTTTTTA CAAACCGAGT TTATTAGGGG CATGTTAGAA GGCACGATTC CGGGATCGTT 
111121 CTGTTTTAAA ACGTCCTGGC CGCGCACAGA AAAGGACGAC CAACAACCTA CCGTTGCGTG 
111181 TTGTTC CGTT GGCCGCGGAA GTCACACCAA CCGGGATAAC CGCCTACCCG AGGACCTGGA 

1112 41 AGAGGCGTTC AACTCCACGA ACGCCGAGGA AAAGCCCAGC CTCCTCGGCG TCTTTTCGGC 

1113 01 AACGTGGGCA GAATCCCAGC TTCTTGGCTC CGACACACAA CAGGCAGATA CCCATTTACA 
111361 ACCCTCCGCC TTCCCAACCC CAGAAGATGC TGACCAATCA CAGGGCCCCT GCCTGATGCA 

1114 21 CCCAACGCTC AACCTAAAAA CAAAAAACCA CACCGCATCC ATATGCGTTC TATGCG AG TG 
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1114 81 TCTGGCCGCA CACCCGGACG CCGGTCCGGT TCTGAAAGAT CTGCGTCGCG ACATTCTGGA 
111541 AAACATGGAA AACAACGTTA AG CTCG TCAA TCGCATATCG TACATCCTAA ACGATCCGGA 
111601 CTCACTGTCA CACGTGCGCG ACGAACATCT GCGCGGCCTA ATTAAACGGT GCTCGGCACA 
111661 AGAAATCCAC AAGCATTTTT TTTGCGACCC GGTGTGCGTC CTGAACACGT ACTCGCACTG 
111721 TCCCGCGGTT TTATTTAAAT GCCCACCTCC CGAAAAGTAT AAGAAGCTCA AAGCTCGTCT 
1117 81 GGCAACCGGA GAGTTCCTAG ACTGCAACAG AATATTTGAC TGCGAGACCT TACAGACCCT 
111841 GGCCGTCCTC TTTAAGGGGT CTCAACTGGC CAAAATCGGC AAAAC CACGT CGCTCGAGAT 
111901 AATCCGTGAA CTCGGATTTC AACTGCGTCG ACACAACATT CAAATCACCC ACCCGTTTCA 
111961 AACCTCCAAC CTATACATTT AATCTTCAGA AGCGCACCAG ACAATGCCAA AACAGCCCAG 
112021 AAGTCGATTG GCGTCTCGAG CGCCGTACGC ACCTAGCGTC AGGCGACCGG ACGGGCCCCA 
112 081 GTCCACGCGA CCGGCATCCA GGCACGGCAG CTGCAAAAGC GAAATCATGC AGTGGAAAAA 
112141 GTTAGTTTCA GACACGCAGT TTTTTTCTGC CCTAACGCGC CGCCACGAGC TGGGGGTGGA 
1122 01 CTTTTTAAGA GAAATGGGGA CCCCGATATG CACCTCAAAG TCCGTTATGT TGCCGTTAAA 

1122 61 CCTAAAAACC ATCGCCCCGG GTCGGTGCGT CTCTCTCTCA TCATTCGGAC ACTCGTCAAA 
112321 CATGGGGTTC AACTGTTCGT CGTGCACGCC AACTGACAGG TCAGCGGTGT CTCTGGACGC 

1123 81 AAACGCGCTC GGCGAAGATT CCGCCAGGAA AAACAG CG AG CTGTGTTCAG TGGCGTTAAC 
112441 CTTTTACCAC CACGCCGAAA AGGTCGTGCA GCACAAGGGC TTTTACCTGT CTCTGCTCAG 
112501 CCACTCCATG GAAGTCGTTA GGAAAAGCTT CACGCAACCC GGGTTGCTCT ACGCCCACCT 
112561 AGTCCTAAAA ACCTTTGGCC ACGATCCTTT ACCTATTTTT ACAGTCGATG CCGATGAGAG 
112621 ACTCGCACTC TGGGCGGTGT TCCACACTAG AGACCTACAC CTGGGGGAAA CCAGTCTGCG 
112681 ACT C ATTATG GACAACCTTC CAAATTATGA CATAACGGTG G ACTG CATC A AGCAAACGTA 
112 741 CATAATGAAG TTTACACCCT CGCGACCGGA CAACGCAACC GTGACGGTTC CTGTCAACAG 
1128 01 CATTTG CG AG GCCGTGGCCA CCCTAGACTG CACCGACGAG TTTCGAGAAG AAATTCAAAG 

112 861 GGGCACGGCC ATCATAAACT CCCAGGGGCT ATTGTAACCT TTCCCTAGAC GGAAACAGAT 
112921 GTAATTCACC TATTCAAATG TTAAGTTTTT GTTTTGAACA TATCACCAAT AAAAACAACT 
112981 TATGTTTACA CAAATTGAAG AGCGTTTCAA TTTTAC C ATA AACATAGCAA AAACCACGGT 
113041 AACTAAAACT CAATAGCGAT ATTCGGATGG GATCCCTAGA GAG CAATCG A CGTGCGCCGG 
113101 TATTAATTGA CGGGAACGCA CGTTAACTGC TCTCCACGAA TCGCAAACTC CGCGTTTTTA 
113161 GGACCATTCG ACGCCGTTAA TAATTGGCAG TAAACTGCGA AAGGG CGT AC ATCGCAGTAG 
113221 TGATTTTACG ATGTGCACAC CTTTCCGCGG TTCACCGCCA ACAACAAAAG CGGGTCACCA 
1132 81 GGCACCTAAA TCGCCCGGGC CTTTTATGCC AAAGTACAAA AGGGACCGGC GTTCTACGTT 
113341 CACGAACAAC ACTGTTGATC GCAAATACAA TAGTGTTGTT GGTGACCGCA AACTGCCAAA 
1134 01 TCCTAAAATG CATGGACATA GTCCTCAAAA TCTAGAACGA ACTTCCAAAA ACATGCGGCA 

1134 61 ATAAACAACC CCTTGCAGTT AATGCACATC GTAACGAGAC GGAATCCAAA GAACACGCCC 
113521 GATCTTAAGA CCACGGGCAC GTGTCTTTGG GTTCCGGGGG CGTTACGATG GAAGTTCATC 

1135 81 ACCAAGTATC CCATCAACCT GTTACAAAAG CGTAAAGATA ATCGGTTTAC AATAAACAAT 
113641 AAAAAATAGT GGACGAAGTT ACATTCCGAC ATACAAGCGG ACCCAAAAAA ACACCCCAAC 
1137 01 GCAAAACAAC TGGAGACTGT GTCTTTGGGT CCCTTCCACG TCGCAATGAG ATTTCTGGTG 
113761 ACCGTGTGCT G CTTAACG CG TTACAGTTGT GTTTTATGGC TACGTAAACA CACCAATCTA 

113 821 GAATGCTAAC ACGTGCACCG CGCTTAGGCG CGTTTGTGCA ACTAACTGCT GTCAGGTTAC 

113 881 TACTCTTATT TTTAACAAGT AGCATACCAA CTG C AATTAA TTGCACTAAA CCCAGTTTAG 
113941 TCTTTTTAGT CGAGTCGGCC CATAGTCTTC ATGGAGAACC ATCATTCATC TCCGTCAACG 

114 001 GCGTCGGTGT TACCGTTGTT ATTGCGGGCA AAACAGG CTC GGGGGAAAAA AGAGCAACAT 
114061 ATCGTCAGTT TAAAACAACT CAAGTTCTAA AATCTCACCC ACTCGTAAAC AAAAAACATC 
114121 GCAGTTAAGG TGTATGTACC GAACATAACA ACACAAGTTT TTTAAAACAC AGCTGCGGTA 
114181 AGTAACCCCC ATTGCCACGC GTGTCGCGTG CTAAGTGTTT TTAAAATTAC ATTGTG CGTT 
114241 TTACACACCG AG C AGTAATC TCAGGAGGGC GGTTAACGAG CGATATACAT ATTCCCTAAA 

1143 01 CACGGGAACG CGCGCTGACC GCCCTCCCCA AATCACAACA CGGGACTACA AAGCCTAGTG 
114361 TTAATATAAT CAAATTAAAA AACCACAGAA ACCTTTAGTC GTGCGCAAAC ACT AG C AAAG 

1144 21 GTACCTAGAG CTTTCCCCTA TACTTCAAAA AACAGCGGTG GGTTATTTGA CCACACGTTA 
1144 81 AGTAAACACC CGTAAGAATT ATTCCCGTTT ATCAAAATGG AAAAATTAAG GCTTTGCGTT 
114541 AAAATCTGCT AACGCAAAGG GCACTTAATT TTTCCAGTTT GGACTCGGAA CTTTACCGCT 
114 601 AACGTTAAAA TTTAATAATG CAAAAGGCAC ATAATTTTTT TTGTGTAACC TCAAAACTCT 
114661 ACAGCTAAAG TTAATAACGG CGACTTTGGG CAGCGAATCA GTGTCTGTCG CAAACCCTGT 
114 721 TAAATTTAAA CACAACAGGG AGCGCGACTA AACACGGACT AACTGCTCAT CGGGCCCTGT 
114 781 TGAAGAAGTT GGTGTTGCAA TGCATTAGGG AATCTAAAAA CGAAATCCTG CTCGGTAACA 
114 841 CGGAAATTGT CTTTAGTTCC CTACAGCATC ACAACAATTG TAAACCATAA ACGTACGCGC 
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114 901 GTGGGGTTGG TGTGGTGTGA AATGCTTGGT GCAACTGTTA CATGGGCGGA TTGTAAATGT 

114 961 GGTGCTTGGT GCAACGGTGG TGGGGTGCAA GTCCCCCGGT GGGGTGCAAG TCCCCCGGTG 

115 021 GGGTGCAGGT CCCCCGGTGG GGTGCAGGTC CCCCGGTGGG GTGCAAGTCC CCCGGTGGGG 
1150 81 TGCAAGTCCC CCGGTGGGGT GCAAGTCCCC CGGTGGGGTG CAAGTCCCCC GGTGGGGTGC 
115141 AAGTCCCCCG GTGGGGTGCA AGTCCCCCGG TGGGGTGCAA GTCCCCCGGT GGGGTGCAAG 
1152 01 TCCCCCGGTG GGGTGCACGT CCCCCGGTGG GGTGCAAGTC CCCCGGTGGG GTGCAAGTCC 

1152 61 CCCGGTGGGG TGCAAGTCCC CCGGTGGGGT GCAAGTCCCC CGGTGGGGTT CAAGTCCCCC 
115321 GGTGGGGTGC AAGTCCCCCG GTGGGGTGCA AGTCCCCCGG TGGGGTGCAG GTCCCCCGGT 

1153 81 GGGAGCGGCT CGGCTCCGGG GTGGCTCCGG GTGGGGGCGG CTCGGCTCCG GGGTGGCTCC 

1154 41 GGGTGGGGGC GGCTCGGCTC CGGGGTGGCT CCGGGTGGGG GCGGCTCGGC TCCGGGGTGG 

1155 01 CTCCGGGTGG GGGCGGCTCG GCTCCGGGGT GGCTCCGGGT GGGGGCGGCT CGGCTCCGGG 
1155 61 GTGGCTCCGG GTGGGGGCGG CTCGGCTCCG GGGTGGCTCC GGGTGGGGGC GGCTCGGCTC 
115621 CGGGGTGGCT CCGGGTGGGG GCGGCTCGGC TCCGGGGTGG CTCCGGGTGG GGGCGGCTCG 
115681 GCTCCGGGGT GGCTCCGGGT GGGGGCGGCT CGGCTCCGGG GTGGCTCCGG GTGGGGGCGG 
115741 CTCGGCTCCG GGGTGGCTCC GGGTGGGGGC GGCTCGGCTC CGGGGTGGCT CCGGGTGGGG 
115801 GCGGCTCGGC TCCGGGGTGG CTCCGGGTGG GGGCGGCTCG GCTCCGGGGT GGCTCCGGGT 
115861 GGGGGCGGCT CGGCTCCGGG GTGGCTCCGG GTGGGGGCGG CTCGGCTCCG GGGTGGCTCC 
115921 GGGTGGGGGC GGCTCGGCTC CGGGGTGGCT CCGGGTGGGG GCGGCTCGGC TCCGGGGTGG 

115 981 CTCCGGGTGG GGGCGGCTCG GCTCCGGGGT GGCTCCGGGT GGGGGCGGCT CGGCTCCGGG 
116041 GTGGCTCCGG GTGGGGGCGG CTCGGCTCCG GGGTGGCTCC GGGTGGGGGC GGCTCGGCTC 
116101 CGGGGTGGCT CCGGGTGGGG GCGGCTCGGC TCCGGGGTGG CTCCGGGTGG GGGCGGCTCG 
116161 GCTCCGGGGT GGCTCCGGGT GGGGGCGGCT CGGCTCCGGG GTGGCTCCGG GTGGGGGCGG 
116221 CTCGGCTCCG GGGTGGCTCC GGGTGGGGGC GGCTCGGCTC CGGGGTGGCT CCGGGTGGGG 
116281 GCGGCTCGGC TCCGGGGTGG CTCCGGGTGG GGGCGGCTCG GCTCCGGGGT GGCTCCGGGT 
116341 GGGGGCGGCT CGGCTCCGGG GTGGCTCCGG GTGGGGGCGG CTCGGCTCCG GGGTGGCTCC 
116401 GGGTGGGGGC GGCCTAAAAT CCTTACCGGT AAATTTAGCA GTAAATCCAA CGCAGTAAAT 
116461 CCGCAAGCTA GCCGCACAGA GGTGCGACTG CCTGCCAAGG CTCCTGGCGC CTCTTTTATA 
116521 GCGCTAAATG CCCTCCCCAA ATGGTTACTA TGGTTTAGTT ATGCAACCAT AATACCAATA 
116581 AATGAATCAC CAAGAATACG GCCAACGCGC CAGCGTCCAA CCGCCCTCCA ACCGTGCAGC 
116641 GCCGTATATT CGAATTGCCT TCCGCACGAA TACGGGGCGC AGCATGGTCG CGGGAGGCTG 
1167 01 GCATTGCGCA ATATTCCCAG CATTCCAACG GGCACG C AAG • GGTCCAGG CT ACCGGGGATT 
116761 CCCCAAACAT CATCCAGCGC ATAGAGGTTG TGGCTGAGCT GCCTATCTGG GGATTCCCCA 

116 821 AACCCGAGAC CCACTGAGAT G CT ATTCAGT GGGACTGCAG GCTGGGTCCT CCCAGGACAG 

116 8 81 GTGCCTGCGT GTGTGGTAAG GGAGTTCCCC TAATTAAAAT ATTATATTAA TAAATAAACC 
116941 AGGCTAAGCT GTAATTAATG AGGCACAGGC TGCCGACCTA GGTACCTGGG GAACCCCCTA 

117 0 01 ACGGGATGTA ATTAATTCAA AC CT ATATAA ATTCCACCCT GTTAGGGGGA TCCCCACGTG 
117 0 61 TACCTGAATT AATGCGGGAT ACAGGCTGTA GGTTTGGTAA TGGGTAC CAG GCTGGAGTGA 
117121 CTCATAC CAT GGGATTAAAT ACCAGGGATA CAAGCTGTAG GTTCGGTAAC GGGTACCAGG 
117181 CTGGAGTGAC TCATACCATG GGATTAAATA CCAGGGATAC AGGCTGTAGG TTCGGTAATG 

1172 41 GGTACCAGGC TGGAGTGACT CATACCATGG GATTAAATAC CAGGGATACA GGCTGTAGGT 

1173 01 TCGGTAATGG GTACCAGGCT GGAGTGACTC ATACCATGGG ATTAAATACC AGGGATACAG 
1173 61 GCTGTAGGTT CGGTAATTTA AACCTATATA AATTTTACCC TGTTGGGGGA ATCCCCGTTT 
117421 GTACCTGAAT TAATATAGGA ATACAGGCTG TAGAATCGGT AATGGAAACC AGGCTGTAGG 
117481 ACTAGGCTGG AGTGACTCAT ACCATGGAAT T C AATTAAC A AGGAAAATAT AATAGAATAT 
117541 ATATATATAT AGGGAACTTG TAAACAAAAC CCAACTCGCG GATTGGCTGC CTGTTTGGGC 
117601 CAACCAG CAG CGAGAATTCC CTGATTGACA GGCGGACTGG CCAATGGTTT GCGAGCATTT 
1176 61 TTGATTGACA GAACGGCCGG CCAATCGCAA CCGAGAACTC GGCAGCGAAG CAAAAGACAG 
117721 ACGGCCGCGG CGACCAATGG CCGCCGCGGG TTAGTTTGAT TGACGGCTTG GCCGGCCAAT 
117781 GGGAAACGTC CCTGCCGGCG GCCCCTAATC CCCCTGGCTT AAGGGATTAC GGCCGTAAAC 
117 841 ATCGCTGGCG CGGTGCCACC GCCGCCGAAC CCCCGCGCCC GGTTCGG CGT GACCTCGCCG 
117 901 CGACCCCGCC GCCCTCATGC GCACCATGGG TGCGATGGGC ACCGGCGGCA ACATTATTCG 

117 961 CTGACCGCCG GACGACCCCG GCGCCAAAGA GCGGCCGATG GGTACGCGCG GCGACCATCG 

118 021 AAACATTTTT CAATAACCCC AGTCTGACCC CGCGGCCCAA ATCTGGGGCC CATGGTGGCG 
118 0 81 CGCGGCCCCC ACCGTCCCCA TTTCCCACGG TTTGCTCATG GGCACCGACG GTGCCCGTGG 
118141 TGATATGTTT CAGTGACCCT CGGATGACCC GGGCGACGTT TTGCGCACCA TGGTCCCGAA 
118201 TTCCCCCATG AGTACATGTT TCGGTGACCC GCCGGCGACC CCGGGCGGCA AAAGCCGCCC 
118261 CATGGGCACC GGACGGCACC ATGTGCACAT CTTTCAGTGA CCCGCGGGCG ACCCCGGCGG 
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1183 21 CGGCACCGCG GGCGCTGGGG GTGATATGTT CTGATGACCG CCGCCTTACG TTTGCAGCGG 
118381 TGTCAGCGGA ACATATTCTC CCCGGCTCCC GAGCCCGGGG CCCGCGGACC CGAGTTGGGT 
118441 TATTAAAGTT ACGTTAGGGA TTAATTTTAA TCCACTACCC CCCTTAATGT GCGGCTCGGC 
118501 CTGCGGCGAC GCTGACGCCG GCGGGTTCGA CCCATTGACC GCATCCTCGG TGGCCGGGCA 
1185 61 GAGGCCGGCC GGCCAAAGGT GCGATCGCGG GGTCTGACCC ACCCG TGGG A GAAACCCCTC 
118 621 CGTTCCGTTG TCTTCTGCCT GCGGTCCCTG GCCCCCTGGC CCACCGGGGA TGTCTTGTGA 
118681 GCACTTTCCC CGGTGAGAAT AGACAGGAGA GTGCCCGCAA GACATCGACG GGGTTGCCGG 
118 741 TCGGTGAACG GGAGGCGCGC CGCTTGCGTG CGTGAGGTGT CGCCGGTGGG CCGTATGACC 
118801 CCCGCAACGC GAGGGGCCGC TGGCGCGGAA ATTTCCCCAG AACCGGCCGC GCCGCTTGGC 
118861 GGCGCGGCTT CCTCCCCCAA CGACCCCTAA TTTGGTTTTT AAGTTGTTAT TTAAAGCTCC 
118 921 GCTGTGTTTG TGGCACGCGC TTAAGCGTTG CCACCTGTTG CGGAGATCCC CCGTGCGCAT 
118981 CGCCGCCCTC CTCGCCTCGG TGCACGCTGC CGAGCGTGCC ACCTGTTTGG TTGTGTCATC 
119041 CCATGTGCGC ATCCCCCCAT CTTATTTCCC CGACCAGGCA ACATAAACGT CACGTCGTTT 
119101 GTAAGAAATA ATTG CTTTTT TTATTTTCGT TAACCCCGCA ACCGCGATAC AGTCTCTCGC 
119161 CTTTCACCCG TTGTTTTGTT ACCAGCCCCT CTTGGCGGCC GGCACCGGCG TTAACCGGGT 
119221 GCGTTGGCGG CCACGACCTT GGCCACGTCG CGCCTGGAGA CGGCCTGCAG CAGGTCACTT 
119281 AAAACCATGT AGTTTGAGGG GCTGACCAGG GCCGCCTTTT CCATTTGCGA GAGCCACCGA 
119341 AAAAAGGTGG GTGACTGGTT GCGCCTGCGA CCAAGCTGGT CGCCCGTTAG AAAAACTAAA 
119401 TTTTTCACGT CCCTTTCTGA GAGCTGGCGG TCAATGGAGA GCATCAGGTG TTTGTATGGA 

1194 61 CTAAAGTAAC TGGGGCTGGA CCGGAGCCTC TCGATAATAA ATCCCACGTC TAGTAAAAAA 
119521 AAGGTGGAGA CCAGGTCCAA TCGCCCAACG AGAAACACGG CCTCCAGCAA CACCGGGAGG 

1195 81 GGAATACCGG GTTCTCCAGT TGAGGGGCAC AGCCCGTTTG CAAACCCCTC GGGCGTATCG 
119641 TCGGACGCCG GCCGGTCAAA AAGCCACAGC ACGGCCTCTC TGTCGTCTGC CTCTAGGTGG 
119701 CGACCGAAGT CCACTAACCG CTTATGCGGG AACATGGGTC CGGTGCGGCG CTCGCCGACA 
1197 61 CCCTCTTCTA CCTACACTAG AGCGTTTGAT AAATCACGAC GAACCGCGGG CGTTACTTAA 
119821 AATGTGGTCT TCGTCAAATT CCCGCAAGGT GGTGGCAACC GACTCGGCGG CGGCCGCGAG 
119881 GGTTGCGGCC GAAACGCCGA CGGCCTCCGC CAATCTGGCC GGAGGCGGCG CAGCGGCACC 
119941 GAGGCTAAAC AGCGCGCAGG CGGCCGCCAC CAGGGACGGG GGCAGACCGC CGGTGGCCGG 
12 00 01 GTTCACGATG GCTTTGTGAA CAGACTCCAC CACCTGGCTG TGAAGGGCAA AGAGCTGCTC 
120061 TTTTGTAAAG CCGCTTTTAA ACAGGGTGGG CCCGATGGCG TCCGTTGGTA AGACGGCCTC 
12 0121 TAGTTTCCAC CTTAAAGCCT TAAGGATTGA TTTTTCCTGT TTCAGTAAAT CCGCCACGGA 
120181 AAAATCCTCC GCCGCGCAAA AG C AT AAAAA GGCGGCCTTG AAAGGATTCA GGTCTCTAAT 
12 0241 CTTGCCGGCT AAAAATAGAC AGGCGGCCCC CAGCCTCTGA AACCGCCGAC GGGGGATGCT 
12 03 01 GCGACACTCT AG AT AG CG AT C C AG AATACT AACGGCCAGT GGGAAAACGC TAGCATCCGC 
12 0361 CTGGTGGGCC CTGGCAACAG ACCGCATCCA CGTCCCCAAG AT AACTCG C A TGCCCACCGT 
1204 21 CACCTCCGTT TGTATCGTCC CAT AG CAGGC CGTGGATGTT ACAAAGGACG CTTCGTGGGC 
12 04 81 CAGGAGGTTA GAGAGGGCCC GGTCCTGGTA CAGGACGGGG TCAATGGTCC CGGTGGGCAC 
120541 TGGGCCAACA GAAG CCATTG TCAAGGCAAG GTAAAGAAAG CGATATAAGT CCTACCTGCG 
12 0601 GCTCACGGAG GCTTAATACG AACGTCCGGC TTAACACACA ATCTTCTTTG CGCTCTATGT 
12 0661 CCGCTCCACA CTGGTCCCAT GTCTTCTGCA GTTCAGCCCT GCCTGTTCCC GGAAAGCGGC 
12 0721 GGACTAGATG CAACGCGGAA GCGGCTGTTC ATTTAAGGTG AG TCA CAGGC GGGGCAATTT 
120781 GCCAACCAGT GACGCGTTTC CTGCAGTCTG GGTTTTC AC C TGGCAAGCGC CACTGGTTCT 
12 0841 TGGCAGCCGG CCGGTGGGGA TTTTTTTAGT GCTGAATTGG CAGTCCTCTG TCCATGCGCA 
12 0901 CTATGCTGGC CTGGAAGTGG GGGGCGGATG GGTGATGTCG TTCCCACCGG CGACGGGCCT 
12 0961 TTTTGACGGC CTTGGCGGTT CCGCGGCTTT GACAGAACAC CGTAATGCAA AAAGCCGGGA 
121021 TGACGGGACT GCCAGGTGCG TCCGGCAAAA CGCGGACGCT GGATGCTCTA ATGCCGGGAA 
121081 ACTGTAGGCA CCTTGCCAGG CGATACAGTG AATAGGGGTC TGAGTTACAA AAGATGACTC 
121141 CGTGACACCA CGGGCCCGGC CGAGGGGGGT CGATCCTGGG TCCCAATACC TGACGATAAA 

1212 01 ATCTTCTCTG TGGGCGGTAT TTTCTGGGTA CCTCTTCCTC TGATGAGGGA TACGGTATGG 
121261 GAATGGGGTA TCGCGGCGGG CGCGGGGGGC GCGGGGGGCG CGCCGGTTGC TGCTGCCGCT 
121321 CCTCCCCCTC TTGCTCCTGT TGTCTTTCAG GAGCTTCGTC CTCGTGGGAG TTGTGACTCG 

1213 81 TGCATCTGAG ACGTAAACAA GGAACTCCTC CGGCACGCGT GGGTGGACCC CAACCCCCTA 
121441 CGGTGTACGA GTGGGATGTA TGGCCAGGAA ACGGGGGTCC TGGCCGAGAC GGCCCGGTGG 
121501 GGTGGTCTGT GTGGCTCACC GCCTGTTGCG TGTTAGGGGA TGGGG CCTGT TGAGGCGTTG 
121561 GCGATTGGGG TGGTCCCGGT GAGTCGGGCG GGGATTGCGG AGGGGGCGAG GACGTCGGCG 
121621 TTGGCGATTG TGGCGGTCCA GGTGAGTCGG GAGGGGGCAA GGACGTCGGA GGCTCAGGCG 
1216 81 GTTCAGGTGA GTCGTGGTGC GATGGTTGCA AAAACGACGA AGGCTGCTGA TTGGGCGATG 
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.121741 GATTATGCGA CGACGGCGGT GCAGGTGAGT GGGGGCCGGG GTGTGACGGG GGTGATGGTG 
1218 01 GGGGCGAATG CGATGCGGGG G ATG CCGGGG ACGACCCTGA CGTGTTTTGA GGTGACAGGC 

1218 61 CTGAAGCTAA CACCGGCGCA GACCCGCTTG GCGACGGCGT TGGAGATCCC AACACGATGA 
121921 CGGACTCAGA CCCTGACCCT GATGGCGACC CTTGGGATGA CCCGTGCCTG GATTCGGTGG 

1219 81 CCGTTTCGGT GTCCGATCCG TGCCCTTGTT CTGCAACCGC CCGCACTTGC GGCCCTGATG 

122 041 TTGTTGG AG C GGGGGGCGCG GGTGCGGCGG CTGGCGCGGC CGTGCCACGC CCCCTGCCGC 
122101 GCCTCCCGGC CTGTTCTCTA GTGCCGCCGC GTCTTCCGCA ATGTCCCCTG CAGGAACTTC 
122161 GCAACCCGTG TCCCGATACT ATGCCGGAAC GATGTTGCCG GCTGCCCCAC ATGTTCGCGG 
122221 TTTACTGCGT GTAAAAAACG GCAGATTAAG TAGATTG CCC TTTCCACGAC CCGCAGCCAC 
1222 81 CTGAGCGCGC GCACGCAGAA CCTATGTTTA GTGCGATTGC GATTATGTCC GCTAGGTGGC 
122341 GGGCTATTTT GATCCTG CCG CGCACG CATT GACCGTTAGG TGGCGCAGAG CGCCGTTAAC 
1224 01 CGCAGCGAGC ACCGCACGGC GGTTTTAGTT TCGTTACTAT AACGTTTAGT TAAAATCCGT 
1224 61 GG C CG AT ATT GGGTGGCAGT GTGTAAGTCA CGCGCGCCGC GGCCCCCTTA CCTTGTCGCC 
122521 GGCGAACCGC GGAAAGTCTG TTCCGCCGCG CAGCGCGCGG CCGCTGCCGA AACCAACGCC 
122581 GCTTGAATTT CGCGGGCGGC TGGGCTTGGT GTCAGTGATT CACACAGATG CGATGTGATT 
122641 GGGCGTGGTG GTCAACGTCA CTTGTTAACC CGTAAATCTG GAAATTCTTG TCCGCCGCGC 
12 27 01 CGGTG AG CCG CGGCCCTAAG CCTTATAGTG CTGTGCAGCG ATCCTGAAAC TCGAAAGCTG 
122761 GCCATATAAA CCCCAGTATG GCGTTGCGTG TCGGTGGCAA CCTTTTTGAA AAAGATTTAT 
122821 TGCCGCCGGG TGTTAAGCAC CAACACCGAC CGTGCGTTTT CAATTATGTC GGGAGGAATT 
122881 ACATTAACGC TGCTGCTGGC GACGCTTGCC ACGGTTCGGT GCGCTCTTCA AACG C ACT AT 
122941 GCGGCGGTCC CCGTGCACTC TACCGCGTCC CTGGGGTGCG TGTTAACGAC ACCCCACGAC 
123001 GTTCTTATCG TTACCTGGCA AAAACAGGAA TCGCCTAGTC CCGTTAACGT GGCCACATAT 

123 061 AGTTCCGAAG CGGGCACGGT GGTTCAGCCC CCGTTCGCCG GTAGGGTTGA CATTCCCGAA 
123121 CACAAGTTGA C C AG AACG AC CCTGAAGTTT TTTAATGCCA CCCTGGAGGA CGAGGGGTGC 
123181 TACCTGTGTA TCTTTAACGC GTTTGGAGTG GGAAAGCTGT CGGGAACCGC CTGCTTGACG 
123241 GTTTACGTCC CCCTGTCCAT GTCCGTCACG TTTTACCCCC CGATTAACCC GACGCAGCTC 
123 3 01 GTCTGTCGGG CCGAGGCCAG TCCCGCACCG TCGGTCAACT GGACCGGCGT GCCGCCCGAG 

1233 61 CTGTGCAGCG AACCTGAAGT GTTTCCCCGG CCCAACGGAA CAACCCTGGT TGTGGGTCGC 
123421 TGCAACGTAA CGTCGGTGGA CCCCGAAGAC CTTGAGAACG CCACGTGCCT GGTCACTCAT 

1234 81 ATAGG CGGTT TGGCCGCGGC GCGGCCCCTG GACCCCGTGT TTTCGGATCC CCTGGAAGGG 
123541 ACGAGCCACT ACGTGGTGGG TGTGGTGGCA GCGGCCGCCG TTTTAGG CAT TTTTTTAACG 
123601 GGTGTTTTTT TGTATAGGTC TATGTGAGCG CGCGTGTCCC CCGTGTCTAG TGTTTTGTTC 
12 3661 CCCAGTGAGT GTCTCCATGA C AAATAC AAA TTTGAGGCTG GCTTTTTAGG GTGGTTTCTT 
123721 GTGCGACGCT TCCTGTGTAA CTGCATACAC CGGGGTGTCG CCAGGAAACC GCGTCTCCCC 
123781 TTTATGTCCG CTCGCCCTCC CAGAGCGAAA GTGAGAATGG TTCCTGGGGC GTTTTGGCGT 
123841 TGAGAGAGTC GGGCGATGTT GCCGTAGCGG CGTCTGCAAA GGCTCACCCG CTTCTGTTTT 

123 901 TTTTCTTTTT GTCAGACAAC AACATGGACG CCTTGAACAA TAACCTTAAC CTGCTGATGG 
123961 ATTTTCTGTC TAACTATTCG AATAG CT AC A GTAGTTATGA CGACAATATG TCTTACACCT 

124 021 TAGACACGGA ATCCACGCTG TGTCGGCTGA CGGTGGTTTT CCCACCTACC GTTTATGCGA 
124 081 TTATATGCTT TTTTATTTTT TGCATTACGC TGTTTGGGAA CGCGTTGGTG CTATATATTT 
124141 TTTTTAAATT TAAAGCGCTC GCCAACTCTG TGGATGTACT GATGGCTGGG TTGTGTTGTA 
124201 ACTCCCTGTT TCTGTGCGCG TCGTTTTTGT TCAGCTGGCT GCTGTACGTC GCGCCACAGA 
1242 61 TGCTCACGTC CGCGACGTGC AAGGTGGAAA TCTTTTTCTT TTACCTGTAC ACGTACTTTG 
124321 G CGTGT AC AT TGTGGTGTGT ATCAGCCTTA TCAGGTGCCT GTTAGTTGTG TTTTCCCGCC 
124381 GCCCGTGGGT CAAGCACGGG GCCTCCGGCT TTCTCTGCGT GTGTGTGTCT TTAATCGTGG 
124441 CGCTGGCGCT GTCTGCCAAC GCGAGCCTCT ATAGGACGGC CCTGCGTCAC CCAGAGACCA 
124501 GCGAGTGGAT ATGCTACGAA G ATG CCGGGG AAGATACCGT CAACTGGAAG CTGAGAATCA 
124561 GAACCACCAG CGCGATCTGC GGGTTTTTGG TTCCGTTTGG GCTGATGGTG CTCTTTTACG 
124621 GACTTACGTG GTGTATGGTT AAAAGCACGA AGCTGGCCAG AAAGGGAGCC GTTAGGGGTG 
1246 81 TAATTGTGAC GGTGGTGGTG CTGTTTTTAA TTTTTTGCCT GCCCTATCAC CTGTGCAACT 
124741 TTTTTGACAC CCTGTTGAGG ACCGGTTTTC TGGCCGAAAC GTGCTACCTC AGGGACGTGA 
124801 TCAGCGTGGC C ATG C AC ATA TGCTCCCTGC TACAGAGCAT GTATAGCGCG TTCGTGCCAG 
124861 TCGTGTATTC TGGTCTTGGG TCTCTGTTTA GGAGAAGGGT TAGGGATACC TGGTCCGTGT 
124 921 TTAGGTGTTT TTCCACTTCA GGTAG TTT AT GAGACACTCA CGCGACACTT GGTTGGATTG 
124 981 TTTGTGTACA TTTATTTTCA TTTTGTGTAC ATTTATTTTC ATTAAAGCGA TCTGACCTGC 
125041 AGACCTTACC TGACGTTTAC TGTCTGTTTC TTATGCACCA GAGGAACAGG GACTGGAAGG 
125101 CCAGCGCCCA CGGGGAAACT GTCATGTCCG CAAGCTCGGG CGGCACGTAC GCCCACTGCC 
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125161 AGGGGTGGAA CGTCACGGCG GGGTCGACCA 
125221 TGATGGCGGC CACGGCCGAG TTGGCAGACG 
12 52 81 TCGGGTCGGC GTTGCCCGTG TGGTAGTTCA 
125 341 ACTCCATGGC GTCGCGCGGG TATCTCACGC 
12 5401 GGACGAGGCC CCGAAGAACC CTAAACATGA 
12 54 61 TT AG C C AG AG GCACTCGTGG TTCCCTGATG 
12 5521 CGGGGGTTTT GGGTCCGGGC GTGTCGTGAG 
12 55 81' GGAGCTGGAA CCCAAACTCC CCGCAGCATA 
125641 CCATAAGGCG CCCATAGCAC CCAGGGTCGT 
12 5701 CCACGTAGCT GTCTTCGATG CCCACGGTTC 
1257 61 GGATGTTCCC TCCTCTAAGG TCGTCGGTGG 
125 821 TGAAGGCGGC CATCAGCGCC CTGGGAAGCG 
125881 CTGGCGGCCT CGACG GCGTT GCAAACAGAG 
125 941 GGTGGCCCAG GTTGTGTTGG TTGCACCCGT 
12 6 001 TGAAGCTGGG CCCCCATGAG TACCACTGTT 
12 6 061 CCGAAGTGAG GGTCTGTGAA TAGACTTCAT 
12 6121 ACAGACCCTC CTGACCCACG GTGCCGCACA 
12 6181 CCTGGCGCAC CTCGTCGACG TGCTGGGGCT 
12 6241 ACACCAGCCA CTGCAGGGGC GTTTCCTCTG 
12 63 01 CGGCCAGGGC CATCTCGATA GCGGTAACCA 
12 63 61 ACACCACCAG ACCCCGGGAG AGCAGGGCCT 
126421 TCGCCTGAAC GTCCGGTAGG GTGGGACTCG 
126481 CCTGGGTCAG GTGCGGCACG GGGAAGGACA 
126541 GGTCAGGTGT GACGCGTTGC CTGGACGACT 
12 6 6 01 CGTTAAACGG CACGGGGGCT ACCGTCTGAT 
126661 ACAGGCACGG GGTGCTGCCG GCCGACGTGA 
12 6721 GTTCCTTGCA GGCAAACAGT GTGTGCTGCA 
12 67 81 AACCGACCGA CGCCGTCAGC GTGATGTCGC 
12 6841 TGGTGACCGC CTCGCAGATG GCGTATCTCG 
126901 ATGCCTGTTC TCCATAAGCC ACGCACTGTC 
12 6 961 TGACCCAGGT TTCTGGG CTA ACCAGGCGTT 
12 7 021 GGCGATATAT GGGCACCCGC GCCATTCTGT 
1270 81 GGTAATCTGA CAGCGGCCGG CCCAGGGGTC 
127141 TGGAGCATCT GTCCACGTGT CGCGTGACAA 
12 72 01 GAAGCTGATC GACCACGGAG GCCAGGCGGA 

1272 61 GTTCGCCACC ATCCTCAGGT TCTTCTGGGC 

1273 21 CGTCGAACCC AAAAGTTGCG ATGACTTCGG 
1273 81 CCGGGTTCCA CAGCTCCAGG TCGTTCACGA 
127441 CAACGACGGC G AAAGGG CAT CCGCAGATGC 
127501 CCGGTTCCGG CTCCGTGTCG CGCGGGAGGA 
127561 GAAACTTGTT AACTAGGATG TCGTTCACGG 
127 621 GTTGGCTCAG AATGGTTGGT GGAAGCTGAG 
1276 81 CCAGGGCTCC CAGGTGGCCG ATG AC AG AG G 
127741 ATAGGGGCCC TTCCAAAATC TCGTAAGCCA 
127 8 01 CTGCGGCGCT CCCCAGGTAA AGCGAGCGCT 
12 7 861 ACCCAAGACA GATGATAGGC TGACTGAATA 
12 7921 GCACGCACGA TAACAGCGAT GTGGTGGTAA 
127981 GGACGGTTTT AAGAAAGCCA CCGAGGACAG 
12 8 041 GG AGTGTCGC TGTTGTGTTA ATAGCTCTGT 
12 8101 GCGTAGTGGC GT AAAATC C A CACAGGGCCG 
12 8161 CATGGAACAG AATGTTTTGC AGCGCACCTG 
12 8221 TAAATGAGTC AGCTATGTGG GTTACGTTTT 
12 82 81 AGCTCCTGCG GGCCATGACG GCGTATCTGG 
12 8 341 GCACGGCCAG CGGTAATATG TTCCGTTGTA 
12 84 01 CTCTCTCGGG CAACGGTCTC CAGGCGGTGG 
128461 CGCGCACATC ATATTCTACT AGTGTGCCAA 
12 8521 CCCGATCGGC GACAAACCGT CTCGTCTGGC 



GGGAGGCCAG GTGGCGCCCG TCCGGTGACG 
GGTTGCGTGG GTAGTGCTG C GCGAACATCC 
GGGCGATGCG CTGCTGCTGG TTGAGATGGT 
CCAGGTACCG GCCGTTGGCC CACCCTGGGA 
TGCTGATGGT GGTCTGGGGG ATGTGGAGGT 
CGTTCTCCTC CAGGTGGATG TCCCACTGGT 
GGGTCTCTCT AAGAAGACCG AGCGCCCCCA 
GTGAAAATGT ATCCGCTCGG CGGAAGAAGG 
TGAGAAGACC CATGATCGCG CATCGGGCCC 
C AC CGATGGT CAGACGGGAG AATCCCCGGA 
AG ACGG CCGC GACGTCGAAC CCGACGTTGG 
GGGCACCGGG GGTG AC C AAG GCGGCCACTG 
TCAGTTCGCT GTTTCTGCAA ACCTCGGCGA 
AGTCCTTTCT GT AG AGTTC C TGCGCGGGCG 
CGTCCGAGAA CGAGGTCCAG TTTGCCGCCA 
CGTTGTTGTG TGAGATGACG ATTCTTTCTG 
CGGTGGCCCG GCAGTCAAAG TTTTGGCACG 
GGATTTCGAA GATGACGCCC GGGGTTTCGG 
ACGGGATGCG AATCTGTAGG CCTCGGTTAC 
CCATGCCTCC GTCGCTGACG TCGTGGCCGG 
CTGTTAGCAT GAAGAGGTTG GCCAGGTGTG 
GCAAGAGACA CAGGTGCTCA AAGGTCGATC 
CCAGCACGAT GAGGTTACCG GTGG CCTTC A 
TCACCTCGGC GGTGGCCGTG ATGACGACTG 
GCTGTTGGGT GGCGCTGATC AGTTCTTCCG 
TGGCGAAGTT GATGCTCAGG TCCCTGCATA 
GGAGCCAGGC CTGGTCGTCC TCCGGGTTCC 
TTAGACGCCG CACGTGCGCT AG CATGATGT 
CGCCCACTGC CGCGTTGATG CCCATCTTGT 
CGACCCACCG GTTCCTGCCC CTGCCCTGGG 
CCGTCGCGTC CTGCCTGGTG ATCGGCTGGG 
CCGTAAACAC CGACGTGTGG TTGACAATGT 
CCACTTCGCA CTGCTGAGCC ACGAGTCCGT 
ACTCTTTGCT GCCAACCGTC GGACATCGCA 
ATAGGCCCCA GTCGGTCGGC GAGGGTGCCT 
GCACCAGCCA GTTGGAGGAA ACGGGCTGCT 
CCGATGTTGG CTGTCTGATC GGCCACGCTC 
ACTGAATTCC AAGCTCTTCG CAGGTTTCCC 
GTGCGGCCCT CCTGATGGCA TCCAACGGGC 
CGGCAAAGAC CACGCAGGCA GACACGTTGA 
TTTCCCGTGA GATGTCTGGC GTGGCGGTCA 
AGACAAAGAG GCGCAGGCCG CCCCGCGAGA 
CGGGTTCATG GGCGCGATTA ATG CTGGTAA 
GCGACAACGT TTGGGTGATG CGGGTAATAC 
GGTTGCCATC GCCGTCAACG GGTTCAAAAG 
AAGACCTCCT GAGGAATGTG T ATG CTTG AG 
TAACGTTCGC CTCGCTGGTG GTGGCGCTGT 
GCGTGCCGGT GCGCGCCAGC TGCTGGCAGT 
GGCGTGCGCG AGAAAATGCG CCCACTGACG 
ATCTGATTTC GCCCCCGGGC AGCAGCGTGG 
CACCTGTCTG AATGGCGCTG TGCAGGTGTG 
CGCACGGGAG GCGCTGGAAA ACGCAGCTGT 
ACACCGTATA GCTGTTGCCG TTGGAGGCCT 
ACGGGAGCAT CACCGCGGCG CATATTGGGT 
ATTGCAATGT CACCGGGAGC TGGCGCTGCA 
GTGTCAGGGC GGCCAAGGTG CCCGGGGGTA 
AGCTCAGATG TCTGCCCACC TCTAACCGCA 
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12 85 81 ACGTTCTATC GAGGCCGCTG TCTCTAAGGC ACTGCACTAT TTCGCTAGAT CTTGTGGTAG 
12 8641 GTTGGCGCAT CAGGTCTGGG CCATAGGGCA GGGATATAGT ATGATGGCCC TGGAGGTGCG 
12 87 01 CCGACCGAGT TTCCAGAGGT GCATAATTAA CCAGAGGCGC CAGAAGATGC CAGATAACGC 
12 87 61 CCAGCGCTGG GTCGTCATCG TCGTTGTCAA AGTTATCAGA GGGGCGTGTT ACAAGCCAAA 
12 8 821 GTAAAAATTC ACTAAAG CAC TGTTCTGGCA CAAGCGCCAA AACACTTTCT GCGTCCGAAT 
12 8 8 81 TGTCATGAAT AAAAGCCTCC TCTTCTGGAC TAAGGGCTGC AGCGGCCCAG CGAGGGTTCG 
12 8 941 TGCGTTGGGC CATATTTGCT AAGTCTAGCC GGCGCAAAAG CAAGCGGGTG GGACTTTAAT 
12 90 01 AGd&CGTATA GGGGTTGTTA GGTATGCGGT GGGGAGCCGT CCATGTTACA TGGACTTCAA 
12 9061 TATTAATTTA TGTTAGTTTT GGGCACCTTG GGATGTTACA CTTATAACTT CCCCCATGTC 
12 9121 ACAGTAATCA CTTGACAGAT CCCCTTCTAA CTCATAACAA CTGTTTCGGG GTGTTGAGGT 
12 9181 TACAGGTAAG GGTGGGAGGG GAATGTTAGG TGGCGAACCA AAGCATAACG TCGCGGGGCG 
12 9241 GGAGTGGGAG CCTGTAACCC CAGAGATGTC ACTTACGCTT GTTAGGCGAT CGACAACTAC 
12 9301 CTCCGAAAGT GTGTCCTGTT GGATGTTTTG TGGATTAGGT AAAACAGAAA GTTGTGGAGA 
12 93 61 AAGTAACCTT ATAGATGACC TCCTGCTGGC GCGTGGCTGG GTGACAGCTG GTTGTTGATG 
12 9421 AATGAACACT TCCTCGGGCG TTCCTGGGGT GGTGGGTGTG GTCTCACTAA GGACGGCCAA 
12 94 81 CTCAACACTA ACTTCCTCCC GTTCAGACCG TGAGCGGTTT CGTCTGTAGC GTCTTGTGGA 
12 9541 GGGGGTTGCC TGAACCCTTG TTGGTGTAAA CAATCCAGGT AAC CGGTAAA TGCACGCGGT 
12 9601 CAGGGCGATT TGTAACAAAC CTAAAAACGC ATAATAAGCT TACAAACGCA TTTTTGGTAA 
129661 CAGTTTGTAT AATTTAGCCC CAAAGTTTAT ACAAAGCGAA ACTACTTACC GATTAGTAAA 
12 9721 ATGCAAACCG CAAGAAGGAT AACACTCACT GATTCTGTTC CAATTGGTTC GAATTTAAGA 
12 97 81 ATGGAGAGTA AATTTGCCCC GCTGAGAATG CCTGGAAATA AGAGGCGCGT ATTTTACATG 
12 9841 TGGTACAGGG TAGAAAACCT TGTAGGTTTC AGTTGAGTCA CCCTGCCACG TAAACCCGCC 
129901 TAGAGTGCTG CACGCGGTGT TGGTTGCAGG AAGTACATCC TAATTTTAAC TCTAAAAGGG 

12 9961 GTTCCTTGTA AAATTATTTA AGTTATGGTG TACTTACATA ATATCCAATG AGTAGTTACA 

13 0 021 CTCCAAAAAG ATACTCGATA AAATGCGTGA TATGTGAAGT TTGCAACAAA TGCAGCGGTG 
13 0081 AGTAGGCCCA AGAAAATGAT AAAC CCTATA TGAGGAAATT TTATTAGTAC GTTTGTGTAA 
13 0141 CTGCAAAATA AAATTTTAAA TAAATACTTT ATATTCATGT TACCAAGTTT AATTTGTGGT 
13 0201 GGTTGTTCTT GAATTATATG TACTCCTAGT TCTGAGGCAA CTGTATGTAG TGCAAGCACG 
13 02 61 CTGCCTTGCA ACATAACCAA TAGGCATGTT CCATGATTCG AT ATTCG CCA TGC CATTTTA 
13 0321 AGTGCCACGC AGATTGCAGC AATAGCTATA AAAAAACAAA AGTATGTTAA TAGTCTATAA 
13 03 81 AGTATAAAAC ATTAAAACAA AATTAAAAAA TTAAATACCT GTATTTGCCG TAACTATAGA 
13 0441 GTATGATAAT AGTAAGGTGT AAGACTGTGT TGTTGTAGTT TCTGTTAATG CATAAAGTCC 
13 05 01 CACAAGTGCT AAAATAAAAA AATAAATGTT AGTTATTTAA TTTTTAAAAG TTTATACTAT 
13 0561 TCAAGTAATT CAAAGTTGTG TCATACCTGC GTAAATTGCA AAC CAAGAT C CAATTATTCT 
13 0621 TAGTATGTTT GTTTTTCCAC ATAG CCACAC ATAACCATAA AACACAATGG GCATTAAAAT 
13 06 81 TACTGGCAAC AGCGCAGATA CTTTGTAGTT TGGAGTCCAC ATAAGAAATG CCGTAAGAAT 
13 0741 AACAGTGCCT ATCCATG TTG GTCCCACAAG AAGACATGAT GTTGTTTTAC TAGAAACAAT 
13 08 01 TTTCGGTGGT TTTATAACAA ATATTAATAT GCCTGAAAAA TATAAACAAA CGATTAATTT 
13 0861 AAGGCCAATA TCATAGTAAA ACAAACTTTA AACAGGTAAA AAGTTACGTC TTACCTATGT 
13 0921 AAAATTGAGA TACGCATGCA ATAG CACTAC ACGTCTGTAC CAAAATACTT GCGTTGTCCA 
13 0981 AGGCCAATGC CGATAGTAAA ACACAACATG CTGCAAACAA GTACACAAAA TATAAGTTTA 
131041 ATTTGCTAAT TAAAATTAAT ATTTCAAATG TATTATAAAA TCATACCTGT AATCCATGGG 
131101 AAAAAAATTG AAGGCAACGG AGTTAATTGA CCTGCTCTGC TTCGATTACA ATACAAACTA 
131161 ATAGTTTTCC ATAAAAGAGA ACAACACGTA ACAAATGAAC AAATAATTGT GACTGCTTGA 
131221 TTAGAAGTGT CCCTGGCATG TTGTACCAAG GCTATTATCC AAAACGTTAA TGTAAGAAGG 
1312 81 TTTGTAAGTA ACCAACTACT CCAAATAAAA AAGGTATAGC TACACGCCAT TTTAAAATAA 
131341 TCAAAACTGT ATTATCAAGT TAAGGGGGAG TTTG AG CTTT AAAAGGTTGG TTACATCGTT 
1314 01 TCCAGGCAAA CCTCACACTG AGTCACACAA AGGTTTCTGT GTGCCCCTGT ACGGGAATCT 

1314 61 GCCTTATTGC TGATTTTGXT GGCAACCAAC CAAAGAAATC ATGGAAAAGC AGAGTGGGAA 
131521 TATTTGTTTT TGTTCGTTGG CCTGGATGCT TGGAAACATT CCTGTTATTT TGATGTTAGG 

1315 81 CCAAACTGTG GAAAACATTA AGAGTTGTTA TTGAAAATAT TTTTTGGTAA CAAAACCATG 
131641 TTTTAGGTCC CAAAAGGAAC ATAAAAATGT TTAATAGAAA CATGCCTAGT ACAGAACATT 
1317 01 TTGTTTGGTA CGTAAACTAA AAATATGTAT GGTAAGGGCT AGGGGTAAGG GCAAGGGGTA 
131761 AGGGCTAGGG GTAAGGGCAA GGGGTAAGGG CAAGGGGTAA GGGCAAGGGG TAAGGGCAAG 
131821 GGGTAAGGGC AAGGGGTAAG GGCAAGGGGT AAGGGCAAGG GGTAAGGGCA AGGGGTAAGG 
131881 GCTAGGGGTA AGGGCTAGGG GTAAGGGCTA GGGGTAAGGG CTAGGGTGAG CTAGGGTGAG 
131941 CTAGGGTTAT AGTTATAGTG TAAAATAATA GGCCTACGAT AAAATGTCAA CATATATTTT 
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132 001 TTAATTTGTG TTTCATTAAC AGCCATGGTG TATTAGAGTA AAGGTTAACG CTTCAATACA 
13 2 0 61 ATATACAAGT AAAAGTAAAC CACACGAATT TATAACATAT TTACAAAAGC GAACCAAGCT 
13 2121 GGCACATGTA ACTTCACGCT ACCCTAAACG TACAATACTG GGACTAGAAC CCAGAGGTAG 
132181 TTAGAATATA CGGTAGTTAC AG AACTTTG C AGTTCCCTTA GGCCAGCAGG GCTCTGCGGT 
13 2241 TAATTAAACA AAGTTTAAAG ATAACTGAAA CTTTAGGAAG TGCGTATGGG TGCAATGTGT 
132301 TCCAAATAGG GCAAGGGTTA CATAAACTGT TGCCTAGCGG CCGGGCCCGG AGGCGCCCGG 
1323 61 CCGGCGCCGC CGCCGGGCCG CGGCCGCCAT CTTGCGCCCG GGGCGAGGGT CCCCCGCGCG 
13 2421 CCCCCCGGGC CCGCGCGCCG GGCGCCGCCG GCCTCCCCCG GCTCCCGGCC CTCCGCCCCG 
13 24 81 CCGGCCCGGG CCCGCGGCCG CCGCCCCCGG CCGCCCCGCG CCGCGCGCCG GGCCGCTTTC 
13 2541 GGTTCGCGGG GCCGGGGGTC CCGCGGGGGG CCGGGGCCGC CCCCGGGTGC CGCTCCGCCG 
13 2601 GGCCCGGCCG ACTCCCGGGA GCGCCCCGGT CCGGCCGAGC CCGGCGGCGC CGCGCCCCGA 
13 26 61 CGCCCCCCGG GCCCGGGGCC CCACAAGCCG CGGCGCGCGG CTCCCGATGC CGGGCGGCCG 
132721 CCGCCCGGCA TGGCGGTCCT CCGCCGGCCT CCCCTCCCCC ACGCCGCCCC GAAAGGTGGT 
132781 CTCCGCGCCG CCGGGAGGGG GGCCGGGGCC CGGGGCGCCT CGGCGGGGCC CGGCGCGGGG 
132 841 CGCGACCGAG GGCCCCGGGA GAACGGGGGA TCGGGAAAAC GCGAGGGGAG CGGGGGACAG 
13 2 901 GGGACGGCGT GTGCGTGCTT GTG AG AC AC C GGGTACGGCT GCCTGCCTGC TCGCTGGCCT 
132961 GCTTGCTGAG GGGACAGTAG GCCTGCTTGC TCGCTGGCCT GCTTGCTGAG GGGACAGTAG 
13 3 021 GCCTGCTTGC TGAGGGGACA GTAGGCCTGC TTGCTCGCTG GCCTGCTTGC TGAGGGGACA 
13 3 081 GTAGGGCTGC TGGCTTGCTA GTAGGGCTGC TCGCTGGCCT GCTTGCTTGC TCGCTGGCCT 
13 3141 GCTTGCTTGC TCGCTGGCCT GCTTGCTTGC TCGCTGGCCT GCTTGCTTGC TCGCTGGCCT 
13 32 01 GCTTGCTTGC TCGCTGGCCT GCTTGCTTGC TCGCTGGCCT GCTTGCTTGC TCGCTGGCCT 
133261 GCTTGCTGAG GGGACAGTAG GGCTGCTTGC TTG CTAAGGG GACGGTACGC CTGCCTGATG 
13 3321 GCTTGATAGT AGGGCTGCTG GGCTGCTAGT AGGGCTGCTG GGCTGCTAGT AGGGCTGCTG 
13 33 81 GGCTGCTAGT AGGGCTGCTG GGCTGCTAGT AGGGCTCCTG GGCTGCTAGT AGGGCTGCTG 
13 3441 GGCTGCTAGT AGGGCTCCTG GGCTGCTAGT AGGGCTGCTG GGCTGCTAGT AGGGCTGCTG 
13 35 01 GGCTGCTAGT AGGGCTGCTG GGCTGCTAGT AGGGCTGCCT GCTGGCTTGC TTGCTTGCTT 
13 3561 GCTAGTGGGG CCGCTTGCCT GCTACTAGGG CTG CTGTGC A GCTGGGAGAA CAGAGTAGGG 
13 3621 CTGCCGGCCA GCTGCGTGCG AGGG CGTCCG AGGGCCAGAC GAGGACACGG GACCCGGGCC 
13 3681 TCTCCCCCGC CCGGACCGCC GGGCACCCGG CCCAGATCT 

SEQ ID NO 2 

CDS nucleotides 1353-2624 
SEQ ID NO 3 

MFVIjVIjFMLLQPVSVELLPAKLTSVPTWCPPHPGDTYIj 
SDRYACQTKTTTRSNNIDFRVSSSRLTLQERCSSYGYTYANT^^ 

LTEKTGGTYFCSAFIGNEKFYSQTINVFFTSFTFKPTNDIPNESHFNKTGQIQQTASVQHPENYWFSVPVFSIGVLTGI 

AISLIMCWLFTIRCNENSESSTNSYASQTSYIQPSHNQRSNTNECSRHTYRNAHQEESIEELPNQHTSETDSCCQLVLLE 

VKWAYDGPQENTINEVMEQYDDVWKNIEQTSYEDWEHMD 

NLDHNEYNHIiNELNMIEQYDWLE 



SEQ ID NO 4 

CDS complement (2692 .. 3258) 

/note="dihydrof olate reductase; ORF 2; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 2" 

SEQ ID NO 5 

MDITVNCIVAVDEQLGIGKNGTMPWPYLRNEMMYFQKMTSTPSW 
REPPHGAHFIiARTLDDAFNFYRQYKLKEQLNTVWVIGGKSV^ 
SE I PG KDTNFEENG I KYKFQVYEKWFNK 



SEQ ID NO 6 CDS 3676.. 5613 

/note=" complement binding protein; ORF 4; similar to 
Kaposi's sarcoma-associated herpesvirus ORF 4" 

SEQ ID NO 7 / 1 rans 1 a t i on = '.' MTFKLFPLFLLHAIMYVHCDENCKPPHFTE YRVKSNTEKDLYS V 

GETAEL I CRPG YVTNTKI ITTECLQNGTWSTPNFPCDRKRCPTPADL.LNGAVHIHGGD 
NALKFGSNISYECNEGYDL1GSNVRFCILQDTENVNWDSNEPVCEIQKCIKPPAVEHG 



40 



WO 00/28040 



PCT/US99/26260 



DYLPNQDVYNYGDAITFKCSLSYTLVGSTTLVCTSNKKWSNSFPTCLMIiVCESPQIDN 
GYIDIGLSRRYWHGQSITVKCSDGYNIVGPETLTCTNTTWVPPLPKCVLVTNNPSTPM 
PETPMPETPTPDYQKIWXiSTAKTATTPNAFVTTWSPEKDDVTCVKPHFERFMVKAEN 
DKEKYSVGASVELICRPGFTKMQSTVSVECLSNGTWTAPNAKCHRKKCPTPQELLNGE 
YIVTSGEDAFKTGTNITYKCNEGYQLLGSMVRICMLKDDLKTVDWEPKAPICDIEKCK 
PPPQITNGKYHPVKDFYQYLDTVTFSCNRDFSLVGDEMTTCISNTWNKPFPRCEQITC 
SAPNIAHGKLLTGSSSVYKYGQSVTIGCETGFTLIGSEISTCKDSSWDPPLPTCVPAV 
SMPSDTPKPETKKPNTPTPEAPKPNTPNVGTHTPFKPPPQNPPIAPPMSKWKRHWLV 
LFASVASLLFVLAAIiYCCFLK" 



SEQ ID NO 8 . CDS 6045. .9443 

/note= "ssDNA binding protein; ORF 6; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 6" 
SEQ ID NO 9 /translation^ " MASKGNAGQPLEDNQGSRAPIGACGYVYAYSKQDFPFAEAS ILG 

NRPSGSGVFSLPILYGLT\7EHEFPLTVKAAYKKVDTTTLAVKVTCFHREVIVFHNASL 
FRPVFDGTGLNELCEEAiiALFGYTQFIEPGPPHSIWNPLECPQLPDKDEMFLGVVVTE 
GFKERLWRGCLVPAVFQTQQVQIAGRQAFKVPLYDEDLFAPHGHRMPRFYHKDVSAYL 
YDSLFTSIAQALRLKDVTAVIHATEKQFMQDHYKIAKIVQAKQFSTTLPKTTDGSSHM 
IVDSWAEIiALSYGCMFLECPQDACELLNYDSWPIFDGCDSPEARWA^ 
HVAGQLFAANSVLYLTKVQKQAPRGQKGDVNVYNSFFLQHGLGFLNEATIKENGSEAF 
KGVPSNALDGS S FTP YHLAYAAS FS PHLLAKLCYYMQFLQHHKSSTNQAFMMVHYVGT 
AANSEMCTLCHGNTPATCItNTLFYRLKDRFPAVTTPQRRDPYWTGTAGTFNDLEILG 
NFASFRDREEDGNPADEHPKYTYWQLCQTVTEKLSAIGITEDHDNHVNLITNIQSFLR 
VFKGIDSIVDGEVMKFWSMIKNNFNFREHVKSVHHILQFCCNVYWQAPCAVFIiNI, 
KSLLWIIQDICLPYCMIYEQDNPAMGILPSEWLKMHFQTLWTNFKAACLDRGVLTGCE 
LKIVHRDMFCDFFDTDAGSNGLMAPFKMQVRIARAMMVVPKSIKIKNRIIFSNTAGSE 
AVQ S G F VKPTGTRDT YWAG P YMKFLNS LHRAIiFPDTKTAAjL YL WHKI S QTNKTP VX, K 
DVPDDELAELVSYVKTNSIiAFEETNVLDWPDSLMSYARIKLNGAILRACGQIQFYAT 
TLHCIjTPVLQTIDAEEYPHVIjGSAAIATPVAYIjAEIRGRTALTVQTTARQPVAATGRL 
RPVITVPIWVNKYTGVNGNNWFHCGNLGYFAGRGVDRNLWPESSPFKKTGVSAMIiRK 
RHVMMTPIIDRLIK^AAGQTISTFEAESVKRSVQALLEDKDNPNLLKSVILELIRHLG 
KGCQDLSSEDVQYYLGDYCMLTDEVLFTLDNIAQSGVPWTIEDAGALIEDRQDADDLQ 
FVDSDDIATASCQPPEEQLPTPSAGALLAGKKRKINAldiSDLDIj " 

CDS 9468. .11528 

/note= "transport protein; ORF 7; similar to Kaposi 1 s 
sarcoma-associated herpesvirus ORF 7" 
/translation - "MARELAAXtYAQLSALAVDLSLVIFADPRSIDGARlIiKTKTQIEN 
LNRDLLPIiIiREQNSVETSSLSLEV^HLAKNIEDKl.GELERSl.RORYSSREHFETLHLR 
PECHYHSTVTFQFYGGGLIDWMCLINDVELLCKRLGSVFYCIGANEALSGLNRVLTF 
LS TLRG I S P I PHPDL YVTS VPC VQCLRE I ELVPNQGS SLLAVLADRHCDHLCKKVRAE 
PIHGLFETELSQLGLKVTKRSDATQHGVRSSADQLRESSLAAIQDHNIFKRVSASIME 
LSNLIYWNAGQTGLQTGTENECSQr4ARLLTHEADMHEHRAIiITPKLSATHFYDCFRPD 
P IESLFCGGLFNS IDDTINALSRDCS VTFFQQANYTNVMRKQNELFTRLNS ILRQGS A 
GSQKPATPSEPRTTTVAATAASDVIKDAQYRKEQYMKKVARDGFKKLTECLQTQSAVL 
ANALCMRVWGGVAYGEASELVWHFLIiRRRFVALPWEARCRSDQILFENSKYIKNSIiYS 
QRLSREHVEIITLQFYGLITGPLTRQSDLFPGPAOTALAQCFEAAGMLPHHKMIjVSEM 
IWPOIQPKDWIDQTFNRFYQLPEGDLNAVQKSAWCFIRELVLSVALYNRTWEKTLRIF 
SL. AREKLSISNLDVKGLTSGLYLTYEQDAPLVLISQNTGWIFKDLYALLYHHLQLSDG 
HDDN" 

CDS 11515 - .14004 

/note= "glycoprotein B; ORF 8; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 8" 

/ translat ion= "MMITNRTRRLLRAWWIIAIGTAVGENVTTPKGATTTAKPTPGP 
STPTPPENPPRAEAFKFRVCSASATGELFRFNLEKTCPGTEDKTHQEGILMVFKKNIV 



SEQ ID NO 10 
SEQ ID NO 11 



SEQ ID NO 12 
SEQ ID NO 13 
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PHIFKVRRYRKVATSVTVYRGWTETAVTGKQEVIRPVPQYEINHMDTTYQCFSSMRVN 
VWG I VNTYTDRDFTNQTVFLQP VEGLTDNI QRYFS QPVLYTTPGWFPGI YRVRTTVNC 
EIVDMIARSAEPYSYFVTALGDTVEVSPFCHNDSTCSVAEKTENGLGARVLTNYTIVD 
FATRQPTTETRVFADSGEYTVSVJKAEDPKSAVCALTLWKTFPRAIQTTHEASYHFVAN 
DVTATFTSPLSQVTNFTGTYPCLNDVIQKTLWATIKKLSDTHATNGSEQYYETEGGLF 
LLWQPLTPLSLADEMRELNGTTPAPPTTTSTANRVRRSVGTNEQATDDLAAPQLQFAY 
DKIiRASINKVLEELSRAWCREQVRDTYMWYELSKINPTSVMTAIYGRPVSAKFVGDAI 
SVTDCVAVDQASVSIHKSLRTSTPGICYSRPPVTFRFLNSTTLFKGQLGPRNEIILTD 
NQVEACKETCEHYFIASNVTYYYKDYVFVKKINTSEISTLGTFIALNLSFIENIDFRV 
IELYSRAEKKLSGSVFDIETMFREYWYYTQRLAGLREDLDNTIDLNRDRLARDLSEIV 
ADLGDVGRTWNVASSVITLFGSIVSGFINFIKSPFGGMLMIIiVIVAWLIVFALNRR 
TNAIAQAPIRMIYPDIDKMQPSGGKVDQEQIKNILAGMHQLQQEERRRLDEQQRSAPS 
LFRRASDGLKRRFRGYKPLENEEAQEYEMSK" 

SEQ ID NO 14 CDS 14122 17166 

/note= n DNA polymerase; ORF 9; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 9" 

SEQ ID NO 15 / trans lat ion = "MDFFNPYLGPRGPRPHSHRGTDAPAPAGAGAVQPPPDVCRLIPA 

CLRTPGAGGMIPVTIPFPPTYFENGARGDVLLANERSMWTARDRKPVAPDPQDQSITF 
HAYDVVETTYATUDRCAEVPSRFQTDIIPSGTVLKX-LGRTEDGTSVCVWFRQQATYFYA 
KVPAGINVTHILQQALKWTAGRAACGFSTRRWKRILKTYDVAEHPVTEITLSSGSML 
STLSDRLVACGCEVFESNVDAVRRFVLDHGFTTFGWYSCARATPRLAARDARTAX.EFD 
CSWEDLSVQADRSDWPPYRIVAFDIECTGEAGFPCATRDGDAVIQISCVFYTTREGAP 
NPPNILFSVGTCDPIPDTDVLEFPSEYDMLVSFFAMIRDFEVDFLTGYNISNFDLPYL 
I TRAS QVYNLRLNE YTKI KTGS I FE VHE PRGGGGG FMRS VS KI KI AG I VP I DM YQ VCR 
EKIjSLSDYKLDTVARQCLGGKKEDVSYKDIPPLFRSGPGGR 

LKMFMIHVEISEIAKLAKIQARRVLTDGQQLRVFSCLLEAAARENFILPVPTPEGQGG 
YQGATVINPIPGFYDEPVLWDFASLYPSIIQAHNLCYSTMIHGRDLHLHPNLTPDDY 
ETFVLSGGPVHFVKKHKRESLLGRLLTVWLEKRRAlRRTLAACDDPSLKTILDKQQIiA 
IKVTCNAVYGFTGVASGLLPCINIAETVTJJRGRTMLEMSKSYVEALTTEDLRTRLGRE 
VTARHGARFRWYGDTDSLFIACDGYSAEAVSAFCDDLAARITADLFPPPIKLEAEKT 
FKCLLLLTKKRYIGVLLNDKMVMKGVDLIRKTACKFVQERCRAILDLVLHDPEVKAAA 
RLLCKRPPHAVYEEGLPAGFIKIVEVLNASYLDLRNSWPIEQLTFSTELSRPVCDYK 
TTNLPHLAVYQKLASRCEELPQVHDRI PYVFVDAPGSLKSDLAEHPDYVRQHQI PVAV 
DLYFDKLVHGAANILQCLFGNNADTTVAILYNFLNTVPYKLFS " 

SEQ ID NO 16 CDS 17261.. 18511 

/not e= "unknown; ORF 10; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 10" 

SEQ ID NO 17 /translation= n MLWELSWLGDWEVTFHRGRFSFVNLTRLQTFKGHGGYARVRL 

PFS LDQLLHQHF AFGLVTRLKELP PFSDCVAL I APLDSGGDADAARVAPGFVXDSSR P 
LTVWVNASGRHTIRFCLLFLKPIDLERAVTYVFGENGGARSEGTPKPTCATESLPGGP 
LRVSGEASQTSPHSFVAYFPTANSVACLSLLRLQTOPFSDDAAHRDARISPKYVTFSN 
SGGNVCKASVHTLSPSRCKTAQMEIIYAPGDPNAEIVLGQSGPVLPTHTGGRVLGVYA 
DAJEKTIQPGSSAEVRVQLIFQQGAAARGDLAFLVTGVAPEPLFVVTPALLI.SGCTTHL 
RLFNPNGTPTTI KRDTLVAAAAPCPWRLS S ADD APRDLVAS PDTGALS INAFT I PVG 
FPG WS AECHVSLRDNGVHERMNH " 

SEQ ID NO 18 CDS 18520.. 19749 

/ no te=" unknown; ORF 11; similar to Kaposi • s 
sarcoma-associated herpesvirus ORF 11" 

SEQ ID NO 19 /translation="MGTPVRFFRGEWQTSSLVDNGTPRYSSLVWAATIHDGYLTLVNR 

SELCVTERSPCLPACPSIGRLVGKRFPGFAFASATLGDRGTRTVFYAFGHRDNPLDIV 
PAWERADRELVLRVHAPQTTRVSRYGLKVFVAIVTVVRPPGVFLHFPQDRVPIALTD 
ACSQEGSRLTSEEPWIKIQGFPVLSDETAHPFLLTQKTKPFTERKFCRLIMDNDQRSA 
VTSTTVYLGKQHVRVTVTRPPETIVTDGPVTATLSLTGNAPIAFRHNPYFELPWSSTTAI 
FTPWYVGLTVCIPPNCSKFVRYGNTYVSAFNRKLTAIISNHAHNGGFRIQDCEWPPN 



42 



WO 00/28040 PCT/US99/26260 



REIEILVTNVSQAPVYISTGTQLGQAIFVFAPRFGGPAKLRQLLGHRSRALELPGGVT 
VDSQKLCRFETMYL.FST" 

SEQ ID NO 20 CDS complement (1992 1 20544 ) 

/note="R2; similar to IL-6" 

SEQ ID NO 21 /translation="MFPVWFVIiFyLSCWAASPTI,APPPTAAGIlTVLPOWAGNRASI J DR 

TRGRLSEVGLNIQRWFVYIjCHHSTLCRVREYPRIMSFVHFPIIiMSNVECQRREFRGAE 
CMN AMVRGLRAY E S YLTRLRMLLDDAPGDADAAAIG S AVTVVL S&LVSL I EEL P VNNK 
IGGAESNEKTVRALGGQSPRDWLSAFRILEYLQMFLRDGRRAIAMM" 

SEQ ID NO 22 CDS complement (20777 21778 ) 

/note = "thymidylate synthase; ORF 70; similar to Kaposi's 

sarcoma-associated herpesvirus ORF 70" 
SEQ ID NO 23 /translat ion= "MIVLVHLGICYVKKIIPVCVAGIAAARLRVFSAPEGAAAVRCAC 

RGDHGELQYLAHLDLIIKHGVQREDRTGVGTRSVFGLQARYNLRDEFPLLTTKRVFWR 

GWEELLWFIRGSTDSTELSRRGVK1W0AHGSRAFLAAQGFGDRREGDLGPVYGFOWR 

HFGAEYRGADANYEGQGVDQLRYVVDLINRRPHDRRIWICAWNP 

CQFYVARGELSCQLYQRSADMGLGVPFNIASYALLTYJblAHVTGLTPGDFVHTLGDAH 
VYNNHVDPLLIjQLRRTPRPFPRLKILRKVARLEDFTRADLSLEGYDPHPHIEMEMAV" 

SEQ ID NO 24 CDS complement (2224 5 ♦ . 22592 ) 

/note= tf R3; similar to Kaposi's sarcoma -associated 

herpesvirus K4 viral MIP" 
SEQ ID NO 25 /translation^ "MRGXiFVCVFFAVFACWDYAFPMGS MSG PAPELCCLGYVTHLPP 

PGLWSYSHTSSQCSVI^AVILNTRRGKKLCANPGDDAVKKLLQAVDKRPKKGRRTRRS 

LIDDSEEGLGSGI" 

SEQ ID NO 26 CDS 26846- -27409 

/note="Bcl-2 homolog; ORF 16; similar to Kaposi's 

sarcoma- associated herpesvirus ORF 16" 
SEQ ID NO 27 /translation "MAAVQGPPPPPEEENENSLPVDVYAlEGIFIiYCGLGQAEYLHHP 

VFSPIKEFI S AFLKDSARL YERLLRHTD YRS LRGLNAI GQGMLQ INTDGRHNWGRALA 

VLGLGAYVVDKVKDDERLLTFAIAVLPVYAYEAI.ESQWFRSHGEWEGI.RNYCERILRH 

RRNARRHMCYGVAAGLLALVALFAIRR " 

SEQ ID NO 28 CDS complement (27515 29125 ) 

/note="capsid protein; ORF 17; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 17" 

SEQ ID NO 2 9 /translation^ "MTPVYVGGYVDWSLPKIEKELYLEPSIVATLLPYTDPLPINIE 

HVPEAHVGHTIGLFQVTHGIFCLiGKXiTSHDFLiAIjASRLAGDSRAAQIQLNPMPRDPLL 
EMLHTWLPELSLSSLHPEELQDPNHPPAFQHVSLCALGRRRGSIAVYGPDPTWVVSKF 
DSIiTREEAGKITVNCLDLCERQVTPPEFAAPLETLMAKAIDAGFIRDRTDLLKTDKGV 
ARVARSTYLKASQFPCAQHCGNRDTRTMSALPEDNITIPKSTFliTMVQSSLDNMRNQG 
HRTYVSAPPSMPATAAYPSWIPPPELTVPSYAPPVAPPFPFQSAFAPQPSPYAATYYS 
PTYGYAPAPSRHQKRIOlDVELSDEPVFPGEEVGIHKDWlAIiSKNILDIQADLRDLKRA 
ASQTSGAQDADQRPQPPPVQFSWPQTYASAPYLAYQPQWYSGTDTHLHAPQPYQSAQG 
IQQTQPPPPQPASHHAGLATQPATPAPAAQESVMSNAIPSASAPRAGACPPLDPECGQ 
S ARAPVEAS AQPAPVSQI QKMFCEELLK " 

SEQ ID NO 30 CDS 28998. .29897 

/ not e= "unknown; ORF 18; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 18 n 

SEQ ID NO 31 / 1 r ans 1 a t i on= " MF I GRG S VYGSRVAT I EGS KYS S FS I FGRLTTSTYPPTYTG VML 

GRCLREPKJEMSAGLRGLMWRVIRCENIiNTFLPGELRFLHLVLCEMYNYGLNVYLLKEA 
IANTGTRDDIVLGRKVPVEFWKIVYDGLKEMGVSDATLLSETKRGALWLYFNGRPCLL 
KGLGDYVFCQLGLSHSVRWPENLTDGNYLYNLGSVIPCRLLVALSYCLAFWGHADHE 
PWVRLFAGKI FILYLI I SGHI MPRKS ILEQVGTSGYGGFVE AVCRDVRAVHG I PAWDF 
AS AAPALTSQQTDYIiFAFNNS W " 
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SEQ ID NO 32 CDS complement (29905 .. 3154 8 ) 

/note- "tegument protein; ORF 19; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 19" 

SEQ ID NO 33 /translation* "MRTSEKCCMRYPRKPARQITATFWAPHPNNVLFIHKPSLIEERR 

NAFVNRNQQIJU;RVHTLRJ<NLLRLELDNVLQTHQRETE 

RETANAQTSLNPQPKIAPQTHGDAFVVTIAPGDPGFTWQDLRLELLPSLYMNQNQWL 
PQYGPWYSSLTDNAMQRRVFPRDLRGTTNFQNSTSLKIiMSAVISTAASITQDFYAPVR 
NVSDTQAALCLLNGYYCHRTGTPLPPTRNGLWDNLGTKLATLVSHLKQNTKGLGFEFT 
YSNPRQRASLAPLNKETKYNADFFTNHVIYATLAQSGLLPGSKNPGTGQPPGPDLVYI 
LATTLF S EDVPP FQ AYQ WNLRAGL S ALGCLVX/VYVLLELAQ ITPRS PHRRLNLAS LLG 
GRFS KVEDPSGSKQYLKKGQLFDFLTENY I S P I LSRAPDAPTS FLFPGAYLAALEAKA 
ISHLKHTRPFVNLTGSRFNElFDILNQKLTFRDAGSLIQAQTSLRLTAEEGIiAAILSH 
PSPPGLAHEIMKSQFGVYDDYDRVYFLVX.GYLPVATSW" 

SEQ ID NO 34 CDS complement (31043 3 2 0 95 ) 

/note- "unknown; ORF 20; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 20" 

SEQ ID NO 35 / translation^ "MAFANQCKHVATLEALPASRKRAGTRAHLAVTRRLIKHRSLDDI 

LKFLSIRPTLRATKNVKFRIFFEVSLGRRIADC^TVNSEHQKTCYVIELKTCLSAAV 
FPGNAIKISQRWQGLHQLTDSVAYIGRAAPRGHENWSVRPWLLFKNQKTLKTIHTESS 
AFPPTFIOTTSAAJuNGFFSQWEDAHTOKMLYEIPTKTSAANYRNFLGPPSKQRSVYSQ 
T I SDRRKKKRVCDAKSTAGAKGSHAAKKPAPARTRQRAANAPTGNRSGHARPRNNS KH 
GRGSAVPGQGNRQCPNITKPATQNRPADTWRRVRCHNSPRRPGIHGKPGSPSGAPAKP 
VHEPKPMAATIRAWQ " 

SEQ ID NO 36 CDS 32094.. 33767 

/note=" thymidine kinase; ORF 21; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 21" 

SEQ ID NO 37 /trans 1 at i on- " MAEGGSG FGDELVRQMRDRKPRWDES SDDTDDVDTESTDLE YDD 

VFPWDTHGLMSPGSQNYDVPTSPSGTPWELLHPDALYAHPRCPPKRAWPGGGARPK 
VS AFS ARLQYVGRQS FGDRETRQLTGAQFSSESEHEYAEI PERTTTRPVESGD KRNFT 
SGRRGAISGPSSTKPSHGAGLTRKTKTSLSVSLKNLLRIKDDDAKVDVPRPVTVPVHL 
MQPHPMTEYRNAFLIYLEGVMGVGKTTLLNSMTGMVPQENVIiSCPEPMKFWTCVYSNC 
LKEQRSIVKQGTHGKLITSARVYACQSKFALPFRATAAGIGRNLQPWLVGNGSTKPAN 
WIVFDRHLLSATWFPLVHVKYNRLTPDHLFQILSLFSAHDGDVVvIiLTLN 
IQSRGRKEEKGITQNYLRQVAWAYHAVFCTWVMMQYLTPEQ^QLCVQTVSIEDICNM 
NSRIiTHRFLTLTKLHEQSMlPMVAEMDVSVKEHVTLMEVCLGLFKELRKLQILIVDAG 
EHLDDACGLWGNI YGQVMSNEAIKPRAVNWPALES YIQTLTKLEGNGAY " 

SEQ ID NO 38 CDS 33754.. 35868 

/not e= "glycoprotein H; ORF 22; similar to Kaposi's 

sarcoma-associated herpesvirus ORF 22" 
SEQ ID NO 3 9 / translation^ "MARISFIFFFTIIRCSVTDKYVYDEKSNVELEFNGTIYQINWRN 

VSKELTSIVMEDAWYDSLLLEPLSVTLEKRKSLLRSSIVNVKNIJDYTFCKSSSDHVIN 

LTVDFNYSSLPGFTGNFNVMTHALTQGVTjLTKREIjFTNSTNIMDLFYAEKINAEMFKI 

tfdysnviisgiitenwilvsvtnssvksnmqcvallfgvpstfpalkgyvsyrdi.lv 
vknsnyalgviapksyntldlaflpknftemfvsvidsplnaidylkgkllaieakga 
cqnpsnendilsfffevtavnflfiknlqkqqlvnvgcvvrhvaaleslmhllrlcyp 
tfklyelnletlshiaesqvfnlpansmlslsvndqewfsmfkivyntpkvggkiln 
eivyitnymytkysenyqltntfrrnvmnmyevlttiklnvtdssvfypyilftsmcn 
nveisyminqiakpdditifrvfspcflslrfdldenklrsdapqtskrtgselaqga 
sgfwrllhafhatrinefsvinctrlawkqvtalmpltnityvissvrpdharvyevs 
evflnsamfvsavypncshftppgtalhipilynfsaprigcplcdsivlsydenqgl 
qtmmyvsnptvqanlfspyspffdotnfhihylwlmnngtvveirglyrrhalsaial 
vfafigtmsalyflfklfsila" 
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SEQ ID NO 40 CDS complement (3 5 8 65 3 7 073 ) 

/ not e= "unknown ; ORF 23; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 23" 

SEQ ID NO 41 / translation^ ''MIKISDLKARLVGGAVQLSNGEYVCHVVYSSAIxAAlWGLPGPAV 

PLPLLFKKFSTIYSNMMPLYAPKRPELSMLRIMVSPHPYAIiNSCLCVGTDEGERGVSL 
FRDPVIRSSDFEDTPITWSKLVIASNSLFLHCRPFSVPATVKTPPVTLTNNKQITIN 
ELANTTQEYDPNAPPTLCSALPPDNKKXRSIIiKQPPATSESOTQSDCLI^IFFAMGS 
RQPQIGESPITAFNTVTIMQRANWSIMFLPNLKLKPIQHLFLKHVLLQRLGLENILFH 
FKMLYANTCKAAGPYQREYFESMLSRVKQRLEDMVFCLNSIESHDFQKDFRVLSRAPQ 
RLLTATDKYFLMFPPQNREI^IQVGAEVIESICDGTPLSEVLANLSPRVTIQKETGmJ 
LLKFYALLTV" 

SEQ ID NO 42 CDS complement (3 7123 3932 1) 

/ no te=" unknown; ORF 24; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 24" 

SEQ ID NO 43 / 1 rans 1 a t i on= n MLLQGPVLLPACPATVAANAPS PANSDFKTQliAI FCCI*ATNNE 1 

LEWVSLEALDRAMQTETTFYACRALRRIiVLGEGLYPFIHRQGGIVGKTGNEYAGPGLI 
IDDAIGCTFSHIETHTFLPTVFTYELSDTVLVQSDERILRSLYCSPLMVCGVNYQSMF 
RiriCRYLQIWEFEECFAAFTRTLPEHLIGTCYQNYFKLLEPFKILTLARCPPPCAKLH 
LNYLKFNILGFTSDWISHPELHRVQTVIIHNIESNPVLLKNI.SKQNKFQDIKVASELI 
IDYQNIVNQSIjDWLQVKINKiaDPGKKPYKV^ 

VAEHVCHSCDRLYPNTEFLGPGETPRVLEAMFSRIQYAPKDRDYNFIFNADQNPDRYE 
QARHDHQTEPLPDMFDPVKHMSLHNFKISVFNTNMVINTKITCWSLAGTFESIIDIPR 
LTNNFVMKKF S VKE PS FTVSVF YS DNLCNGAAINVNI SGDMLHFMFAMGNLRCFL PVK 
HIFPVSIANWNSTLDLHGLENQYI Vl^RGRRDV^ 

TISKIYGRPLLKKIiSDELNPILSVPYARIDQVKNTIFTTLETRNKAQIQTLHKRFIEC 
LV^CCSFLRLDLGAiNRAARLGTFDFSKKlISHTKSKHECAILGYKKCNLIPKIYVRS 
KKIRLDEIjGRNANFMS FI ATTGHAFSNLKPQVI RHT I RRLGLHWRHKAKI » 

SEQ ID NO 44 CDS 39323.-43459 

/note="maj'or capsid protein; ORF 25; similar to Kaposi's 

sarcoma-associated herpesvirus ORF 25" 
SEQ ID NO 45 /t ranslat ion= "MEAALEVRPFPYMATEANIaliRQMKESAASGLFKSFQLLLGKDAR 

EGGVQFEGLIiGVYTNVIQFVKFLETSLAVACVKTEFKDLKRMTDGKIQFKVSVPT 

GDGRRPTKQKQYIIMKACNKHHIGAEIELSTDDIELLFIDRETPLDYTEYAGAVKTIT 

ASLQFGTOAiERGLVT)TVLlSrVK^^ 

SYLMDNSFFLDKSDIAVKGKQYVLSVLSDMVGAVCHETVFKGTNTYLSASGEPIAGVM 
ETTENVNRKLLKnVILGQVDGGM^^ 

DRPNAQPSVDDDRDAVADGQDSLAKTPIAAAVIQIGDKLVALESLQRMYNETQFPFPL 
NRRMHYTYFFPIGLHMPRPQYSTSATIKGVEHPAEQSVETWIWKNNVLLSFNYQNAL 
KSICHPRMHNPMPCGQALGQAFPDPGHVIIRYGQRSEHPPNMNIj^^ 

VPDVALKATMTTDEIiLHPTSHETLRLEVHPMFDFFVHQQPGAQAAYRATHRTMVGNIP 
OPLAPNEFQNSRGLQFDRAAAVAHVLDQSTMEIIQDTAFDTSYPLLCYVIECLIHGQE 
DKFLINSPLIALTIETYWNNAGKLAFINSFPMLRFICVHLGNGSISKDVYAHYRKVFG 
ELWLQQAIiSKIAGHEWGRRPASELINCLQDPNLLPPFAYNDVFTNLLRQSSRHPMV 
L I GDEG YETENDRDT Y INVRG KMEDLVGDMVN I YETRNNADHDGRHVLD VGPFNENEQ 
HI^VLEKLFYYWLPACTNGHVCGMGVDFDNVALAL^ 

MGTLREMLEASDIHPTVDMIRTLCTSFLTCPFVTQASRWTQRDPAQLLTTHDDGRYV 
SQTVLWGFAAFAIADRSRDVAETMFYPVPFTKLYSDPLVAATLHPLVANYVTRLPAQ 
RVPVAFWPPALMAEYEEWHKSPMLAYANTCPMTPTSLSTLASMHMKLSAPGFICHAK 
HKIHPGFAMTAVRTDEVLAENLLFSARASTSMFLGQPSVMRREVRADAVTFEVNHELA 
SLDHTUjGYSSTITPAHVAAITSDMGVHCQDMFLMFPGDSYQDRTLNDYVKQKAGCQRF 
GGPGQIREPVAYVAGVPHSDNIPGLSHGQLATCEIVLTPVTADVTYFQTPNSPRGRAS 
CVXSCDAYNNESAERLLFDHSIPDSAYEYRTTVNPWASQQGSLGDVLYNSTSRQVAVP 
GMYS PCRQFFHKDAI LRNNRGLNTLVTEYAARLTGTPATS ATDLQYVWNGTDVFLEQ 
PCQFLQEAFPTLAASHRSLLDEYMSNKLTHAPVHMGHYMIEEVAPMKRLLKIGNKVAY 
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SEQ ID NO 46 CDS 43491.-44408 

/note="capsid protein; ORF 26; similar to Kaposi's 

sarcoma-associated herpesvirus ORF 26" 
SEQ ID NO 47 / 1 r ans 1 a t i on= " MALDKS I VVSVTSRIjFADE I ANLQSKIGC ILPLtRDAHRLQNIQA 

LGLGNLCSRDSAVDFIQAYHYLDKCTLAVLEEVGPNSLRLTRIDPMDNYQIKNAYQPA 
* ^ FHWDNYSELWIPPVFGRKDATVSLESNGFDWFPAWPEPIiAQTVI*QKI.LLYNIYYR 

VAETTPTDVNIiAEVTL YTTN I TYMGRNYALD VDP VGS S S AMRMLDDL S I YLCVLS AL I 

PRGC^LLTSLVRHNKHELVEIFEGWPPEVQALDLNI^SVADDITRMGALITYLRSL 

SSI FNLGRRFHVYAFSSDTNTAS CWCAYN " 

SEQ ID NO 48 CDS 44433.. 4S242 

/ not e= "unknown; ORF 27; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 27" 

SEO ID NO 49 / 1 r ans 1 a t i on= " MS I PKIMTVSRDNEGTVCE VAVDNGRHRAMI YYPKTTNLANERA 

DWKEAFDTETPVI)IVKQIWEGLAISKKNCV1*I^ 
MDPPGLVFAWPMGPKmnTKLPHPAIVAVGCGADA 
DSGQRAHYGIJU^LKAAWLVMGNVCPEPvVRQGAALLGPVmRTEWSD^ 
SRGVLWSPIHEKNLCRPTWNDVINTSVFTNESLCPNIPVVPESVIVLNGDA" 

SEQ ID NO 50 CDS ' 4 54 0 8 . . 456 83 

/not e=" unknown; ORF 28; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 28" 

SEQ ID NO 51 /translations ■• MTAHTNGVLTTTGFSTSQPESVQVSPFYRVITKPPVMGIjFFCVA 

MCVIALWYV^RRVCCKGRWADSCRDPRQPAYEMLNVRIiRPHGTNP M 

SEQ ID NO 52 CDS complement (4573 3 .. 4 6779 ) 

/ no te=" unknown; ORF 29b; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 29b" 

SEO ID NO 53 /translation="MLQKDAKLIFISSSNSSDKSTSFLLNLKDAHEKMLNVVNYVCPD 

HKDDFNLQDTWACPCYRLHIPAYITIDETVRSTTNLFLEGAFSTELMGDAATSAQSM 
HKIVSDSSLSQLDLCRVKSTSQDIQGAMKPCLHVYIDPAYTNNTDASGTGIGAVIAVN 
HKVIKCILLGVEHFFLRDLTGTAAYQIASCAAALIRAIV-TLHPQITHVNVAVEGNSSQ 
DAGVAIATVXNEICSVPLSFLHHVDKNTLIRSPIYMLGPEKAKAFESFIYALNSGTFS 
ASQTWSHTIKLSFDPVAYLIDQIKAIRCIPLKDGGHTYCAKQKTMSDDVXVAAVMAH 

YMATNDKFVFKSLE " 

SEQ ID NO 54 CDS 46905.. 47135 

/note= "unknown; ORF 30; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 3 0" 

SEQ ID NO 55 /trans 1 at i on= "MENDTPKDKISEADFQQCQAFFHRPIRDLISSGADALNHFSLSE 

SDGHKLERIVLLLDLVGTECLSYTTIAAKNVK" 

SEQ ID NO 5 6 CDS 47093. .47746 

/note=" unknown; ORF 31; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 31" 

SEQ ID NO 57 /trans 1 at i on= "MSLLYHDRCKECQMTRVNSPI CRFHNVSNLYQCLDCKRYHVCDG 

GRNCVIVYTRENLVCDLTGNCVLDNVQDVCSYGPPERRVPDAFIDPLVSHGTRECLKS 
DlIiRYFETVGVKSEAYSTWKNGQLNGIIGRLIDATFNECLPVMSDGEGGRDLAASIY 
IHIIISIYSTKTV^DNLLFKCTRNKKYDHIVKTIRAQWMRMVSTGDPSRVSATGCFT" 

SEQ ID NO 58 CDS 47683.. 49077 

/ not e= "unknown; ORF 32; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 32" 

SEQ ID NO 59 /trans lat ion= "MDAHGLNRRSVAGQCDGLFHVILPRGFILANNITCGERQRFFAH 

TWFAASGRTSKTLYVWGRVFQNTDPGRGDGPSGPWSGLAISLPLFTTNGKFHPFDWI 
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LKADTPDSGSSWTVKFXjYMSLI^YRNAMRGLKDKVSQCTDAAVDGEVHPLTVLKE^ 

vspdtatrpvsacnplqmltgllqsrvrddyvthhralerpgnvrgqviaptrtempn 
gspsrvrlgfrppkqanypktwaoarhvfssrayyvcvydneeldtkwqrqdprpl.pl 
dwsdpvayllegdlflgakqnafvdsiektcrcqnytikqffpvlinrdnetvdlike 
hfieacfvirnqvsersawvkaalfrndsntywkdvlglwehgphklgtaiklptsep 
cnadvnwswllcdeditrsisgqstvclwsptltawlvlpggfvikgrydlssedlm 
fvasryghpasshs " 



SEQ ID NO 60 CDS 49049.. 50059 

/ not e= "unknown; ORF 33; similar to Kaposi's 
sarcoma -associated herpesvirus ORF 33" 

SEQ ID NO 61 / trans 1 a t i on = " MATQRRH ILKSFLNKE C I WLRHPGTS AFVRVYTATTAHS AVFD P 

PVTSENAMSLNFLNVMIVIMKPKEFGPCVTVYMNGDILDFCATESVAIRDVPGRADLC 
LIRFGTLSNAPRSVPIPGPLNPHPRETVPGLTKQEIIYTSQTVPRGQIPDAIKGKEFH 
QINPFLWFDGGAFWQLFLSVDFMLLCPALDTVPSLARIVGLLTQCDKSTCKICTGAHV 
HVNPYRG YTP PDSQGTS PSCPCLI S CGARRAADVLVTGHVNLLGLLFDPKAS PKVTKL 
RLKRNPRPVPIEDAMSGVTAEGTEVQPTSLPWALIRLPDLASRVMLYGCQNLKSICLR 
SY" 

SEQ ID NO 62 CDS complement (49977 . . 50960) 

/ not e= "unknown; ORF 2 9a; similar to Kaposi's 
sarcoma- associated herpesvirus ORF 2 9a" 

SEQ ID NO 63 /translation- "MLLTSYRERLQNNLRWTDGGCENWFRQPPVIISGNDKTERMAH 

PCLGVIHAVNAYSSVLDDYLQTYRRVQEPMPAPTLGKPRISSHATLPRLTEELTNYLK 
QTCCRVQMANAKDQYMEYQSAQRTHEAFLECPVYAELRQFLANLSSFLNGSYVPGVCC 
LEPFQQQLIMHTFYFIASIKAPEKTHQLFATFKQHFGLFETTDDVLQTFKQKASVFVI 
PRRHGKTWIWAIISVT.LSSV^NVHVGYVAHQKHVANAV^ 
KKENGTIVYASPGRRPSSLMCATCFNKNVSRCFLSSGSRIASRDWLNPAGE" 

SEQ ID NO 64 CDS 50959.. 51942 

/no te=" unknown; ORF 34; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 34" 
/ 1 r ans 1 a t i on= " MFPS SFLNNGHPETERRF VKGVQLALDLCDNTPGQFKLVETPLN 
SFLLVSOTLPESRPVRDCPQPEGFDFEHIHLPKLTRMQRVLGRYCDHVNNDDTCVNVK 
ASSSNSQGALFYLPYGQDEWNWALTLRKDKXiVKMAVEGLSNPTTWKGLEPVDPLPLIW 
LLFYGSRSFCREPECLYERNFGMKGPILLPPHMYAPQKDW1TFVHHVIKYVKFLYVNA 
GGGLETEPSPPFEASRLRAAIARLGDVEADDAYLSAKCMLCHLYKQNDTISIHETHVG 
GVIALGGDGARYITSSVRAQRCTSRGDFVLIPLYNIEGLVSMIREHGLGSS" 

CDS 51923 . .52372 

/note= "unknown; ORF 35; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 35" 
/ 1 r ans 1 a t i on= " MASAAAKKMLI KSELESEINKKLS IS VFDRFGADS AVFNAQYKG 
TRESLRSYNSLKKKDDLATWGTLETSLREKQSELGLLKGFNRKKIEEFDAVADAVRD 
LKDEL YGELE I LGTLDNES VPVEEES PKDD 1 1 RWKLERLPRVCPKS P " 



SEQ ID NO 65 

SEQ ID NO 6 6 
SEQ ID NO 67 



SEQ ID NO 68 CDS 52278.-53585 

/note= "kinase; ORF 36; similar to Kaposi's 
sarcoma- associated herpesvirus ORF 36" 

SEQ ID NO 69 / translat ion= '^LFPWKKSPPRTTLLGGNWSVCPECAPKALDPIPKVQTDVDRT 

ASSHITVIKTRKTIAQLKIPNNWGQCSHQATDWTAVLGRGSYGWRSMSLGRCVKHFG 
SRREFFYECIFNDIVRACREKHPLNRGGDRILCFLEPCVPCRALIFPQLTGNLLNADL 
KHWPERLAVEFSELREGVSFLNNICGIVHCDISPENILIKGELTTAYGRLMIGDLGS 
ASLHTGTPWTGVMVTSKLGFVQHTYHFKAPARFICKHIYRPSCLLYRCLLSCAGGPQA 
HMLNQPFQ ITPQLGLTI D I SSLG YSLLACLEKYLQPADPFPQQG ALAD AS S E S AHPLF 
YLRCMVPRWIAEIFSVAWDVPLDLGIDSSGHAPAIPLREAYRRFFANQCSLYRAQYK 
EDALENASSRLCNSKLKLVLQKLLVRDYFSHCGNCGDHGFFLR" 
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SEQ ID NO 70 CDS 53566.. 55008 

/note= "alkaline exonuclease ; ORF 37; similar to Kaposi's 

sarcoma-associated herpesvirus ORF 37" 
SEQ ID NO 71 / translati on= " MDFFSDEP^fV'QE^IALLDIDEQQRLLSKMSLAWFIJKI^ER VRAFFS 

DNKKEISMPAIRFVYNFYLFAKVGDFIGNTDVYDFYVTCVFRGRRLTRLSEVYDACLN 

MHPHDRHHVCAlil EQVTRGQNINPLWDALRDGI I S S SKFHWAIKQQNS SKKI FNPWP I 

VlTbJHFVAGPLAFGLRCEEWKKILATL^ 

VRSDPKDGLEFHPDCKIYEIKCRFKYTFSKMECDPIYAAYAKLYQKPSMQTLKGFLYS 
ISKPAIEFVGEDRLPSESDYLVAYDKEWEVCPRKKRRLTAVHHLVKKCMIHNSTAPSD 
WILSDPQETGGQINIKAHLSANLFINVRHPYYYQVLLQSIiWQEYISLSKGTKNLGT 
QKNFI ATGFFRKRQFQDPS CCTIGEFAPLDPHVE I PTLLI VTPVYFPS VAKHQLVKQA 
TEFWAASAREAFPELPWDLSSLCANAPPTP » 

SEQ ID NO 72 CDS 54963.. 55172 

/note- "unknown; ORF 38; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 3 8" 

SEQ ID NO 73 / translation^ " MGFILSVCKRPTNTVDVKGEPIDVSKEFDPI1GEESIVLLTADG 

TAPAALYKPKTKPSKHKNNKLSDFV" 

SEQ ID NO 74 CDS complement ( 552 5 5 563 91 ) 

/note= "glycoprotein M; ORF 39; similar to Kaposi's 

sarcoma-associated herpesvirus ORF 3 9" 
SEQ ID NO 75 / translation= "MKISRSDSFILSSWVKLLVILGLMFIMSAWPLTATFPGXjGFPC 

YFNTLVNYSALNLTVllSSAKHLTPTLFLEAPEMFv^ 

FKAKKVHATTMTSLQTWIVL2GS 

HFCLSFTHVQAMISCNSATWSLRVLEQQIPENSIjLiDTIjLRYGKPIGANLYLSLIAMEM 
LVFSLGTMMAIGNSFYMLVSDIVFGSINLFFVXiTIAWYINTELFLVKYLKHQIGFYVG 
VFVSYLILLLPWRYDKVFISASLHKVIAWISMIPITCIIAIILRIIRNDWKWCAKS 
PEYAPLPQGPKEKTTKVKYSPELNALYETEEDVSDYEDAYPKYI" 

SEQ ID NO 76 CDS 56526.. 57932 

/note="helicase/primase; ORF 40; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 40" 

SEQ ID NO 77 / 1 rails 1 a t i on= " MNARE VALTGHVLHI S LHSTHEREKLI I WQVHLL VCQQCG I QGD 

AAYLFVTETLSNTDWGNI PAINRHAPS INEHGRNYMQWEIjRTRLRNPI I QLLSRQPGA 
VXTVTiVSEPNrWIVGCERALDHSCSVRVTGAYLHCDTTMDFSIiDSVVSPTREFWFSEMF 
SH CXiVSNI EVYLKTTGGLYYRAS S ATQCRKRAKDG ALG I LD I FNCESRE I QVAGQKYT 
LiSIATATFHVljWVDEACMWNGAIiAEFFRALHNKLFGDREGVAPTLTYVCPGATPEGTP 
FPPYFSAFPHLPLVFGRPRRLDVTAVQELPKAQIAVHWPPFKDSILGDQLLIPGISPK 
KPGTVPVRWPLWVED VNLSLCETTESVARI VDPHS IVI IKFSSLLCQHLKCHRAFVKN 
ELEYIATICSSDLRLFIQEEYNRLLATIFTWAAASGYTWAAIDKTTVFIKAPQLSAAV 
SGFCPSLNSCRRKQCYEG " 

SEQ ID NO 78 CDS 57917.. 58528 

/note="helicase/primase; ORF 41; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 41" 

SEQ ID NO 79 / trans la tion= "MLRRLKITVliFLSQEQQKVVTRLEAHDGLPVQETSHPPDWLKCE 

VCSASVFLKIPAGVIiYAGLARDPTREAKRDSWLDCI/VEGATLLLNNSVLPIGALAGIL 
PTLFANRRCVNFWLLPRAWVKSAPICPPLPIDCVTPPQFWTKRGPICWYKEWPLPVD 
VDFMYYLQEALCVFSWSNGEGTESHADNIRQLEKFEKVLCLF" 

SEQ ID NO 80 CDS complement (58525 .. 5934 3 ) 

/note="unknown; ORF 42; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 42" 

SEQ ID NO 81 /translation- "MDQILKRLMGEQHRSEAVMPETECSSRGPYNYPVFPRLMLEVHK 

KNSICMASNTPKLCVRGRLNVPDLGVHVRTRLQSATFTGFVFACWEHEDMIDALDIY 
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PHVFSDRVQLFKPASASVTELCCILSMLENYDKPPLSFIIiSALDRARYLHERYTCNDS 
AFVXiYGIEVIASTIjAAYHELNPPQGILRVPPLVRFKIjHKLLDENADDMKGIjLKPIYIjE 
SFRLTENVGEEEGHAETFNIFYCGTIFTRHLHNASVLKYFQITSLHSIPRQTLF" 

SEQ ID NO 82 CDS complement ( 592 97 .. 61027 ) 

/note="capsid protein; ORF 43; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 43" 

SEQ ID NO 83 / 1 rans 1 a t i on = " MFKMNPGFGS TCLVHPTEIjS I SLFE I L»QGKYAYVRGQTLHS SLR 

NPGIFGROLFIHLYKTALGSCTYDNVLKDWTNFETTLKTRWRGVEHLTPEFKRSTFES 
WARTVRLTVDQLLLNT INQVliHTRTVLS YERYVDVAA^AXjGLVP I VRRTPDGDT IARI Q 
AHCQQMRKTYASGDVTI SRIVDKLAQEITSIMTDVTS IYIPDYAE VSVEFNGDKAAYL 
GTYRQKDITVEWSRPIIYNGRVAFDSPIiYRLFTAIMTCHRTAEHAKLCOLLNTAPLK 
ALVGSTCTTOMYKDILARLEQSSQKTDPKRELLNLLIK^^ 

SQNIVDKNKLFGTGTESTTQGLRKQVSNTVFKCLTNQINEQFDTISNLEKERDDYVKK 
IQCIETQLLQSLPEGGRPRHDINILTQNTLQALSGLRDPTIN1.SECHIPKGSSVVNSF 
FSQYVPPFMEMLKELTSI.WEGEMFQTYNLTPVVDNQGQRTSIAYSQDTVSILLGPFTY 
IIAKLTHMDLINHSLISLSLHDIADQLYVDSRLSVYINDIGHKYCEQISQPGTDGPNT 
EASNGGAAPI n 

SEQ ID NO 84 CDS 60966.. 63338 

/note="helicase/primase; ORF 44; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 44" 

SEQ ID NO 85 /translation^ "MESSVGWTKHVEPNPGFILNMTSDAKVRGVVDHVSRIiSNITTSP 

PEMGWYDLAFDPAEDSGPFLPFTVYLITGTAGAGKSTSISALYQNLNCLITGATTIAA 
QNLSRRLKTFCPTIFSAFGFKSRHINIAVRKAHQTGAVSIEQIQQQELSKYWPVIVDI 
MKEVKAKKPNGMYGTISNANFETIjSRMTGPCLWTSNIIVIDEAGTLSSYILTTVVFFY 
WFLNSWLNTPLYRQGAVPCIVCVGSPTQTNAFQSTYNHGTQKTEISSCENIIiTFMIGK 
KWSEYVHLERNWALFINNKRCTDL.QFGHIiLKILEYNLP I PDEVMS YVDRFWPKSKI 
MDPLEYIGWTRLFLSHSEVKA.YIiTWLHTCI,TLGGDTRDTKLFTCPvVCEVFVKPFEEY 
KRAVNLTNLTVTEWVTKNIjFKIjSNYSQFvX>QDMSIV 

VSI^GKTKKCICGFQGTYFEFKRILDSELFVETHSQDRPEYVYGFLNTLLYNAMYSFH 

AYGVTRSHEKYIiQDLKFAPLPAALATGRVDLQTVREELNLEDDIFYHVCSPPPPAGIT 

SLQVLVDTYCAXiKDVFASRIKVACRWFGGEFEKETFSAFTWMVVRDGVIDF^ 

NGLLAFASTVESYKIKGYTFLPVAFGRCQGLPLSDDIiRKKMPSLWQDSSGFIACLEN 

NITKLTETMEDGSVFQVCCAGDYGVSSNIiAMTIVKAQGMSLERVAWFGSHKNVQTSH 

VYVAISRAVNSNYLVMDSNPLKTLLREPVDNTSAKHIV^ 

SEQ ID NO 86 CDS complement ( 633 79 .. 64437 ) 

/note="unknown; ORF 45; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 45" 

SEQ ID NO 87 / trans la tion= "MAMFLSDPSRTPPATPRMLPIPGAPRKKRTRRFLFAGSRTGLPV 

PPGYGGPPVIDMTAPEDVFDQDSPPTTPKTPDETDSHSENSDYSDVDEEDEPPVSSPP 
RIDPHARDGESFNQSGRLPTVITSTGATTPPSAPAPLTAFGGPRPVAWTGQHRAPQS 
SESDSEDDFFIDDYEDTDESGGEADGFSPRASPAWSGDISRSPAEGGWSSDEEEPWA 
GSTAGQETI 1 1 SDDDE VDDRGSVETWDESDADEGTGATDVIDliCSSSDSDDDADHVTS 
GGVRAACKRRASRRDCNGDDDVIYVGTTQGPKRRMTSTTGGGATSNPEGPGVSGRQTM 
AATPPVCGNDNYPWPWXjD " 

SEQ ID NO 88 CDS complement ( 64479 .. 65246 ) 

/note= "uracil DNA glucosidase; ORF 46; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 46" 

SEQ ID NO 89 /transla t ion= "MEGWLKTIWSKMSPEVLEEPSTQTLLLSDSWLEFLNLSPFLKQ 

KLAALLKRV^DMSNVTVIYPPIDRIMWWSYCCEPEDIKWILGQDPYHRGQATGLAFS 
VAPDYSIPPSLKNIFKJEIANTVPGFTAPSHGCLDCWAK^GVIjLLNTILiTVERGKAGSH 
ANLGWDWFTSYIISCLSAKIiQRCVFMLWGRKAIDKATLINGQRHLVLKARHPSPLATA 
HAATGSPWPQFLGCNHFKLANDYLVQMRRGAVDWNIN " 
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SEQ ID NO 90 CDS complement ( 65222 *. 65 73 1 ) 

/note= "glycoprotein L; ORF 47; similar to Kaposi's 
sarcoma- associated herpesvirus ORF 4 7" 

SEQ ID NO 91 / translation^ " MRSMYTLSLFITCGFFLITCCTGLWNPCCKIIPLSDFIFPEPF 

EIASFHLTNLALCPGLCTATLRYKADRSTTEICVNGFHLRAFFIRILYKLNYSVPREE 
LQLLNYMQYSLDEFLAEFEDFHINGSESGTAYTRPPLLDFSDRSTKVSRIRKVITRRG 
DLWRVGLKQ" 

SEQ ID NO 92 CDS complement { 65999 .. 67168 ) 

/ no te= "unknown; ORF 48; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 4 8" 

SEQ ID NO 93 / translat ion= "MAVSIPVKGWRETESNWRSIVTTFERKGNADRAIRSLLRFFKG 

vTDHPGFLASLVILKDVTIDSEKTIERTDLIPLLQGVRFVTQQIYKHLKDHASESPMAE 
IWRDCKERFCLALELACGCQRCASAARQLRACQQACRPPKLNPHKQQCVAARLLTAVY 
NQMVljRTRVSVSEFCLNALMC^PREFGFVSGDvRVETSRVASCLNLSVTLYLIIiDSY 
TDLTNLEMAMSRACRIHGIiSAGDPFYSAIiW 

DICASLTGPVPDVIKINMLPLVNDQMHPSVCVERANFTGSCPKVSPTHHLDGLKLETT 
S LTL AADSLDD ILQAXiEL ICGDDEG I LDS YI SD INTETE VDES S I EEE I VFEELS 

SEQ ID NO 94 CDS complement (67398 68303) 

/note- "unknown; ORF 49; similar to Kaposi's 

sarcoma-associated herpesvirus ORF 49" 
SEQ ID NO 95 / 1 rans 1 a t i on= " MSRHYGKDHIiLNHMYKFHYPPLGMIVGEMNTLTVNARNPLYQAA 

TLRVERALYLSKlLQVLMQHRQGERFIVPQCRSNMVYCLKELHKITNDRIRGtilNSVL 

PLVDAGCVGFDEELVI^VLPEIIjKIEYPHAHELLPPH^ 

EVKEMIDTFHDLSVPSFQYLASLVKKFFL VEEVIYEDYQDTQFNVFLNLCFFWTTVIK 
MYQSCIFKDKLLDTIKACIELLKGEARQFFGWYDLNTPNLGSSALVKYTEHLIRALSV 
DSSAIPIGEICSHLHHCKHALLNIiE " 

SEQ ID NO 96 CDS 68494.. 70038 

/note="transactivator; ORF 50; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 50" 



SEQ ID NO 97 / 1 rans 1 a t i on = " ME CAS LG PIS GL I AD LNLFNLFCLYRG SRVKTRGAATCNVP CAE 

CAQGI VI^ILTERALCCTEKMFIASACSGVVIPPQIjARVIjHDvYAEMKAKCLGAWRRLI 
CCRRPII^IADSVLvTYNTLDAEGKI/ELR 

VPDNYFTITGTAEKRRPVVTGSTSGMTCPGSSLVPDSLILPVCEPGLLPAPLVDLSNV 
LENPEIILSAPPLSQFVITNTHPSLPQSVSIITPTQGWPGQCFMDTWKAVSQSIH^Q 
AQTPILAAAIiTGSTSAAPGPHIACSPVAGTSRQVEGSAGVDCGKPACVPQPAIiPPNVP 
AK^METVAQLGNAPVKtrraiGGRVYAPLWIPIIDLTSPSGSGQSPADIANTPESRMA 
AGSPPFAETAATVPAKRKQPREDVADIG^LKGDWGAATVNHPFPGPSGMRVREQGLFD 
LIESSTDVTANASGPKNDDDMLAAILQDLYGLQSPPAIDSPSSNSDNEEIFPEVSPPS 
SGHGSP" 

SEQ ID NO 98 CDS 70355- . 70888 

/note="R4" 

SEQ ID NO 99 /translat ion^ "MPRVKTQPKRPQVLEFMPLDLHGGTHTEMDSQNLCPDGQDLLGS 

YIYTENNGPFSQIMHNGQSNTGTGESFGSYAAGDGFLGGSVSGMYGNNTGEGACSKRP 
SACRKRSAALIHAASEASVAEQGTSQGAHAVSDRIGRDGGADNRLLKVSARLSDKTKS 
ALRSHPCLRC YSLMFNT " 

SEQ ID NO 100 CDS 71468.. 72160 

/note="R5" 

SEQ ID NO 101 /translati on = " MGFGN I RLGWRLC FMVWVAX I ARGRS VC PTWHLTDG KYEAVYRH 

YLEECRKHEGSGSLDGSGQTKGSGTKATTEANISIRPNVVTSGQNKEPPGTAPRAESS 
HDLPRIKQVNALRLSTPELAQPLPWKSTPRESQSGGTPWNARPHAFIMHTNDMLNPS 
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WLSFRAIRARSTRDTEQSVRDRNTVTTSYRTPGRPSLFQARPSSHGARLPPSPRTMA 
RYAESRTICDQN" 

SEQ ID NO 102 CDS complement ( 724 O 1 .. 72 82 0 ) 

/note="unknown; ORF 52; similar to Kaposi's 

sarcoma-associated herpesvirus ORF 52" 
SEQ ID NO 103 /trans lation= "MSSTRPKTRAPKKELTMEEIiAAQVQKLSVFINKQLKXLINSGDPT 

RSGSDPVI SNTEKEAKIAAAVSALCNVATRKIEAKVRAATAKAVTRGQMEDALAGIS I 

RVDVSMDETTRGGIAASADGALRRRRAQSRTRNNDAD " 

SEQ ID NO 104 CDS complement ( 72 8 84 .. 73 198 ) 

/note= "unknown; ORF 53; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 53" 
SEQ ID NO 105 / trans 1 at i on= " MTGS I VXALALLACIj YLCLPVCATVTTSSTTGTGTPPVTTNPS A 

APSVTPSFYDYDCSADTYQPVLSSFS S I WAVINSVLVAVATFLYL.TYMCFFKFVETVA 

HE" 

SEQ ID NO 106 CDS 73274.. 74146 

/note="dUTPase; ORF 54; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 54" 

SEQ ID NO 107 /translation- "MAEVTAHTVPYAFDSCKFEIIPKlWSSRIAIjRNKFPVVVKPGEP 

LWPLGLKI I RAPQCAFFLS GAPTDE VY YHTGL I DQGYRGE I KL I VliNKTKQ WTL, YR 
GEVNVSLIAFMYASPGPLKCPILNLPHYSLDAGFDVTSPHAMTIPPTDRTPFTLSLYY 
KSPQLSTPHVPLIVGRSGLATKGLTVDATKWTQSLVHLRFYNFTKEPIDIPANSRICQ 
VVFIHEDHVPSGWNILiRSRVQIiGSTLQISWAKIRFTDVATLPKTHPXjNSRHTQSQTEP 
ETARGAKGLGSSGL " 

□ 

SEQ ID NO 108 CDS complement (742 07 .. 74 83 9 ) 

/ not e=" unknown; ORF 55; similar to Kaposi's 

sarcoma-associated herpesvirus ORF 55" 
SEQ ID NO 109 /translations "MAAPGSFWTCCGFSPFGRVGCQYRPLPDPL.NECPTHWRTEIAMG 

LPPGVDMGDVKQAEMCTAALRQTYLLAVQSNKITEYLRRFDAARVPAGCQETVRIQIS 

KXiKSIQNVIWNAMLSLAIGDITVDESAFHALLN 

TWAAEINNVLVDTEASSNPSHPVIRQPTPQLAVADNI VPDKIIQDAQADG " 

SEQ ID NO 110 CDS 74851.. 77337 

/note="DNA replication protein; ORF 56; similar to 
Kaposi's sarcoma-associated herpesvirus ORF 56" 

SEQ ID NO 111 /translation= M MvDEIRAIFSTSGDMAEVITDILTETQATASFFCVLHDRGDAPI 

NTPHAVIKLCLPAKRPGGGPRCLPLMVLNLPAWQWLFLTGDAPLTSDNIKDRIDLAQ 
TEEILEPILSVLACKRSAQQTKHDSFKSK^AWFRAKFVSALRKVYKMTPSPYWMITLL 
GSFEASFVLAGTFYFFQSHICTAETLVHLTRLFSSSQGQSLVTVNTYDELGRVFGRSD 
FLGIVPNFWAYLKYKMQQDDVESRAIDQTINSIRGGL.MLSPQDLVHFIYLSFYECMNA 
QTFLSYSRTTSSLPTPATVNPPQLCRRIiEADFKEHVMAYYNKASYLSTYITILTVPAP 
LPDGYENFQELACQYWCGQSRDVAEIMTRINDQYPQLNLiTKDIiSGLiLDLAAIjDQYSGG 
PKENLFTVASRIPTYRCEFLNKQYFVLMHADCIDAYWKQNIIVPEDAQLQGLTDQDLT 
SRIFYCDLGLSLPTFKQOILVSRHEYFNPRLPVTRWVIjDFDLKVTEGRRTLNDI YNIC 
VTLRQVIIjETLQLIGPLKPNHPVYFFKSACPAVTWPDDISDTAFCHCDAKIGMRI VTP 
FPSGYCLVGSAPLVSLTDILNRWKLDTRLASEYPGIIiEDKGPFDSGIYAKGRCVRVP 
HCYKVGPGGELSRKLKI 1 1 CHPEESDKS AYLKNAFKVSNLLHHAPGDS VTKNGHLVYA 
ITDENEGFLESKTKNNLPKTITDLAEKIERTTEKPLIDWAATAVWPKXiHDTIQRFFPD 
DRIGQFASVSFMHSGDNIIQVKPQKGNNFFCINHKHRNHTQTVRVFLTIiHSTKESEVT 
VTFMS QCFAAKCNHNS PTAHFS FMVP I TGT » 

SEQ ID NO 112 CDS 77578.. 78906 

/note = " immediate - early gene product; ORF 57; similar to 
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Kaposi's sarcoma-associated herpesvirus ORF 57" 
SEQ ID NO 113 / trans lata on= " MRYVFHALICF I GG I S S SDFDDSS SDEMDDLS PTPEPE PSTTPN 

SFPEGPKSQWALPKIRKRSRSETPVKIEHRSPLNRSRSRSRTRSGSGQRSNQSGRYV 
KRFKPTVDAPRHREPWHRGGKGKAPFIRRDAKAGRGRRTYGHDYRGKAALTRSIKESI 
KKMHLPSTMLSRAHDKKVFEGLLPRHLGQCFQVCLPAPPPLQPEVFTDRQLTAIVKSG 
GRRD AL VAKKVS LAKLTS L YKPLLTFA/TGRNNQ AHVJLATRKNTLAS AGLE AL.AAF I EE 
GLAWAQVCVSQNRSLNDSNLDIILDSSQSVCTWFISKIRHLHIQCFLENQGEVSLVKQ 
LTYLVCINNRLAEAANI^GEVKLNFKLGMLIGFALTLPALLAEHKLSGESLYLFRSFL 
EKYRPGDVMGLLNSIWEHYTKCRSAECVITTHAMVGSGENNKGLFFFPV" 

SEQ ID NO 114 CDS complement (79266 80513 ) 

/note="R6; similar to Kaposi's sarcoma- associated 
herpesvirus vIRF K9" 

SEQ ID NO 115 / trans la tion= "MATWRPPQSGGPSAMGLREWIVTHANLATYSGLFWADDEKTRW 

LATTTPWSVGFDYL-RDGKMYEDYCNQRNIPLPSGRSRLGQAKARIiLGAIRKSAYFIEE 
KDVLRRSFSFAlWVFRL,RSDEEMLCRIiCPRASGVAAELRGLRFRMFKRKGADEAGRVS 
EYTVKQLLGLIiRTRPAGTFTMTAPATEASATATASGEDGRQDNSQGGAVALPGEHALP 
LSASSGLSACLAPSVDDPWGFMHIQWYYGVLQAQTFTHSGMGWLSTRPTDKNEHHV 
CMAPGPIjQIjWIjPPAPYMDDDFMLSRIjVNALHAIjEDG IVLiCSCQYGIMMNGYGFIjNIjWF 
RGNTSNTSEPRRVPSGVGHRVFDTDEYMLKLAQSPRPSDPGPPDPFAQIWVSAWSL.YE 
EEDQSQAPICIWHQREIYRHFE" 

SEQ ID NO 116 CDS complement ( 80686 81933 ) 

/note="R7; similar to Kaposi's sarcoma-associated 
herpesvirus vIRF K9" 

SEQ ID NO 117 /translation^MAGRGVDIRAWLVAAVTCSG 

RSVWNSEKFFDDYCNMRGI CQGEKPSHYGRFRKMRFLYDMRHHKS IRELKF INKAYGR 
PGARYRLFRLLPEPWSCAMCNLMSSTETQCLGLISEFQYDQGGGSGRERRRVFTATV 
LARSRMDKNKRWEHRLPGAIQLTFLYFGETVGLERVHAGIRVCSRPCPVLAGHACCF 
QDERTLFLPSPGVVDCQFAREDLRV^HKKCEKGVLITLTDTGICVKNLENREMKVIiTN 
NEEE YKDLP S RQP VQVFDMVDYLRAIi ARS PN PGDE P PRD YAQ I ALCL S VQ S PNPVDAP 
IAMRLRYVCETSSVCGTEGCFYPGTTVTSEGRTDCSFQMEDPGEGTSQSHDPAVELGD 

' SGPDSMDDPDAGTSGEDDGVACS " 

SEQ ID NO 118 CDS complement { 82262 .. 833 17 ) 

/note="R8; similar to Kaposi 1 s sarcoma-associated 
herpesvirus vIRF K9" 

SEQ ID NO 119 /trans 1 a t ion= " MERPVR vTKPSSLRGWLVECCETGRHPGMRWVDEERTLIRIPWN 

HDRGSRGVEEAEKNIFIDYCRSRGILHAAGRELTAKECKNWLSSAIRHSQTVSDVSTK 
DNLSTPYPDRCRIIRLLPITVRSCARCDQASGTTAMLRGLREEAVNKFGPVGAGVRYT 
GAVGAGGEQCWMLRIMFYYYGDRVGEWTESSNGIRVLPLSERRPQGHICAAPIAEQA 
LVPEIPGHLAEFQAEALRFLDKDLLRGLAFWADPSGIYIRWLGHSLAFVQGNVESTGA 
VAVXSCANACRAFNLVTJYMTAMARTSPDGAALPQACVYLYFGGVPTPEGGVQSTVPLI 

IQLWHECLWRAIiSAANV " 

SEQ ID NO 120 CDS complement ( 834 91 84252 ) 

/note="R9; vIRF" 

SEQ ID NO 121 * / trans lat ion = " MDSGCYACILDENSEGIINYLEQVCGIGLEPGMPLPAPLPTLVP 

PTRSAYARAHRLGVPEAPLPHQIVPFWRLRIQVFYFGVLALDHTSQDRRGVRLHPRPV 
PHPGHLCFYGTGFTV^FPSPDREKlTAEQITQIKTMLVAYNEGIYvHGNETGVYVDNR 
NRETLYAAGNDCNGDIIQREVMFLSKQKIFNFMGFMRKLARSPGPESHAPCNGATLYL. 
SQQPGAQESPQVPISWVCQDELVQGHMNPSRWCS " 

SEQ ID NO 122 CDS complement ( 85052 86209) 

/note="R10; similar to Kaposi's sarcoma-associated 
herpesvirus vIRF K9" 

SEQ ID NO 123 / translation "MAAGESRRGPSRYGMALKEWLTFKADSGLYPGLFWADEQKTRLV 
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. LAATPPSFPNYDYQRDGQHYDAYCELRHIPLPSGRSRLCQARGRLLGAVRKSKYFEED 
KEFPTDQFPFTALVFRLRSSEEMSCPVCPRVCAIiRLELRNMRFAMLGRGMLHALSGPS 
VSDQERRYREGHQDGHDAQDDDAAYSSGLLRARLMACAAPSAGDPWGHMHIKIYYYGQ 
LQAELLTATGQGIRLSSKPTNKAGHHVCVLDGPLQAWFPPIPQTTESSWQRLEDALK 
WLVDG 1 1 FCS TSRG I MFTI TGG PNVWFQGNTVE P YS LPHRAYTGMHVWAFDTDR YLLD 
MARSPSPRDTGPPAAFVKLV7VSGCSLGEERNSSRAPLSIIVYQTEIYRHFE" 

SEQ ID NO 124^ CDS complement ( 8 63 55 8752 7 ) 

/note="Rll; similar to Kaposi's sarcoma-associated 
herpesvirus vlRF K9" 

SEQ ID NO 125 / translation ,, ^1AERDMDI J KAWFIEAVESKRYPGVEWDDEDKTIIRVPWNRCTDS 

RVDEDYNKIFDDFCSARGVCQTGSHAQKFKKIRMLYAVRSHRYLRELTPPSKAGGVSG 
ERYRLFQLLPEVTVGCDLCNLIATTSLHSCSMGSCVREDVFERTRRPRAKAPLRVSVY 
KRKSKRLQDSSAQPVLGAVEVSFFYFGENVGVQILRAGSGVRICGLPDPKRPGHLCCA 
DNPLTCFLP SSQLIPCE FARADLQALQKTCERGL I CVMTE SGI CVKNLEERNMT ALTN 
YSENYYELRPSQPLQAFDLLHYLREIiARSPTPGDVPPRDCAWIFMCPSTQSENTWDAP 
lALKLRYVCNDDVSDDVSNGAAGDDSGDEGPSGAGVGASGTTGSTSVSTLAPYGRK" 

SEQ ID NO 126 CDS complement ( 87 8 94 8 8 961 ) 

/note^="R12; vlRF" 

SEQ ID NO 127 /translation "MAEGRAGSIRVNRPSGLRAWLLDCCDNDKHPGMHWLDEEKTLVR 

LPVmHLKGAGGVSDDERNMYLDYCQFKGIRQTGNRRLSVRECKNWXaASAIRHSQTVED 
VSTEEWLSAPAPNRCRVIRLLPIFVRSCPLOSIEADATGGML.LDVRNEVTARFRYLGAG 
MEYEGAVGGDGEQCWMLRLWYYYGRLVGNMEVGSPNGVRLLPAPKRPLQGHVCAGIR 
PEQALLPHTPQDMFPHQTSMLKWLGKEIIRGLMIYADGSGIYIRYMGHVPAFLLGNGG 
SLEPVDIINNARVLRVFSliAQYLSAVSATPPHGTRFPAAYASLHLGGVPTPEGEPCPT 
I PLS I QI WHECLWRACGDAAQ " 

SEQ ID NO 128 CDS complement ( 8 9122 9 02 16 ) 

/note="R13; vlRF" 

SEQ ID NO 129 /translation^ "MTEIEITHNHLRRWIISNLEANTFPEHLCWCDEEKRSFRISWHR 

GMSGMQPWAYCLDRDLECGRQHNVSECRKRLLRVLRENAGFEQDDARATTTRFGGER 
FFYLRPAvDPLCYACILDSHSETVLNYLEAACVHGLEPGTPLPPPAPAEADGAARSVY 
ARAARLATVAPPHPDQITPFWRLRIRVFYFGSL.VAEHTSQDRRGVRLHKRQDPKPGHE 
CFYGTAYKMWLPKPQLDGPLTPEQRETVCEIINGCEEGVFLHGNELGMYVDNRTRHTV 
RCAGND AEGNHAQRAVRS S VKS QI F YVMGLLRRL ARS P VPGDTVPSNAVTL YLGGRPG 
S SKRPQ VPVTL VI CQDELTHGD IRAAR WIL " 

SEQ ID NO 130 CDS complement ( 904 62 91544 ) 

/note= "unknown; ORF 5 8; similar to Kaposi's 

sarcoma-associated herpesvirus ORF 58" 
SEQ ID NO 131 /translation= n MGTYTSEASIiAWLSFMSGTVSASPFILCFIYHSLYFVEPLISVE 

NlIFSWGAVGLHGLLliIiFCIFGLPAWLSRQVDVPCTISAFLITAGSMASTLGVDLPWV 

HVS I FVGSCLCLLLCVVAANDVVYLCPT I AHRYY 

FLLPLVAFIVGGVCSLRALRSHPLYEAGLQRRHAIFSLTSRRYITYSIKQAIiEVCGWD 
FYIiVTVLIGGAAAGTLSVGLTTPLLLGLVlIYFFVFH^^ 

AAAVXLTLTHVLGPGTHNLFTRVCVFTVFLLTMFGAIGCELQIIRKKLQRAANSPRIV 
LGVCACGNLLMAWFFSLNKVELGAL 11 

SEQ ID NO 132 CDS complement { 915 55 927 3 9 ) 

/note= "DNA replication protein; ORF 59; similar to 
Kaposi's sarcoma-associated herpesvirus ORF 59" 
SEQ ID NO 133 / trans 1 a t ion= M MP VSFHYGARVD VDALGS I SRVYDHI KGI VKKGVI QI SGQGRAP 

VliSVXSSVGDAGVLGLRLKN^APLMVYSDMTDEVSFSFRNTSLGNTFTHTREMFGVN 
IAEMNVAFYHHGDESDAEGKPQFWTTIAYGDKmTSTVHKSVVDEPNLPSFHDRLEQA 
GTGNRLFLTVKTLTLLliKWLRQQKTRAKQVVTVSLSETIjAVATFTVDGVSKII 
TPDAKWTCARGRKLDVGWSSDLTTHVSLESLVAAIiNACKIPGFFLPGFRWHANEILE 
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VEGLPLTDSLADVRLGVMLLKVDPTDRimAVPGNLSEGADPEGVPELPSPPRTPDLDL 
KEQCVPIAEDGAEPTDGGAKSLRTSGSRPEKKHGKRKHSSSPSRGKGKTKTPRATFNP 

LF " 

SEQ ID NO 134 CDS complement { 92 86 8 .. 93 8 12 ) 

/ no te=" small ribonucleotide reductase; ORF 60; similar to 
Kaposi's sarcoma-associated herpesvirus ORF 60" 

SEQ ID NO 13 5 / trans la tion= "MFGLSIVTAAMESPDRFLYASDHPGFLALTQETWQNRWFPSQIS 

LHEDSDEVRLLSPTDREFYQFLFTFLGMAESLWFNIEDLVKEFSNHDVTHYYAEQVA 
MENIHGKVYANII^LFFGGNRGDLMIYAKKIVEDATLAKKIDWLHSRVRKATTRAEKV 
LLFLVIEGIYFISSFYSIGLFRLRGIMRGVCLANDYISRDELLHTRAASLLYNTMISR 
DESPSVAYIHGLFREAVEIETLFIRSKSRDWMVNVGDIEQFLQATADRILKSINIPP 
LFGARPPNACPLSYTSAKSVNFFERDNSEYVTSVHNDL » 

SEQ ID NO 136 CDS complement ( 937 94 9616 0 ) 

/note="large ribonucleotide reductase; ORF 61; similar to 
Kaposi's sarcoma-associated herpesvirus ORF 61" 

SEQ ID NO 137 /trans la tion= "MNTETSFSAAKSAKPLTLVTDAGTGGCSSSLDPERCAESLVNSL 

KATLGWDIEANSLTGLLWHRIMEDRCLVTV^ 

LLQDFKQSKAYTNFVNCGYLSAVRFYDTYVLRTQGSSPIFESVAQMFMRVAVFVACQC 
IKFPCLRETI1RHL.VESETELDEMYLVGYAFHYISSQIVCCATPVLRSAGLRGGQLSSC 
FILKPSMATEDKTLKALHEEMSPLLASKSGVGIDVSSFAEHKNITSCLKLINAHVGYF 
NDNNIRPVGASAYMEIiWHHQICDFLNAKMPENQERCHNItFQGVCVPELFFRLYETNPD 
GQWHIiFAPEVAPNLLKLYGAEFEIEYNRLVAAGKHSSSLPLKSMMYALINTVIKTGSP 
YVLLKEAJLNKHHWCETQGSAINCSNLCAEIVQQPEGQASVCNLANISLPKCLRPHRGE 
SGVEPGKGDVTFGFELLDDAVEAAVIIVNACILGGTAPTESVRRGQEERSMGIGVQGL 
ADVFAELGFGYLDAESAKLDVEIFQAMYFTAVQTSHEIVLLGEGTPFRGWERSRLAQG 
VFHWQTWDGVKPSHPPLERWEQLGRSIAQHGIFNSQFLALMPTAGTSQLTGYTEAFYP 
FFANIASKVTSKEEIIiKPNV^FFKRVKPGDLRTVRRYGGDVASFPEPLKDRYKIFLTA 
FDYCPIKQLERAGARAPFVDQSQSLNFFLKEEQATRASYIRDLLLTGYRLGLKTMIiYY 
CRIQKQTKLNALQCLDQWGDNISSEGAESNCVQKADGERTKVCLACQ" 

SEQ ID NO 138 CDS complement (96163 9715 8 ) 

/note="assembly/DNA maturation protein; ORF 62; similar to 
Kaposi's sarcoma-associated herpesvirus ORF 62" 

SEQ ID NO 13 9 / translation = '-MKTRDANVNKLNDSLMRLLPPPPHRVSLSRGRDFSKGVRDLLSK 

YWSTTTGVEAIKDGFLSVSPTCQTYGDFLIYSQTMSSQEPRGTYLFSFKQTDTGSSI 
DMLFTPTSLAR1.SRMDADSAPQTNRIACVWYGHESGLLDAIPNFEELLETGSLHQFLA 
PVGPLVQTVHSTFVTKVTSAXiKGNVVAREPv 

PLPAHSSIYVCLTYIRVNNRPALGLGFFKSGKGYCEIAAQLRDFYSGVIRTKYIQLQN 
DLYlNRLAFGWCRLGSVPSGLQPSFQSLHFKGAAIiPVLKFTEFVSNPGSWKLFL" 

SEQ ID NO 140 CDS 97157 . . 99976 

/note= "tegument protein; ORF 63; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 63" 

SEQ ID NO 141 /translations n MASSIPAARADNGDENTGGLYKliTDNIjLTCTGSLQQLKLLMEFQ 

LKPLPTAHLLSMPTVTRFLNTAFKIDNPLVSFIQKHPVFFLMRVARLPEPVITDHQSA 
ETS TG ILS E WNVLNTAI RKPHES PAAKDND YLDNRAI LAMITE Y IHHVTSRTPSGIP 
PTPPMGISHLPCVEQILHETHRQYWNLTLPESLFIDIGEVASPLQTWLILSYCKKLQL 
APPPLFPPVDELARRLVTGHHELFVPLSTSLETYITMPVSKQRAFEIYSVFAKSKNIV 
DGTPILAFTDTELTTFTPELLFLYDFVIESLCKNQAYGCSRNAIEHFIKKGIDFMAEL 
GAFIEKTCGYRSTVSLSNVRAVKARLASCGLSKEACEDFRAMILMTPHETTPKWENFT 
DFLEMVNQLTLYGFYFYECLNQYSPTSISLAKIQNILNRVDAEQSDRALWRTPLIGSF 
PF P WKLNNVLAFFKPS TP VATLQKI YKAI PS YLMRS LFE I AANKS WGN I ALAES AP LiT 
D I QTAE PD QG P VS AQVI AKYCS RLQ I S ATD YD AA I VSSPGFAAEF I KTKLYP I LSEVL 
RNTS KKNRSL FQ I RWL I VF AAED ARDL AP I RRS L ALAY FQ I MD I LEEKHS PES F YNLL 
DYLQETFRCIRQVIPEATCPQEFLQYLFTFQNIPIAASFIQTSMTFVDDLKNGIPGIL 
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DLVSLGAAFYNMKLLYDSTLDTVEIPTEEGQPIVVSMFVFKSTIRVLEKLLQEAVIAL 
TQTSEPMYA7VHIRLMQHLTYMQKIAGHEIMTTQLPSVFHEIHEGYLQCFKRFKRLMLH 
VTGSCCYSLTRYFGFLYQPPLIPDTIVQKILNFNDKTDTTDDILKSLSQPVRQGPLSA 
ENES S SRLS K3sTLTVT)LLQKLYDDFRTASTNNNPTS IKLEYSGNYNETQVS VCWSTYNL V 
TYTAPDDTLKFTPVNTEALDRMFAE " 

SEQ ID NO 142 CDS 99980 107 62 6 

/note=" tegument protein; ORF 64; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 64" 

SEQ ID NO X43 / trans la tion= "MELPPIFSKFKIEGVATTHQADCRFGQYAGSQCLSNCVIYLAQS 

YFNRESPVTDTNDLDDVLRQGATLDFILRRSGTLGYWQYAQliHHIPSFIKTNEWTAAI 
FQSQE YFGL I GLDAAIREPF I ESLKS I LTRNYAGTVQYFLFI CGDKAG AVI I KNKTF Y 
LFDPHCVPHVPNSPAHVISSSDPTAILEYVSPPDREYTGSFLYIMPSEYVNPEHYITN 
HYRTITFAKVHGPHIDISTGIEPCTIEDIPSPPRSPDVTSKSSNLARVPRTTTDTSSA 
KPPPATLSGLRGAEPPTSYPDPATNDADTKLIiTPAPAQTAVDHPEFQTTPGATLLLSE 
LSASRGRKRKLSSLQRYSDSDEASSDDEGAPRRRVHDDAISAEVIWMDDDISPLYSPS 
ATPSFDDVFDSPPMSPEFTYEDATEDTDGAFLEQIARDAETPFSAFDDLITDHDFSSI. 
DKKIEQL I KYE APS QHLPNI SDKQNGRAVRE AAALQAMDKI MINI I LEHGLITDAQAR 
GPSACKNVLQFFILWGEKLNIPISDAKQVXjELDLQLIPLHTAISEGKFKQGAFKKHLT 
TKINRCLASMRATHADAQKKLASAFNVEGSQISSSEAKI SVRAIjKEQIANHLSPGFLA 
VYSADEVKHLRDKIQDLKTGIEQRNKEIQQEELFFDAMLTALDTFQPPPKTAFPMEIF 
PHRKTEVMLDHLASITTRLTEDATEALNNYLETPPDQGTHITNIPNFSSIVANIISTL 
KILTYAENDMQLNVTPMATYRRQLLYLGGEI^TIFNLEWPYETVPPVQELPLVARAKA 
KMESVTKMEKNQQALDQILGDAJETLLDT^^ 

GSSRNQRFEKLKJ^IHDLASSESFIIMLLNNTRLDNISDNLAKIDGILTNNTRFLSNA 
TVSKTLOTLGGSLIRECVEALNKRSPSSLNNARLLAVQTILGHASVPDHETLTRIVSG 
VAS AQKESAGDDPDRWTR VTGHLNELKL VTTQSR VDKATRRKLI.MI ITRDLKEAEVSQ 
ETVXETRWQENVLKFQPSTSKEIEDFLQSAPSAKARKFAEKHLRTIilTQFNGHERPPS 
EATAVPMDYTPTPIPTPQAVSTATAEKGKAAWNKIQQAFQDFNFHLIDASDWQEMASE 
YSRHGSSLPGTVGPKXWFMESISNTIiDDILiTQKLASLLPNGPAFRPPAFDWIAPYQT 
RVNAFLKTIGLPMVWLADKIHHQCQTVSHAVQSADLQQATVGTSLERPAAEYCRIJuS 
DMQVAFNDHGIAVRSEAAAYTDAINSPANWTPPKPNLEAPKKLITATDALTVEDFPD 
FLKTS I LQQEQRL I ALQRAE FQQLEAS I S AAERLRQSTRDE I AGKMATAI TQL.L PRAP 
VAISSRPLNLSKPIDFLSSTVYDKILDKEPYETAIAGFAWIiEIATKSVlWYSQQNETQ 
QLNVLLSEVEKQSTVAQRLHDLELSAKNTDDVK^ 

QKLESIESLLRATRTAGEISSELERIGTQAVGTITVRDLGTL.SDQCREAANFLRQASL 
PEGFSDIGTKLSELQAYIKYKKQFLEHFETTQPNVFQRFPLSQNITENVPARPAMDSV 
ARLTNHLHVRGSAPHFTTWIETLPTVDPEKPTHVPAHGGAPLHRQITYSNVLEALFSL 
CSTTLTPVPTAPGLEIATRARRGAEAATWMDRQWPDIAQTLQDVLDTYEHTTAHANRD 
AAFNTFLAMCVFTQIIRGASRAWLPKLPSTAVDFPEEIVLTPRECTTLVTAMWPTLA 
AAILRLKSYSEALGLMSRFLPLMFQALPHLTLEAQVKNGPHNTPPQLRCFAKTEAIPY 
FPAQWQSANLEQSLWGQTDFLQICDNNQRKARVAAVTWALTTIDGWLDQLWSTFKPM 
TAASDDTYVDLVETLHLTTFGPRGPTPRRETTTEHPPYEYGQPTGYCISGQSTTPVQA 
SNTPVSAFEAVLGAMVFHVPIRIFLAATPKRLGQARGGMGIiLTPILECVPDVEPFKSL 
YNAPRKPVPIETLPASLHPHDERQVFLRQAQWI.SYRFTPHEAARSSTPPLLWIDPEN 
LVTATYSSGGPANFESRPFYVMPGPYPPDWPKTLSVTSNTSVTHLSHDEICNLFTTLS 
REHGTVQGRDIFAAAPTNVTPEQTANPPAWETDNRLITQTETAKKPHIIPASPKARTD 
PPVETTTHHSQGQASQHANSNVNQPGQITSHASRNTPSTAPQASSSPEKFNTQTVPRL, 
I SQTSETAH I NQPASGQVTEPKG I FGTYKPRVLtTEPAKPANAGVASRQPEATTTVPKL 
PINPPTARVFIGTASKLSPAVEESHGATPDAHQSKIDREKYAESRPRRTPHLEEGPRE 
PHVNTPTSAHINVPSSQGQKTVHGRENPGLQTATPSAPQPTASNPRIQYTLPRTDGRL 
LHDESEVESTPTEEVKRSPKTQDVSHGPEPDDSRWTAPLGPTIEIHRLEHPQILKNIT 
SLTVPTPRVTPIPPTNIWIPLSHWIQHEEITRAKNVLMRFIQNVRRKLQASSDALSE 
AIARIKFLYL." 

SEQ ID NO 144 CDS complement (107637 108146) 
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/note="capsid protein; ORF 65; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 65" 
SEQ ID NO 145 /translation^ "MSSLRVKEPIVQGRLEHDYPNHPLVAEMNNLPQGDMSPAQYAIA 

KRNYLVFLTAKHHYDMYMQKK^GILRK^HLRGLRGKKDASSSISGVLSGSGSAAPSVA 
PVASTLGSNSFTTISSGPHSLIGSMGPAPGGGGPGSVASSGIGSTSLSPSDATTLDTR 

RSSQNKKSK" 

SEQ ID NO 146 CDS complement ( 1 08152 109498 ) 

/note="unknown; ORF 66; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 66" 

SEQ ID NO 147 /translation "MASGRLPNIAEDEAACHGRGSYPAHRWIJ5GSRLGLDIAASIRSI 

GLCPECYVCFVTYGLGAWDGRPPKWACTL.ISAPSFQTALNEIATGWRPDNPPKNGDVR 
SRLHDIGRSLLEAYAWVLRCICTGVGCPSDEGLSLTAVPRSAWSRYLWSFQRACCLV 
CKTLNCRQRFPLVTCLPQHALDIjPvXiRKKWNGGGCVSMQLNVPSISRRIjGANLNESVP 
GPSDAGLLASLRELAPTVPCGNPFNAIiLRSLTFRALLSMSRWLPIGESTETEISRDL 
GQKVIAYNVLFPCISLPWSQWARSVLEKTVPAPRAAA/CLECGYCLNFGRGKFETVN 
FPPTNVFFSRDQKEKQIjSICATTGR VYCSYCGGSHMRVISLFEITCVGDPYLRCVLAN 
NAAHAIRDANSLVSVyVPCLASPDCATGLLKHLRVAELFYLTSSISSLSCGKCNRS" 

SEQ ID NO 148 CDS complement ( 109524 110198 ) 

/note-" tegument protein; ORF 67; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 67" 

SEQ ID NO 149 /translation^ " MSSGKRLVDELCDLWSYLGPSGISLDLERCQDGAPVYAKGGAV 

PVCTVRLQHGCVYHLEFVYKFWLHKLERIjAYPFAPCFVIINNGItATTIjKCFIjCKPRDA 
DAQFGKNLPINSDVYLERNSSVSLGQDDFMKFKARLVFSGDLNVYSSMVICRTYFTEH 
RQVXiQFLVVTPKSAKRLKTLLRTVFALTGHSDGLGAIiRRTGSVARPSGSELKDIGRGE 

RAAMTN" 

SEQ ID NO 150 CDS 11060 9 -. 111982 

/note= "glycoprotein; ORF 68; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 68" 

SEQ ID NO 151 /translations "MFVPWQLETLMRHWPSLRGLVEQSFLPGTPDGAFNSPVLIHTQD 

SLQPASSCRVCSLLFTLVRTFPPPDSFFEDYGWLCLTCLYAPRSWTATLMVAADLLEL 
THVYFPQCVKDGPVYTAQSILGIDVQLHFFATRCFRPIDREQILHTSHLNFLQTEFIR 
GMLEGTIPGSFCFKTSWPRTEKDDQQPTVACCSVGRGSHTNRDNRLPEDLEEAFNSTN 
AEEKPSLLGVFSATWAESQLLGSDTQQADTHLQPSAFPTPEDADQSQGPCLMHPTLNL 
KTKNHTAS I C VLCEClAAHPDAGPVLKDIiRRDI LENMENNVKLVNRI S YILNDPDSLS 
HVRDEHLRGLIKRCSAQEIHKHFFCDPVCVX.NTYSHCPAVLFKCPPPEKYKKLKARLA 
TGEFLDCNRIFDCETLQTLAVLFKGSQLAKIGKTTSLEIIRELGFQLRRHNIQITHPF 

QTSNLYI " 

SEQ ID NO 152 CDS 112 004 112 8 97 

/note= "unknown; ORF 69; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 69" 

SEQ ID NO 153 /translation^ "MPKQPRSRLASRAPYAPSVRRPDGPQSTRPASRHGSCKSEIMQW 

KKLVSDTQFFSALTRRHELGVDFLREMGTPICTSKSVMLPLNLKTIAPGRCVSLSSFG 
HSSNMGFNCSSCTPTDRSAVSLDANALGEDSARKNSELCSVALTFYHHAEKWQHKGF 
YLSLLSHSME WRKS FTQPGLLYAHL V LKTFGHDPLP I FTVDADERLALWAVFHTRDL 
HLGETSLRL I MDNLPNYD I TVDC I KQTY I MKFTP S RPDN ATVTVP VNS I CE AVATLDC 
TDE FREE I QRGTAI INSQGLL " 

SEQ ID NO 154 CDS complement ( 1192 11 1197 3 5 ) 

/note="FLIP; ORF 71; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 71" 

SEQ ID NO 155 /translation - "MFPHKRLVDFGRHLEADDREAVXiWLFDRPASDDTPEGFANGLCP 

STGEPGIPLPVLLEAVFLVGRLDLVSTFFLLDVGFI IERLRSSPSYFSPYKHLMLS ID 
RQLSERDVKNLVFLTGDQLGRRRNQSPTFFRWLSQMEKAALVSPSNYMVLSDLLQAVS 
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RRDVAKWAANAPG » 

SEQ ID NO 156 CDS complement ( 1197 94 .. 120558 ) 

/note="cyclin D homolog; ORF 72; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 72 " 

SEQ ID NO X57 /translation- "I^SVGPVPTGTIDPVTjYQDRALSNLLAHEASFVTSTACYGTIQT 

EVWGMRVILGTWMRSVARAHQADASVFPIiAVSILDRYLECRSIPRRRFQRLGAACIiF 
LAGKIRDLNPFKAAFLCFCAAEDFSVADLLKQEKSILKALRWKLEAVLPTDAIGPTLF 
KSGFTKEQLFALHSQWESVHKAIWPATGGLPPSLVAAACALFSLGAAAPPPARLAE 
AVGVSAATLAAAAESVATTLREFDEDHILSNARGSS" 

SEQ ID NO 158 CDS complement (12 0866 122212 ) 

/note="latent nuclear antigen; ORF 73; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 73" 

SEQ ID NO 159 /translation- "MWGSRQHRSGIVSGHGLRSSCRGHCGRRGGTREQAGRRGRGRGT 

AAPAAAPAPPAPTTSGPQVRAVAEQGHGSDTETATESRHGSSQGSPSGSGSESVIVLG 
SPTPSPSGSAPVLASGLSPQNTSGSSPASPASHSPPPSPPSHPGPHSPAPPSSHNPSP 
NQQPSSFLQPSHHDSPEPPEPPTSLPPPDSPGPPQSPTPTSSPPPQSPPDSPGPPQSP 
TPQQAPSPNTQQAVSHTDHPTGPSRPGPPFPGHTSHSYTVGGWGPPTRAGGVPCLRIiR 
CTSHNSHEDEAPERQQEQEGEERQQQPARPPRPPRPPRYPIPIPYPSSEEEVPRKYRP 
QRRFYRQVLGPRIDPPRPGPWCHGVIFCNSDPYSIiYRLARCLQFPGIRASSVRVLPDA 
PGSPVIPAFCITVFCQSRGTAKAVKKARRRWERHHPSAPHFQASIVRMDRGLPIQH" 

SEQ ID NO 160 CDS 122 866 123 62 7 

/notetr"Rl5; similar to Kaposi's sarcoma-associated 
herpesvirus K14 and ox-2" 

SEQ ID NO 161 /translation^ "MSGGITLTLLLATLATVRCAIjQTHYAAVPVHSTASIjGCVLTTPH 

DVXilVTWQKQESPSPVWATYSSEAGTWQPPFAGRVDIPEHKLTRTTLKPFNATLED 
EGCYLCIFNAFGVGKLSGTACLTVYVPLSMSVTFYPPINPTQliVCRAEASPAPSVNWT 
GVPPELCSEPEVFPRPNGTTLWGRCNVTSVDPEDLENATCLWHIGGIi/^AARPIiDPV 
FSDPIiEGTSHYWGWAAAAVLGIFLTGVFLYRSM " 

SEQ ID NO 162 CDS 123 924 124952 

/note="G protein coupled receptor; ORF 74; similar to 
Kaposi's sarcoma-associated herpesvirus ORF 74" 

SEQ ID NO 163 /translation- "MDALNNNLNLLMDFLSNYSNSYSSYDDNMSYTLDTESTLCRLTV 

VFPPTVYAIICFFIFCITLFGNALVIiYIFFKFKALANSvDVLMAGLCCNSLFLCASFL 
FSWLLYVAPQMLTSATCKVEIFFFYLYTYFGVYIWCISLIRCLLWFSRRPWVKHGA 
SGFLCVCVSLIVALALSANASLYRTALRHPETSEWICYEDAGEDTVNWKLRIRTTSAI 
CGFLVPFGLMVl^FYGLTWClWKSTKIARKGAvRGV^ 

LLRTGFLAETCYLRDVISVAMHICSLLQSMYSAFVPVVYSGLGSLFRRRVRDTWSVFR 
CFSTSGSL" 

SEQ ID NO 164 CDS complement ( 12 5 057 12 8 953 ) 

/note-" tegument protein; FGARAT; ORF 75; similar to 
Kaposi's sarcoma-associated herpesvirus ORF 75" 

SEQ ID NO 165 /trans lation= " MAQRTNPRWAAAAIjSPEEEAFIHDNSDAESVLALVPEQCFSEFL 

LWLVTRPSDNFDNDDDDPALGVIWHLLAPLVNYAPLETRSAHLQGHHTISLPYGPDLM 
RQPTTRSSEIVQCLRDSGLDRTLRLEVGRHLSCQTRRFVADRVPPGTLAALTLGTLVE 
YDVRVQRQLPVTLQSTAWRPLPERDPICAAVMLPLQRNILPLAVQASNGNSYTVSRYA 
WIARRSYSCVFQRLPCENVTHIADSFTHLHSAIQTGAGALQNILFHATLLPGGEIRSA 
LCGFYATTPSVGAFSRARHRAINTTATLHCQQLARTGTPVLGGFLKTVHSATTSEANV 
ITTTSLLSCVPQAYTFIiRRSLFSQPIICLGSFEPVDGDGNQRSIjYLGSAAGITRITQT 
LSLAYEILEGPLFTSINRAHEPASVIGHLGALVSRGGLRLFVSQLPPTILSQLTATPD 
ISRETVNDILWKFLNVSACVVFAVLPRDTEPEPGPLDAIRRAARICGCPFAVVGETC 
EELGIQFVNDLELWNPGAWPIRQPTSAEVIATFGFDEQPVSSNWLVRPEEPEDGGEQA 
PSPTDWGLFRLASWDQLLRCPTVGSKEFVTRHVDRCSNGLVAQQCEVGPLGRPLSDY 
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HIViraTSVFTDRMARVPIYKPQPITRQDATERLVSPETWVTQGRGRNRWVGQCVAYGE 
QAYKMGINAAVGARYAICEAVTNIMLAHVRRLSDITLTASVGWNPEDDQAWLLQHTIjF. 
ACKELCRDLSINFAITSAGSTPCLSEELISATQQHQTVAPVPFNAWITATAEVKSSR 
QRVTPDLKATGNLIVX.VSFPVPHLTQGSTFEHLCLL.PSPTLPDVQATHLANLFMLTEA 
LIiSRGLWSGHDVSDGGMWTAIEMALAGNRGLQIRIPSEETPLQWLVSETPGVIFEI 
QPQHVDEWQACQNFDCRATVCGTVGQEGLSERIVISHNNDEVYSQTLTSVAANWTSF 
SDEQWYSWGPSFTPAQELYRKXJYGCNQHNLGHLAEVCRNSELTLFATPSRPPAVAALV 
TPGAPLPRALMAAFTNVGFDVAAVSTDDLRGGNILRGFSGLTIGGTVGIEDSYVGARG 
AIMGLLNDPGCYGGLMAFFRRADTFSLCCGEFGFQLLGALGLLRETPHDTPGPKTPDQ 
WDIHLEENASGNHECLWLNLHIPQTTISIMFRVLRGLVLPGWANGRYLGVRYPRDAME 
YHLNQQQRIALNYHTGNADPRMFAQHYPRNPSANSAVAAITSPDGRHLASLVDPAVTF 
HPWQWAYVPPELADMTVSPWALAFQSIiFIiWCIRNRQ" 



SEQ XD NO 166 
PCR primer 

CCTATGGGCTCCATGAGC 

SEQ ID NO 167 
PCR primer 

ATCGTCAATCAGGCTGCG 

SEQ ID NO 16 8 
PCR primer 

ATATTAAACACTCGCCGC 

SEQ ID NO 16 9 
PCR primer 

ATGAGGGGCCTTTTCGTGTGC 

SEQ ID NO 17 0 
PCR primer 

CTGAATCCCGCTGCCAAGGCC 

SEQ ID NO 171 
PCR primer 

ATGTTCCCTGTCTGGTTCGTC 

SEQ ID NO 172 
PCR primer 

TTACATCATAGCTATTGCGCG 
SEQ ID NO 17 3 

<213> Macaca mulatta rhadinovirus 17577 
nucleotides complement (23 3 98 - .23668) 
SEQ ID NO 174 

<213> Macaca mulatta rhadinovirus 17577 
nucleotides 25065.. 25368 
SEQ ID NO 17 5 

<213> Macaca mulatta rhadinovirus 17577 
nucleotides 25518.. 26525 
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SEQ ID NO 17 6 

<213> Macaca mulatta rhadinovirus 17577 
nucleotides 114979. .115383 
SEQ ID NO 177 

<213> Macaca mulatta rhadinovirus 17577 
nucleotides 115385 116413 

SEQ ID NO 17 8 

<213> Macaca mulatta rhadinovirus 17577 
nucleotides 13 173 1 13 1926 
SEQ ID NO 17 9 

<213> Macaca mulatta rhadinovirus 17577 
nucleotides 132333 133719 
partial terminal repeat 
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SEQUENCE LISTING 
<110> Oregon Health Sciences University 

<12 0> Cloning of Rhadinovirus Genome and Methods for its Use 
<130> 53683 

<140> PCT/US99/26260 
<141> 1999-11-05 

<150> 60/107,507 
<151> 1998-11-06 

<150> 60/109,409 
<151> 1998-11-20 

<160> 179 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 133719 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 



<400> 1 

gatcgggaaa 

ccgggtacgg 

gctcgctggc 

gcttgctcgc 

gctcgctggc 

gctcgctggc 

gctcgctggc 

aggggacagt 

gtagggctgc 

gtagggctgc 

gtagggctcc 

gtagggctgc 

ggccgcttgc 

cagctgcgtg 

aggcacaaag 

agtagggctg 

tttccagcgg 

gctctgtgcg 

cctgcatacg 

tcttggttgg 

tacaatacac 

taacattgaa 

gtgtttaatt 

tgcaacctgt 

caccacatcc 

agcgaagcac 

gactggtatc 

caacaacgcg 

aacggtgctc 

ctggtggaaa 

acggtactac 

tctgttctgc 

cttcatttac 

ggcaaataca 



acgcgagggg 
ctgcctgcct 
ctgcttgctg 
tggcctgctt 
ctgcttgctt 
ctgcttgctt 
ctgcttgctt 
agggctgctt 
tgggctgcta 
tgggctgcta 
tgggctgcta 
tgggctgcta 
ctgctactag 
cgagggcgtc 
cagtagggct 
ctggtttgtt 
agggggtaca 
gttttgttgc 
ctccacagca 
taccatttta 
ctgccaacgt 
ttttcacatt 
ttctctacac 
atcggtggag 
cggagatacc 
acaatggttt 
tgtgactccc 
gagtaacaac 
ttcatacggc 
cgtaacttta 
aacaaacata 
gtttattggg 
ctttaaacct 
acaaacagct 



agcgggggac 
gctcgctggc 
aggggacagt 
gctgagggga 
gctcgctggc 
gctcgctggc 
gctcgctggc 
gcttgctaag 
gtagggctgc 
gtagggctcc 
gtagggctgc 
gtagggctgc 
ggctgctgtg 
cgagggccag 
ggccagggga 
gctaacggaa 
gtgcacggcc 
taggtactag 
tccgaaacac 
cgaagtttgc 
attggatggt 
acaccctagt 
atttctgtgg 
cttttgcctg 
tacttgctaa 
cgcaacaaca 
aatgctacga 
atcgattttc 
tatacttacg 
agaaacgttg 
catacatttg 
aatgaaaaat 
acaaacgaca 
agtgtacaac 



aggggacggc 
ctgcttgctg 
aggcctgctt 
cagtagggct 
ctgcttgctt 
ctgcttgctt 
ctgcttgctt 
gggacggtac 
tgggctgcta 

tgggctgcta 

tgggctgcta 
ctgctggctt 
cagctgggag 
acgaggacac 
aacagtaagg 
aaaggggagg 
aagttacaag 
tagtaacaca 
gttatttcta 
tccgttgaat 
tccttttgcc 
cttctcatgc 
aaatgtttgt 
caaaattaac 
cctgccgcgg 
cgcttatgcg 
tatctgaccg 
gggtaagctc 
cgaataacac 
tctttcattt 
tgttaacaga 
tctattctca 
t tcccaatga 
atcctgaaaa 



gtgtgcgtgc 
aggggacagt 
gctgagggga 
gctggcttgc 
gctcgctggc 
gctcgctggc 
gctcgctggc 
gcctgcctga 
gtagggctgc 
gtagggctgc 
gtagggctgc 
gcttgcttgc 
aacagagtag 
gggacccggg 
ctgattgctt 
tgtgtgtatg 
cacctgctta 
tagtatttca 
tagactaact 
aataagaaat 
atcaaccttt 
aattaggtat 
gttggtttta 
ttctgttcca 
gacgtctacg 
tg'ggagtaat 
gtatgcgtgt 
atcgcgcctc 
aagggtattg 
aaacggtaca 
aaagacagga 
gacaattaat 
gtcacatttt 
ctacgttgtg 



ttgtgagaca 
aggcctgctt 
cagtaggcct 
tagtagggct 
ctgcttgctt 
ctgcttgctt 
ctgcttgctg 
tggcttgata 
tgggctgcta 
tgggctgcta 
tgggctgcta 
ttgctagtgg 
ggctgccggc 
cctctccccc 
gctgaaaaac 
cttttgtgac 
acttgctttg 
tcgcatggcg 
ttagtgttct 
ccgttgtggt 
gcttgcattc 
gtctttgctc 
tttatgttat 
acgtggtgtc 
gccagagacc 
ttctacggca 
caaacaaaaa 
acgctccaag 
aggtgttact 
gcggtcatca 
gggacgtatt 
gtgtttttta 
aataaaactg 
ttctctgttc 
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120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 
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ccgttttttc tattggcgtt ttaacaggta 
ttacaatacg ctgcaacgag aactctgaat 
gctacattca accctcccat aatcagcgtt 
acagaaatgc tcatcaagaa gagagtattg 
ctgattcttg ctgtcaatta gttttacttg 
aggaaaacac aattaacgaa gttatggaac 
aacaaacatc atatgaggat aatgttgagc 
attttaatta ctacagtgga ctaatattgg 
tagaaaatca atatcatgga ttaatactgg 
taaatgaatt aaacatgata gaacaatatg 
ttttaaatta aaacataaac aacattaaag 
aaaatttttt tcatatacct gaaacttata 
tttgcccggt atctctgata acatggtgta 
atcacaatca aaactctcca taatgcgcgt 
aagaacactc tcataaacag attttcctcc 
tagtttgtat tgtctataga aattaaacgc 
tccgtgcggt ggttcacgta attctctgct 
gcgcttcttt tcagggatgg aaaaccatgt 
acctacaacc gatggcgtag acgtcatttt 
cggccaaggc attgttccgt ttttaccaat 
gttaactgta atgtccatgg ttttagttgc 
caattataaa caaataaccc acgtgacatg 
tagttaatca tattcgcttt cacataatgg 
cctaccttgt ttggatggat ttatggttac 
ttttacgtaa aacatattgt aatttaatca 
atgtttgcat aaaagcactt atgttagtga 
ttaaagaggc atgtgaatat atttaacacg 
ataactggta aaaacatgac gtttaaa.ctt 
tacgtccact gcgatgaaaa ctgtaaacct 
aacacagaaa aggacttata tagtgttgga 
tatgttacaa atacaaaaat aataacaaca 
ccaaattttc catgcgacag aaaaagatgt 
gtgcacattc acgggggaga taatgcctta 
aatgaaggtt atgatttaat tggtagtaat 
aacgtaaatt gggattcaaa tgaaccagtc 
gcagtggaac atggggacta cctacctaac 
acatttaaat gttcattgtc gtatacactc 
aacaaaaagt ggtcaaactc tttcccaacg 
atagacaatg ggtacataga cattggcttg 
actgtaaagt gtagcgacgg gtacaacatt 
acaacttggg ttccaccatt acctaagtgt 
atgccagaaa cacccatgcc agaaacaccc 
accgctaaaa ctgcaacaac accaaatgcg 
gacgacgtta cttgtgtaaa gcctcatttt 
aaggaaaaat acagtgttgg tgcgagcgtt 
atgcagtcta cagtttctgt tgaatgtttg 
aagtgtcata gaaaaaaatg tccaacccct 
acaagcggag aagatgcttt taagtacgga 
tatcaacttt taggaagtat ggtgcggatt 
gactgggagc caaaagcgcc tatatgtgat 
acaaacggaa aataccatcc ggtgaaagac 
tcgtgcaatc gtgacttttc tttagttgga 
tggaataaac cgtttccaag atgtgaacaa 
ggaaagctgc taacaggttc ttcaagcgtt 
tgtgaaactg gatttactct aattggcagt 
gatccaccac ttcctacgtg cgtgccagct 
gaaaccaaaa aaccaaacac gccaacgcca 
ggaacacata caccattcaa accaccacca 
aaatggaaaa ggcatgtcgt gttagttctt 
cttgctgccc tttattgttg ttttctaaaa 
aggcaaactc gcacgcatta accaatctgc 



ttgcaatatc gttgattatg tgttggttat 2100 
catcaactaa tagttatgca agccagacaa 2160 
ccaatactaa tgaatgtagt cgccatacct 2220 
aagaactacc aaaccaacac acaagtgaaa 2280 
aagtgaaaaa tgtagcctac gatggaccgc 2340 
agtatgatga tgtggttgta aaaaatatag 2400 
acatggacta tagtgatact ataaatccca 2460 
aagaagtaga tgaagttttt tacaatgaac 2520 
agaatttaga tcacaatgag tacaatcatt 2580 
attggttaga ataaataact tgtgggttat 2640 
cagcattttt gtgtaaatcc tttatttatt 2700 
tttaattcca ttttcctcaa agtttgtatc 2760 
ttcggtaaaa ttaattgatg gaaaaaatac 2820 
aatatatagt tttaagggac acttatagtt 2880 
aataacccag accgtgttta attgttcttt 2940 
atcgtctagc gttctcgcta gaaagtgtgc 3000 
caaaataata ttaattctgt tcaccagagg 3060 
tcttttcccc ataataacca cattcttttc 3120 
ctgaaaatac atcatttcgt ttctaaggta 3180 
tcctaattgt tcatcaactg caacgatgca 3240 
cactgacggg tttaacacag aagtatttca 3300 
tacttactaa tgtaagtacg taacgtgata 3360 
acaaaacgaa aaaatgttac ctgtgttaag 3420 
aatgaaaaat aaaaatatat atataacgtt 3480 
cgtgatgttt atgttggcta ctaacacaca 3540 
gcttcaatct ttatatttta ttcaaattgt 3600 
cattcagaaa taggcgcaac gctgtgccgt 3660 
tttcctctgt ttttattaca cgccataatg 3720 
ccacatttca cggaatatcg cgtcaagtct 3780 
gaaacagctg aattaatttg tcgtcctggt 3840 
gaatgtttac aaaatggtac gtggtcaaca 3900 
cccacacctg ctgacttgct gaatggagcc 3960 
aaatttggat ccaatatttc ctatgagtgt 4020 
gttcgttttt gtattttaca agacacagaa 4080 
tgtgaaattc agaaatgtat taaaccaccg 414 0 
caagatgttt ataactatgg agatgcaatt 4200 
gttggatcaa caacattagt atgcacgtca 4260 
tgtttaatgc tcgtatgtga aagtccacaa 4320 
tccagaagat acaaccatgg acaatcaatt 4380 
gttgggcctg aaacattaac gtgcacaaac 4440 
gtattagtta caaataaccc aagcacaccc 4500 
acgccagatt atcaaaaaat aaatttgtca 4560 
tttgttacaa ctgttgtttc tccagagaaa 4620 
gagcgattca tggtaaaggc tgaaaatgac 4680 
gagctaatat gtcgaccagg atttactaaa 4740 
tccaacggaa catggactgc tccaaatgcc 4800 
caagaacttt taaacggaga gtatatagtt 4860 
acaaatataa catataaatg taatgaaggt 4920 
tgtatgctta aagacgattt aaaaacagtt 4980 
attgaaaaat gtaagccacc gccacaaatt 5040 
ttttatcagt atttggacac cgtaacattt 5100 
gatgaaatga caacgtgtat aagtaatacg 5160 
atcacttgca gcgctcctaa tattgcacac 5220 
tacaaatacg gtcaatctgt taccattggt 5280 
gaaatttcta catgcaagga ttcatcgtgg 5340 
gtttcaatgc cttctgacac acctaaacca 5400 
gaagcaccca aaccaaacac cccaaacgtt 5460 
caaaatccac caatagcacc cccaatgagt 5520 
tttgcaagtg tcgcgtcctt gttattcgta 5580 
taactgtttt ttgtcttcag caggttcgcc 5640 
caaccgccgt tgattccgga attaagttta 5700 



cattattcaa ggttgccaat aaaggtggtt 
atgttgaccc gtttatagt t atcgcgccac 
ttccacgtta tactttcacg tactatgact 
tgtggccggg gctgaaaata acacaagggg 
acacggttta taaaactata tcggatgcgc 
gaaaaacatt ttaacgtttg tttagcgaac 
acgccggaca acccctggaa gataatcagg 
acgtgtacgc gtattcgaaa caagactttc 
gaccatctgg atctggcgtt ttctcgctac 
aattccctct caccgtaaaa gccgcataca 
aggtgacgtg ctttcacaga gaggttattg 
tgtttgacgg taccggtctt aacgaactat 
cgcagtttat agaaccgggt ccacctcaca 
taccggacaa ggatgagatg tttcttggcg 
tgtggagggg ctgtctcgtt cccgcggtgt 
gccaggcgtt taaagtgccg ttgtacgacg 
tgccaaggtt ttaccataaa gacgttagcg 
tcgcccaggc cctgagactc aaagacgtga 
tcatgcagga ccattacaaa attgccaaga 
tgccgaaaac gacagacggg tcgtcccaca 
cccttagtta cggctgtatg tttctcgagt 
acgatagctg gcccatattt gatggttgtg 
agcgctggtc ggccgaacag gccgttcacg 
tgctgtacct aactaaagtg cagaagcaag 
tgtacaactc ctttttcctc caacacggac 
aaaacggcag cgaagccttt aagggcgtac 
cgccgtatca cctggcctac gccgcgtctt 
attacatgca gttcttgcaa caccacaaaa 
attatgtcgg caccgccgcc aactcagaga 
caacgtgcct caacacgctg ttctatagac 
ctcagcgcag ggacccctac gtggtgaccg 
ttctgggcaa cttcgcgagc tttagagacc 
acccaaagta cacgtactgg cagctatgtc 
gaatcaccga agaccacgat aatcacgtga 
gggtgttcaa gggtatcgac tcaattgtgg 
tgattaaaaa taactttaat ttccgcgagc 
tctgctgcaa cgtgtattgg caggcgccgt 
ccctgctgtg gatcattcag gatatctgcc 
atccggcgat gggcatcctc ccctccgagt 
cgaactttaa agcggcgtgt ctcgaccgcg 
tacaccggga catgttctgc gacttcttcg 
ccccctttaa aatgcaggtt agaatagccc 
aaataaaaaa tagaatcatt ttttccaaca 
tcgtcaaacc gacgggaacc agggacactt 
tcaactcgct gcatcgcgcg ctgttccccg 
acaagatctc ccagaccaac aaaaccccag 
cggagctggt gtcgtacgta aagaccaaca 
acgtggttcc ggattcactc atgtcgtacg 
gggcatgtgg ccagattcag ttctacgcca 
agacgatcga tgccgaggaa tacccccacg 
tggcttacct ggcagaaata cgcggccgca 
agccggtcgc cgccacaggg cgcctgcgtc 
aatacacggg ggtcaacggg aacaacaacg 
cggggcgcgg cgtggaccgc aacctgtggc 
tcagcgccat gctaagaaag agacacgtca 
agcgagccgc gggacagaca atcagcacgt 
aggcgctgtt agaggataag gacaacccta 
tacgacacct ggggaagggc tgccaggact 
gtgactattg tatgttgacg gacgaggttt 
gcgtgccgtg gactatcgag gacgcgggtg 
atcttcagtt cgtagacagc gacgatatcg 



taaaaatatt ctattggtgt tcattgtttt 5760 
cttgtggcta cattatatag cacgatcact 5820 
catacgcctt aacgtcacgt ggcgtgcgat 5880 
tacataatcc atccaggcgg cacacattag 5940 
caacaatcac tgtcgctagc gacactgata 6000 
ttgaataaca cataatggct tccaaaggca 6060 
ggtctcgtgc cccgataggt gcgtgcggat 6120 
cctttgccga ggcgtccata ctcggcaaca 6180 
caatccttta cggacttaca gttgaacacg 6240 
aaaaagttga caccacgacg ctcgccgtta 6300 
tgtttcacaa tgcaagttta ttcaggccgg 6360 
gcgaggaagc cagggctctc tttgggtaca 6420 
gcatatggaa ccctctggaa tgtccgcagt 6480 
ttgttgttac ggaagggttt aaggaaagac 6540 
tccagaccca gcaggtgcag attgccggac 6600 
aagacctgtt tgcacctcac ggtcatagaa 6660 
cgtacctcta cgactccctc tttaccagca 6720 
cggcggtcat ccacgccaca gaaaagcaat 6780 
tagtgcaggc aaaacagttt tcaacgacgc 6840 
tgattgtgga cagcgtcgtc gccgagctcg 6900 
gtccccagga cgcgtgcgag ttgctgaact 6960 
actcaccaga ggctagggtt aacgcgttag 7020 
tggcgggtca actgttcgct gccaattcgg 7080 
cgcccagggg acaaaaggga gacgtaaacg 7140 
tggggttttt aaatgaggcc acgatcaagg 72 00 
cctcaaacgc cctcgatggt tcttcgttca 7260 
tctcgcccca tctgctggcg aagttatgtt 7320 
gctccacgaa ccaggcgttt aacatggtcc 7380 
tgtgcacgct atgtcacggc aacacgccgg 7440 
tgaaggatag gtttcccgcc gtaaccaccc 7500 
gaacagccgg gacctttaac gacctggaga 7560 
gcgaagagga cggaaacccg gccgacgagc 762 0 
agaccgtgac agaaaagcta tccgcgattg 7680 
acctcatcac caacatccaa agttttctca 7740 
acggagaggt catgaagttc gttaattcga 7800 
acgtcaaatc ggtccatcac atactccagt 7860 
gcgcggtgtt cctgaatctg tactacaaat 7920 
tgccgtactg catgatctac gaacaagata 7980 
ggctaaagat gcattttcag acgttgtgga 8040 
gggtcctcac ggggtgcgaa ctgaaaatcg 8100 
acaccgacgc ggggtccaac ggcttaatgg 8160 
gagccatgat ggtcgttccg aaatcaatta 8220 
ccgcgggatc cgaggcggtg cagtcggggt 82 80 
acgtggtggc cggaccgtac atgaagtttc 8340 
acaccaagac cgccgcgctg tacctgtggc 84 00 
ttctgaaaga cgtcccggac gacgagctgg 8460 
gcctcgcgtt cgaggaaacg aacgtgctgg 8520 
cgaggatcaa actgaacggg gccattctaa 8580 
cgacgctgca ctgcctcacg ccggtgctac 8640 
tgctgggctc cgcggcaatc gccacaccgg 8700 
ccgccctcac cgtccagacg acggcgcgtc 8760 
ccgtgataac cgttcccatg gtagtcaaca 8820 
ttttccactg cggaaacctg gggtacttcg 8880 
cggaaagctc cccctttaag aaaacgggcg 8940 
tgatgacccc cattatcgac cgcctaataa 9000 
tcgaggcgga aagcgttaaa aggagcgtgc 9060 
acctattgaa gtcggtaatc ttggagctta 9120 
taagctccga ggacgtgcaa tattacctcg 9180 
tatttacgtt ggataatata gcacagtcag 9240 
ccctaataga ggatcgccag gacgcagacg 9300 
ccaccgcttc ctgtcagccc cccgaggaac 9360 
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agctaccgac ccctagcgcc ggcgccctac tggccgggaa gaagcgaaaa attaacgcgc 9420 
tgctgagcga tctagacctt taggaaaacc gtggcaggcg ggcaacaatg gccagggaac 94 80 
tcgcagcatt atacgcgcag ctgtcggccc tcgccgtcga cttgagtctg gttatctttg 9540 
cggacccgcg aagtatcgac ggtgcccgca ttctaaaaac aaaaacacag atagagaacc 9600 
tgaaccgcga ccttctgccg ctgctacgcg agcaaaactc ggtagagacg tccagcctgt 9660 
cgctcgaagt ggagcacctg gccaaaaaca tcgaggacaa actcggcgag ctggagcgca 9720 
gtctgcggca gagatattcg agccgagagc attttgaaac actacacctg agacccgaat 9780 
gtcactatca ctctacggtt acttttcagt tttacggggg cgggttaata gatgtaaaca 9840 
tgtgcctaat aaacgatgta gaactgctgt gtaaaagact agggagtgtg ttttattgca 9900 
tcggtgcaaa cgaagctctg tccggattga accgggttct gacgtttctg tcaacactgc 9960 
ggggtatctc cccgatcccg cacccagacc tatacgtcac gtcagtgcct tgcgtacagt 10020 
gcctgaggga aatcgaactc gtaccaaatc aggggtccag tttactcgcg gtgttggcag 10080 
accgacactg cgatcacctc tgtaagaagg ttagggcgga gccaatacac ggcctgtttg 10140 
agacagaact gagccagctg ggtctaaaag taacaaaacg ttcggacgcc acgcagcacg 102 00 
gcgtccggtc ctctgcagat cagttaaggg agtcgtcgct ggcggccata caagatcaca 10260 
atatattcaa acgggtgtcc gcgtcaatca tggaactatc caatctaatt tattggaacg 10320 
ccgggcaaac cggcctccag accgggaccg aaaacgagtg ctcacaaatg gccagactgc 103 80 
taacacacga ggccgatatg cacgagcacc gtgcgctaat aacacccaaa ctaagcgcga 10440 
ctcacttcta cgactgtttc cgaccggatc ccatagaatc cctgttctgc ggcggtcttt 10500 
ttaactctat agacgacacc ataaacgcac tgagccggga ttgctccgtg acgttctttc 10560 
aacaggcaaa ctataccaac gttatgcgaa aacaaaacga gctgttcacc agactcaata 10620 
gcatcctgcg tcaggggagc gcgggatcgc aaaaaccggc caccccctcg gagccacgga 10680 
ccaccaccgt ggcggcaacc gcggcaagcg acgtcatcaa agacgcacag tatcgcaaag 10740 
aacagtacat gaaaaaggtg gccagggacg gctttaaaaa actaacagag tgtctgcaga 10 800 
cgcagagcgc ggtgttggca aacgcactct gcatgcgcgt atgggggggc gtcgcctacg 10 860 
gcgaggcgtc cgagctggtg aaccattttc tcctcaggcg gcgcttcgtc gcgcttccct 10920 
gggaggcgcg ctgccgctcg gatcagattt tattcgaaaa ctcaaagtac attaaaaact 10980 
cactatattc ccagcgcctc agtcgcgaac acgtagagat tatcacgctg cagttttacg 11040 
gcctgataac cggccccctg acgcgccaga gcgatctctt tcccggcccc gccaacgtcg 11100 
cgctggccca gtgtttcgag gcggccggaa tgcttccgca tcacaagatg ctggtgtcag 11160 
agatgatatg gccccagatt caaccgaaag actggataga ccagacattt aatcgttttt 11220 
accaacttcc cgaaggtgat ctcaacgcgg tacaaaagtc cgcctggtgc tttatacgag 11280 
agctcgtcct ctcggtggcg ctttataatc gcacgtggga aaagacgctg cggatatttt 11340 
ccctagcgcg cgagaaactc tccatctcca acctagacgt taaaggcctg acgtccggcc 11400 
tgtatctaac gtacgagcaa gacgcgccgc tcgttctaat ttctcaaaat accggctgga 11460 
tatttaaaga cctgtacgct cttctgtacc atcacctgca actgtccgac ggccatgatg 11520 
ataactaacc gaacgcgtcg tctcctgcgg gcgtgggtcg tgataatcgc gatcggcacg 11580 
gcggttggcg aaaacgtcac cacccccaag ggcgcgacca ccaccgcgaa gccaacgccg 11640 
ggcccgtcga cgcccacacc tcccgagaac ccacctaggg ccgaggcgtt taagtttcgc 11700 
gtgtgcagcg cctcggccac cggcgaactc ttcaggttta acctggaaaa aacgtgtccg 11760 
ggcaccgagg acaagacgca ccaagaaggc atcctgatgg tgtttaaaaa aaatattgtc 11820 
ccgcacatct ttaaggtcag acggtaccgc aaggtggcca cctcggtgac cgtctatcga 11880 
gggtggaccg agaccgccgt gaccggcaag caagaggtca tccgaccggt gccgcagtac 11940 
gagatcaacc acatggacac gacctaccag tgtttcagct ccatgcgcgt aaacgtcaac 12000 
ggcatagtaa acacctacac ggacagggac ttcactaacc agaccgtgtt tctgcaaccg 12060 
gtcgaggggc tcacggataa catccagcga tacttcagtc agccggtgct gtacacgaca 12120 
ccgggatggt ttccgggaat ttacagggtc agaaccacgg tcaactgcga gatcgtggac 12180 
atgatcgcgc gttcggcgga accgtactcg tattttgtca ccgccctggg agacacggta 12 24 0 
gaggtgtccc cgttctgcca caacgactca acgtgctcgg tcgcggagaa aaccgaaaac 12 300 
ggcctcggcg cccgcgtgct cacaaattac accatcgtcg acttcgcgac ccgccagccc 12360 
accaccgaaa cgcgggtctt cgccgactcg ggagaataca ccgtatcgtg gaaggcggag 1242 0 
gaccccaagt cggcggtctg cgcgctgacg ctctggaaaa ccttccccag ggcgatacag 12480 
acgacgcacg aggccagcta ccacttcgtg gccaacgacg tgacggcgac cttcacgtcc 12540 
ccgctctccc aggtaactaa cttcacgggc acgtacccct gcctcaatga tgttattcag 12600 
aaaaccctca acgccaccat caagaagctg tccgataccc acgcaacaaa cggatcggag 12660 
cagtactacg aaaccgaggg gggtctgttt ctcctgtggc agccgttaac gccgctaagc 12720 
ctagctgacg agatgcgcga attaaacggc accacgccag caccccccac cacaacctca 12780 
accgccaacc gcgttcgaag aagcgtcggt acgaacgagc aggcaacgga cgacctagcg 12840 
gcgccccagc tgcagttcgc ctacgacaag ctccgcgcga gcatcaacaa ggtgctggag 12 900 
gagctctcca gggcgtggtg ccgagaacag gtgagggaea cctacatgtg gtacgaactg 12 960 
agcaagatta accccaccag cgtaatgacg gcgatatacg ggcggccggt gtcggccaag 13020 
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ttcgtgggcg acgccatctc cgtgacggac 
atccacaaga gcctccgcac gtccaccccg 
ttcaggttcc tcaacagcac cacgctgttc 
atactgacgg acaaccaggt ggaggcgtgc 
agcaacgtaa cctactacta caaagactac 
atatccaccc tcggtacgtt catcgccctg 
agggtcatcg agctgtacag ccgcgcggag 
gaaaccatgt tcagggaata caactactac 
ctggacaaca cgatcgacct gaaccgcgac 
gcggacctgg gcgatgtcgg ccgcacggtc 
ttcggatcaa tcgtgagcgg gttcattaac 
atgatcctgg tgattgtggc ggtcgtcctg 
gccatcgccc aggcccccat caggatgatc 
ggcggtaaag tcgaccagga gcagattaaa 
caggaagagc gtaggcggtt agacgaacag 
gcgtcagacg gactaaaacg tcgctttagg 
caagagtatg aaatgagcaa ataaccacac 
ccgcgcggcc aatcgattcg cgcacccggc 
cgtccgcggt ctgtatcaat catggatttc 
cgcccccact cacacagagg caccgatgct 
ccgccaccag acgtttgcag gctcatcccc 
atgatcccgg tcacgatccc gttcccgcca 
gtgctgctcg ccaacgaacg gtccatgtgg 
gacccccaag accaatccat cacgtttcac 
gcggacaggt gtgccgaggt acctagccgc 
gtgctcaagc tcctggggcg aaccgaggac 
caacaggtat atttctacgc gaaggttcca 
caggccctca agaacacagc cggccgtgcc 
aaaagaattc tcaaaacgta cgacgtcgcg 
tccggttcca tgctctcgac cctcagcgac 
gagtcaaacg tggacgccgt tcgccggttc 
tggtactcgt gcgcgcgcgc cacgccccgc 
gagtttgact gcagctggga ggacctcagc 
taccgcatcg tggcctttga tatcgagtgc 
cgcgacggcg acgcggtgat ccagatctcc 
cccaatccgc caaacatact gttcagcgtc 
gttttggagt ttccgtcgga atatgacatg 
ttcgaggtgg actttttaac cggctataac 
acgcgagcgt cccaggtgta caaccttcga 
tccatctttg aagttcacga gccccgtggc 
aaaattaaaa tagcgggcat cgtccccata 
agcctctccg actacaaact ggacacggtg 
gacgtatcgt acaaggacat tccccctctg 
gtgggcagct attgcgtgat ggactcggtc 
atacacgtgg agatttcgga gatagccaag 
acggacggcc aacagctccg cgtgttctcc 
tttatcctcc cggttccaac gcccgaggga 
aaccccattc cggggtttta cgacgagccg 
ccgagcatca tccaggcgca caacctgtgc 
cacctgcacc ccaacctgac gccggacgac 
gtacattttg taaaaaaaca caagcgggag 
ttagaaaagc gaagggcgat ccggcgcacc 
accatcttag ataaacaaca gctggccatc 
accggggtgg ccagcggcct cctcccatgc 
gggcgcacga tgctggagat gtcaaagtct 
cgaacgcgtc tcggtcgcga ggtgaccgcc 
ggtgacaccg actccctctt tatcgcgtgc 
ttctgtgacg atctggccgc caggatcact 
gaggcggaaa agacgttcaa gtgtctgctg 
ctattgaacg acaaaatggt catgaaaggg 
tttgtccagg agcgatgccg cgccatcctg 



tgcgtggcgg tggaccaggc gtccgtcagc 13080 
gggatctgct actcgcgccc cccggtcacg 13140 
aagggccagc tgggacccag aaacgagatc 13200 
aaagagacgt gcgaacacta cttcatagcg 13260 
gtcttcgtga aaaaaattaa cacctccgag 13320 
aacctgtcgt ttatagagaa catagatttc 13380 
aaaaagctgt ccgggagcgt tttcgatata 13440 
acgcaacgcc tggcgggact ccgggaggac 13500 
cgcctggccc gcgacctgtc cgagatagtc 13560 
gttaacgtgg ccagtagcgt gataaccctg 13 62 0 
tttataaaga gtccgttcgg gggcatgctc 13680 
atcgtgtttg cgctaaaccg gcgcaccaac 13740 
taccccgaca tagacaaaat gcagccctct 13800 
aacattctcg ccggcatgca ccagctacag 13860 
cagaggtcag cgccctcgct tttccggcgc 13920 
ggatataaac cgctggaaaa cgaagaggct 13980 
ccacacgcct gtacttgccg cccgccagag 14040 
cggtccccga caccctctag cgcccccggg 14100 
tttaacccgt acctgggccc tcgcggacca 14160 
cccgcccctg ccggcgccgg agccgttcag 14220 
gcctgcctcc gaacgccagg ggcaggcggg 142 80 
acgtacttcg agaacggtgc tcgcggagac 14340 
acggcgcgcg accgcaagcc cgtcgccccg 14400 
gcgtacgacg tcgttgaaac aacgtacgcg 14460 
ttccaaacgg acattatccc aagcggaacc 14520 
ggcaccagcg tgtgcgtgaa cgtgttccgt 14580 
gccggcatta acgtcaccca catcctccag 14640 
gcgtgcggct tctcgaccag aagagtgaac 14700 
gagcatcccg tcacggaaat cacgctatcg 14760 
cgcctcgtcg cgtgcgggtg cgaggtgttc 14820 
gttctggatc acgggtttac cacgttcggg 14880 
ctggcggcca gagatgccag gacggccctg 14940 
gttcaagcgg accgcagcga ctggcccccg 15000 
actggagagg cgggatttcc gtgcgccacg 15060 
tgcgtcttct acacgaccag ggaaggcgcg 15120 
gggacgtgcg accccatccc ggacaccgac 15180 
ctggtgtcgt tcttcgccat gatccgcgac 15240 
atctcaaact tcgatctccc gtacctaatc 15300 
ttaaacgaat acacaaaaat aaaaaccggc 15360 
99ggg a gggg ggttcatgag gtcggtctca 15420 
gacatgtacc aggtgtgtcg cgaaaagctc 15480 
gccaggcagt gtctgggtgg gaaaaaagag 15540 
tttcgctcag gtccgggcgg cagggctaag 15600 
ctggtgatgg acctcttaaa aatgtttatg 15660 
ctggccaaga ttcaggccag gcgcgtcctg 15720 
tgcctgctgg aggccgcggc cagggagaac 15780 
caggggggct atcagggcgc gacggtgatc 15840 
gtcctggtgg tcgattttgc cagcctgtac 15900 
tactccacca tgatacacgg acgagacctg 15960 
tacgagacgt tcgtgctgag cggcggaccg 16020 
tctctgctgg gaagactgct aaccgtgtgg 16080 
ctggcggcgt gcgatgaccc gtcgctaaaa 16140 
aaggtgacat gtaacgcggt ttacgggttc 16200 
attaacatag cggaaaccgt gacgctccgg 16260 
tacgtggagg ccctgacgac ggaagacctg 16320 
cgtcacggcg cgcggtttcg cgtcgtctac 16380 
gacggttatt ccgcggaagc cgtttccgct 16440 
gcggacctgt tccccccacc cattaagcta 16500 
ctgctgacga aaaagcgcta catcggggtc 16560 
gtcgacctca ttcgcaaaac ggcctgcaag 16620 
gacctggtgc tccacgatcc ggaggtcaag 16680 
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gctgcggcgc 
gctggcttta 
gtcgtgccca 
aagaccacca 
ctgccccagg 
tcggacctgg 
ctatatttcg 
aacgcggaca 
tcgtgaacgc 
gagctcgagc 
ggtggtcctc 
cacccgcctg 
gctcgaccag 
gccccccttc 
cgcggcgcgc 
aaacgcgagc 
ggagcgcgcg 
aaagcccacc 
gtctcagacg 
cctaagcctg 
gcggatctcc 
cgttcacacg 
cggggacccc 
cggcggccgc 
cgcggaagtc 
gtttctggtc 
ttccggttgc 
aagagacacc 
cgacgcgccg 
cacaatcccg 
caacggggtc 
tttcgcggcg 
ctggtgtggg 
tgcgtcacgg 
aagaggtttc 
gtgttctacg 
cgcgcggatc 
tacggactta 
cactttccac 
aggctaacct 
accgcgcacc 
cgcctgatca 
cacgtgaggg 
gcgaccctgt 
ctcccgtggt 
tgcatcccac 
aaccgcaagc 
gactgcgagt 
ccggtgtaca 
t tcggtggcc 

cc 999 c 9999 
tccacgtaaa 
tcattctcgg 
accctcgcca 
ttacatcata 
gcgaaacgcg 
ttcattagac 
gagggcggac 
cgcgtcgtcc 
aaccatggcg 
catcaatata 



ggctgttgtg 

taaaaatcgt 

tcgagcagtt 

acctgcccca 

tgcacgatag 

ccgaacaccc 

acaaactggt 

ccacggtggc 

caattggaga 

ggcgaggagc 

ggcgactggg 

caaacgttca 

ttactccacc 

tccgactgcg 

gtggcccccg 

gggcggcaca 

gtcacgtacg 

tgcgcgaccg 

tcgccccatt 

ttgcggttac 

ccgaaatacg 

ctgtccccgt 

aacgccgaga 

gtattggggg 

cgggttcagt 

acgggcgtgg 

acaacccacc 

cttgtggccg 

cgagacctcg 

gtcggtttcc 

cacgaacgca 

agtggcagac 

ccgccactat 

agaggtctcc 

ccggcttcgc 

cgttcggtca 

gcgagctggt 

aggtattcgt 

aagaccgcgt 

ctgaagagcc 

catttctctt 

tggacaacga 

tgaccgtgac 

ccctcaccgg 

cgtccacaac 

ccaactgtag 

tgacggcgat 

ggccaccgaa 

tcagcaccgg 

cggcgaaact 

tgacagtgga 

ttactaataa 

ggtcgcgcgc 

cacaaaacca 

gctattgcgc 

ctgagaacaa 

tccgcgccac 

agcaccacgg 

agcagcatcc 

ttcatacact 

gggaagtgaa 



caagcggccg 
agaggtcctc 
aacgttctcc 
cctggcggtg 
aatcccctac 
ggattacgtc 
gcacggcgcg 
catcctctac 
acgccaacat 
gaccaacgga 
aggtgacttt 
agggccacgg 
aacatttcgc 
tggcccttat 
ggttcgtgct 
cgatccggtt 
tcttcggcga 
aaagcctgcc 
ctttcgttgc 
aggtgaggcc 
tcacgtttag 
cgcggtgtaa 
tagtcctggg 
tctacgccga 
taatcttcca 
caccggagcc 
tgcgcctatt 
ccgccgcgcc 
tcgcgtcacc 
caggggtggt 
tgaaccattg 
ctcgagtcta 
tcacgacggc 
gtgtctgccg 
ctttgccagc 
ccgcgacaac 
gctgcgggtt 
ggcgatcgtt 
tccgatcgcg 
gtggataaaa 
aacccagaag 
ccagcgcagc 
ccgccccccg 
taatgcgcca 
ggcgatattc 
caaattcgta 
tattagcaat 
ccgggagata 
gacgcagctg 
gcggcagctc 
cagccaaaaa 
accgtttgct 
ctcgcgaaca 
gttaacgcct 
ggcgcccgtc 
cgtcccgggg 
ctatcttgtt 
tcaccgcgga 
tcagtcgcgt 
cggccccgcg 
caaacgacat 



ccgcacgcgg 
aacgcgagct 
accgagctca 
taccaaaagc 
gtgttcgttg 
agacagcacc 
gccaacatcc 
aattttctca 
aagacgccgc 
gaccgccacc 
tcaccggggt 
gggctacgcc 
gttcggactc 
cgccccgttg 
ggactcctct 
ctgcctcctc 
gaacggcggc 
cggtggaccc 
gtattttccc 
gttttcggat 
taactccggg 
aacggcgcaa 
ccagtccgga 
cgccgaaaaa 
acagggagcg 
cctattcgtc 
caaccccaac 
ctgccccgtg 
agacaccggg 
ctcggcggag 
acggcaacga 
gtggacaacg 
tacctgacac 
gcatgcccca 
gccactctgg 
ccactggaca 
cacgctccgc 
acggtggtgc 
ctgacagacg 
attcaaggct 
accaagccct 
gccgtcaaca 
gaaacaatcg 
atcgcctttc 
acgcccgtgg 
aggtacggta 
cacgcccaca 
gagattttgg 
gggcaagcca 
ctcggccacc 
ctgtgtaggt 
cgtatcgctc 
cacaaggtgg 
tccgttagat 
ccgcaaaaac 
gctctgccct 
atttacggga 
gccaatggcc 
caggtaactc 
aaactcgcgg 
gatgcgcggg 



tatacgagga 
atctggacct 
gccgccccgt 

tggcgagcag 

acgcgcccgg 
agafctcccgt 
tccagtgtct 
acgtcccgta 
gccagcggag 
atgctggtta 
agattcagct 
agggtccgac 
gtgacgcgtc 
gattccggcg 
cgcccgctga 
tttctaaagc 
gcgcgctcgg 
ctgcgcgtct 
acggccaact 
gacgcggcgc 
ggtaacgtct 
atggaaatca 
cccgtcctgc 
accatccaac 
gccgctcggg 
gtcaccccgg 
ggtaccccca 
gtgcgattaa 
gcgctctcca 
tgtcacgtgt 
tgggaacacc 
gcacgccacg 
tggtgaacag 
gcatcgggag 
gcgatcgggg 
tagtacccgc 
aaacaacgcg 
gcccgcccgg 
cgtgcagcca 
ttcccgtcct 
ttaccgagcg 
ccgtctacct 
tcaccgacgg 
gccacaaccc 
tgtacgtggg 
acacctacgt 
acggcgggtt 
taaccaacgt 
tcttcgtgtt 
gatcgcgcgc 
ttgagaccat 
acacaacgcc 
ctcaaacacc 
gcagtttatt 
atctgtagat 
cccaacgcac 
agctcctcaa 
gcggcgtccg 
tcgtaggccc 
cgctggcact 
tactcacgca 



ggggctgccg 
ccgaaacagc 
ctgcgattac 
gtgcgaggag 
gtccctaaag 
cgcggtcgac 
gttcggcaac 
taagctgttc 
tccgcaggga 
acgaactgtc 
tcgtcaacct 
tccccttctc 
tcaaggaact 
gcgacgcgga 
ccgtgtgggt 
cgatcgacct 
agggcacccc 
ccggcgaggc 
cggtggcctg 
acagggacgc 
gcaaggcgtc 
tctacgctcc 
ccacccacac 
ctggaagctc 
gcgatctggc 
cactcttgct 
cgactataaa 
gctccgccga 
ttaacgcgtt 
ccctacgcga 
ggtgcgtttc 
gtacagctcc 
gtcagagctg 
actggtcggg 
aacacgcacc 
cgtggtcgag 
ggtgtcgcga 
ggtgttccta 
ggagggctcc 
atctgacgag 
aaagttttgc 
gggaaagcag 
ccccgtgacg 
atactttgaa 
cctgaccgtg 
ctcggcattt 
ccggattcag 
gtcccaggcc 
cgcgccgcgg 
cctggagctg 
gtacctgttt 
aaaccgtctc 
cccctcccgc 
tattatttta 
attccagtat 
gcacggtttt 
ttagagagtc 
cgtccccggg 
ggagcccgcg 
caacgttaga 
cccgacagag 



16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17 94 0 
18000 
18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18600 
18660 
18720 
18780 
18840 
18900 
18960 
19020 
19080 
19140 
19200 
19260 
19320 
19380 
19440 
19500 
19560 
19620 
19680 
19740 
19800 
19860 
19 92 0 
1998-0 
20040 
20100 
20160 
20220 
20280 
20340 
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agtggagtgg tggcacaggt aaacgaacca gcgctgtatg tttaacccca cttcagacag 20400 
gcgccccctg gtcctgtcaa gagaggcgcg gttgccggcc cactggggga gaacgttaat 20460 
tccagcggca gtcgggggag gcgccagcgt agggctggcc gcccaacacg acaggtaaaa 20520 
caagacgaac cagacaggga acatgaccgg ttaaattact cgcttacaat cgcgggcggc 20580 
ggccggtcaa cgccaggtcc attaaaaaca caccggccgc caacccccaa cgcggggccg 20640 
cgccctggaa cgcggttcct tccaatcgca aagaaccgcg tcacaaaaag ggctcgtttt 20700 
gaacccattt tgtgccatcg ggtttcgttt- tcagatacgg aaacggctcg tccaaaaaca 20760 
cccaacgggg gtgtgctcaa acggccatct ccatctctat gtggggatgg gggtcgtagc 20820 
cctcgagact cagatccgcg cgcgtaaagt cctccagacg cgccaccttt ctcaaaatct 20880 
tcagccgcgg aaacggacgc ggggtcctcc gcagctgaag cagcagggga tcaacgtggt 20 94 0 
tgttgtaaac gtgggcgtca cccaaggtgt gcacaaagtc tcccggggtc aggcccgtga 210 00 
cgtgagcgat cagatacgtc aggagggcgt agctggcgat gttaaacggg accccgaggc 21060 
ccatgtcggc ggacctctgg tacagctggc aggacagctc cccccgagcc acgtaaaact 21120 
gacacaaaac gtgacaagga gggagagcca tccgcgcgag gtccgcgggg ttccacgcgc 21180 
acataacgat gcgccgatcg tggggccgcc tgttaattag atccaccacg taacgcagct 21240 
ggtccacccc ctgaccctcg tagttggcgt cggcccccct gtactccgcc ccaaaatgtc 21300 
tccactggaa cccgtacacc ggccccagat cgccctcgcg gcggtccccg aagccctgcg 21360 
ccgccaaaaa ggcgcgggac ccgtgcgcgt cccaaatttt tacgccgcgg cgcgacagtt 21420 
cggtggagtc ggtggagccc ctgataaacc acagcaactc ctccacgacg cccctccaaa 21480 
acaccctttt ggtggttaac agaggaaact cgtccctgag gttatatcgg gcctgaagcc 21540 
cgaacacgga cctggtgccc acgcccgtcc tgtcctccct ctgcacgccg tgtttaataa 21600 
ttaaatccag gtgcgccaag tactgcagct cgccgtggtc gccgcggcac gcgcagcgga 21660 
cggccgcggc accttcgggt gcggaaaaaa cccggagtcg tgcggccgcg attccagcca 21720 
cgcaaacggg tataattttt ttaacgtagc agataccgag atgcaccagg acaatcataa 21780 
ttacaaccga ggtgctttac aactataacc gtaaacggct gaagacgaaa cttattttaa 21840 
aggcagcttg ggcggggccg accacaggcg cggcccggct gttagcgagt aataaaccca 21900 
cgtggtctcc ccaagcgtct ccccgagcgt gagcgtctcc ccaaacgcca aaaacaacac 21960 
agataataaa ataaataaca tgtttatttt ttataaactt aacgcgcgat tatgcttctt 22020 
aacaggggcc aatagaagca acgtgcgagg cggcattcgc gcgctcaaag gcaaccgcaa 22 080 
caccggcgct cgtaacagcg taaacaaaca actattagcc attcgtaacc gtaaaatcca 22140 
accctctgcg tcccgagcca ttcgcccagc tgcgtctacg tcgagggtgt ttctaagtcg 22200 
cattttgagc ccttggcacg gccgcgtctg cgttaatggt gtttctaaat cccgctgcca 22260 
aggccctctt cggaatcgtc aatcaggctg cgccgggttc ttctgccctt tttgggacgc 22320 
ttgtccaccg cctgaagcag tttcttcact gcgtcgtccc cgggattggc acacagcttt 223 80 
ttaccgcggc gagtgtttaa tatcacggcg tccaccgagc actgcgacga ggtgtgggag 22440 
taagagacca ctaaaccggg tggcggcaga tgagttacat accccaaaca gcagagttcg 22500 
ggcgcgggtc cgctcatgga gcccatagga aaggcataat caactacaca cgcgaacacc 22560 
gcaaaaaaaa cgcacacgaa aaggcccctc atggtgacaa aaacacaaaa caaagacggg 22620 
gtcgctcgtg cgttgccacc agcgccacgc tcgccgtgct cggccacggt tgacggcgtc 22680 
cattaatacc cgcggcccgg gcccgcctcg acggaaacca ggcgtggtta gccacctgac 22740 
gcacgtgcct cccgtccaga tgtggcattg cgagtcccaa aacgggcgtg tctaactacg 22800 
gtcgccccgg ggttcaaaga acggggatcc cccaaacagg taaaaagctt tttgtccaaa 22860 
cagaaagcca acagcaaccg aaaggaatct ccaaaacaag caagggtccc gtgccggcga 22920 
taaacggaaa atttccctca cctaaacaca cgcgccgcgt aaactgttaa aacacgcaat 22980 
acttcctagg cctgtgttta ataaaacaca caagggttgt aatttcgcgg ggccctttgg 23040 
gccccgcgcg tctgggggtg cgttgtctcg ttgtggttcg caatcaccac cccgcctgcc 23100 
cgaaaacagg gcaacaaccc cctggcctag ttttttaaaa acttaacacc ggcaagggga 23160 
gagggagaag gggtgcggct aaatgggctg ttagcagcca accgcagcgg ggaaaaaagg 23220 
gggcaaacgc cgggttaaca caaacagcaa caggcgccaa acccaatata acacaacgca 2 32 80 
tattccgtgt gtttcgcggg ccggggtcgg tacggctaac ccgcgttggg cgtttgcaca 23340 
agcacgcccc cgccctgcac ccggctcccc gaggtcaccg aagggtcagg aaacatagtt 234 00 
ttcacggtcg ccagtcgcgc gcgtggcaac atctttcggt gccctccgaa cgacccggaa 23460 
aaatccccgc cgcgccgggt cgttcggagg gcaccgaaag atatccccaa acgcaaccta 23520 
aagcatcatg tttggggttt cggtgacgcg cgccgcagag gaggccggtg gtggcgctgg 23580 
cgaagatagt ggccgcagag cacgggtttc ggtagtgacg cgcctggacg gctgccacct 23640 
ggcagcaggc caaaaacacg ggcgccgcga gcggccggca ggtggcgccg cgctttcccc 23700 
cacgccgcga tttggcaaaa tggcggaccg gcagcgccgg ttggacgcgg cggacgtcaa 2 3 760 
tcacgacgga gagaagcgca gttggfccagt tgtcccgtca atcaccgagg gccgcgccgg 23820 
ttggcggata atatgaaacc ggcgcggcca ttggacgctg gcggcagcca atgggcgcgt 23880 
tgggttttgt ttacaagttc cctatatata ttatattata ccttccctgt taagggaatc 23940 
ccatgttctt aaatctaatc agctgggtta tataattaga atatcagtaa ttactattat 24000 
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tactgtatta tttattaaat tatttatttt attatttatt taataaatta attcttgcgt 24060 

tattatgtaa tcagatactg ggaaatctac tcccagttta ataaaaataa ataattaatt 24120 

ctattatggt gggggttcta aatattgtac aaaacaacaa tattttttaa atttattcac 24180 

taaaaatgct cccattttac ttgtgacgta catgttagga atgtgggagt ggtttgtaac 24240 

tatgttttca aacaacaccc ttttgggccg tctgaaactc atataagcgg aaggctcggc 24300 

gttaattgcc actggccgct aaccagtcca ccttgccagt tggagtttta ttgctgctta 24360 

tggctttacc tcgcattccg ggcacaagct gatttacgta gcacgcattg ggggtttaac 24420 

ccggttgaat ttagtggttt gcacgtgtaa aaaagcccac cgaggcttgg gtttaagcgt 24480 

ttaaagcgtt gctggtttta taacatgcga tttaattgct atgggagcta gcttgtattc 24540 

cgcatgcttt cggtttagta gttgtacaaa acagaattgt agcccgcaaa ttgttactgg 24600 

tacccaaaac ggcaagcccg cttaacacgc cccgcgctgc ggcttttgtt cctagcccgc 24660 

tcgcaataca atcgagattt taatttaaaa tggggcaatg gcgtagcgca aatttaagcg 24720 

cgtcaaaaat ttaaaatgga ctaacgtaac cggtgctttt agtgacctat ggcgaatttt 24780 

aaaatttaaa gccgtgggtt ttaacacaga gctgccagct tgtaatacgt tgcatataca 24840 

gtgccgttgt gattttatgg ttaagtggat tttaacattg aattttgtaa tggtgtacac 24900 

gtggattttt aaattgaatt taaaattgta attacgtgcg cgtggatttt aaaattgtat 24960 

ttatgtaaca ttgttaaatt ttaactgcgt tatggttgta ttttggttaa tttatgtaat 25020 

tttgctttta gatttgttcg ggtgtgaatt ccagacaggt aagagcctag ctcctaatgt 25080 

ttgccttgcc gcctagctcc taatgtttgc cttgccgcct agctcctaat gtttgccttg 25140 

ccgcctagct cctaatgttt gccttgccgc ctagctccta atgtttgcct tgccgcctag 25200 

ctcctaatgt ttgccttgcc gcctagctcc taatgtttgc cttgccgcct agctcctaat 25260 

gtttgccttg ccgcctagct cctaatgttt gccttgccgc ctagctccta atgtttgcct 25320 

tgccgcctag ctcctaatgt ttgccttgcc gcctagctcc taatgtttaa caacattaat 25380 

gtttaagcac actaaaattt aaagacgttt gtgttggttt ttatgaccag cttggtacaa 25440 

aacctgctgg tgatttttta cccaacaaat aataaataaa aaagttaaaa cttatttctg 25500 

ttgtctgtgg taattgggtt ccccgagggt cccgggctcc ccgttccccg agggtcccgg 25560 

gctccccgtt ccccgagggt cccgggctcc ccgttccccg agggtcccgg gctccccgtt 2562 0 

ccccgagggt cccgggctcc ccgttccccg agggtcccgg gctccccgtt ccccgagggt 25680 

cccgggctcc ccgttccccg agggtcccgg gctccccgtt ccccgagggt cccgggctcc 25740 

ccgttccccg agggtcccgg gctccccgtt ccccgagggt cccgggctcc ccgttccccg 25800 

agggtcccgg gctccccgtt ccccgagggt cccgggctcc ccgttccccg agggtcccgg 25860 

gctccccgtt ccccgagggt cccgggctcc ccgttccccg agggtcccgg gctccccgtt 25920 

ccccgagggt cccgggctcc ccgttccccg agggtcccgg gctccccgtt ccccgagggt 25980 

cccgggctcc ccgttccccg agggtcccgg gctccccgtt ccccgagggt cccgggctcc 26040 

ccgttccccg agggtcccgg gctccccgtt ccccgagggt cccgggctcc ccgttccccg 26100 

agggtcccgg gctccccgtt ccccgagggt cccgggctcc ccgttccccg agggtcccgg 26160 

gctccccgtt ccccgagggt cccgggctcc ccgttccccg agggtcccgg gctccccgtt 26220 

ccccgagggt cccgggctcc ccgttccccg agggtcccgg gctccccgtt ccccgagggt 26280 

cccgggctcc ccgttccccg agggtcccgg gctccccgtt ccccgagggt cccgggctcc 26340 

ccgttccccg agggtcccgg gctccccgtt ccccgagggt cccgggctcc ccgttccccg 26400 

agggtcccgg gctccccgtt ccccgagggt cccgggctcc ccgttccccg agggtcccgg 26460 

gctccccgtt ccccgagggt cccgggctcc ccgttccccg agggtcccgg gctccccgtt 26520 

ccccgattca acgttaaagt ggtttattta gcaaacacac gttttttgcc gcggtttggg 26580 

ttgtacatac acgcgggtgc gcattggccc gcccatgcaa aaaaataacg ttgggtttcc 26640 

gaaacgcgta ggtgctttag tgtttcatgg cacgttttcg tgtcacggta ggagtggtta 26700 

gttttttggt agcgaatgca accaggtagc tcgcctgacc gggaaaggga acgttttaaa 26760 

ccgcagtgtt accgcggtgc ggcatacggt ttaacgtgca actgccggtc ccgcgcgtgg 26820 

taccttaagt ttacgtgtag caattatggc ggctgttcag ggccctccgc cgccccctga 26880 

ggaagaaaat gaaaattctt tgccagttga cgtttatgct atagagggca tctttcttta 26940 

ctgtgggctc gggcaggcgg agtacttgca ccatcccgtg tttagcccta ttaaggaatt 27000 

tatcagcgcc tttctcaagg acagcgctcg cctgtacgag aggctcttgc gccacaccga 27060 

ttaccgctct ctgcggggac taaacgccat aggccaaggg atgctgcaaa taaacacgga 27120 

cggacgccac aactggggtc gcgctttggc cgtgttgggt cttggtgcgt atgtggtgga 27180 

taaggttaaa gacgacgagc gtcttttaac gttcgccata gccgttctac ccgtgtacgc 27240 

gtacgaggcg ctggagtctc agtggtttcg ttcacacggc gaatgggagg gactcaggaa 27300 

ttactgcgag cgaatactga ggcatcgccg caacgcgagg agacacatgt gctacggagt 27360 

tgcggctggt cttctggcgc tagtggcgct gtttgccatc aggcgatagg tgtgcgttta 27420 

accgcgtatt cccccaccct gaccttaagt gagccgattg tatatgagac caataaaaca 27480 

aaacaagcta acgtggtatt cgttggaaca ttttttattt gagcagttcc tcgcagaaca 27540 

ttttttgtat ctgtgacacg ggggccggct gtgcgctggc ctccaccggg gcccgcgcgg 27600 

actgtccgca ttctggatcg aggggcgggc acgcgccagc gcggggggca gaggctgaag 27660 
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gaatggcgtt tgacattaca gattcttggg cggcgggggc tggagttgcc ggctgcgtgg 27720 
cgagaccggc gtggtggctc gccggctgcg ggggcggtgg ttgcgtttgc tggatgccct 277 80 
gcgcgctctg gtacggctgt ggggcgtgga gatgggtgtc cgttccggaa taccactgcg 27840 
gctggtacgc taggtacggc gcagacgcgt aggtctgtgg ccacgagaac tgaacgggcg 27900 
gtggttgcgg tcgttggtcg gcgtcctgtg cgccggaggt ctgcgacgcg gctcgtttta 27960 
ggtccctcag atccgcctga atgtccaata tgtttttgga cagcgccatg acgtccttgt 28020 
gtatgccgac ctcctcgcct ggaaacacgg gctcgtcgga aagctccacg tcccgtttgc 28080 
gcttctggtg gcgagacggt gccggagcgt agccgtacgt cggagaatag tacgtggccg 28140 
catacggaga gggttggggg gcaaacgccg actggaaagg gaacggtggc gcgacgggcg 2 82 00 
gtgcgtacga tgggacggtc agctcaggcg gcggtatcca cgatggatac gccgccgttg 28260 
ccggcatcga gggtggcgca gaaacgtacg tgcggtggcc ctggttgcgc atgttatcga 28320 
gactgctctg caccatggtc agaaaggtgc tcttgggaat ggtgatgttg tcctccggga 283 80 
gggcgctcat ggttcgggtg tcgcggttac cgcagtgctg agcgcaggga aactggcttg 28440 
cctttaaata cgtacttctc gcgactctgg ccacgccttt atctgttttg agcaggtctg 28500 
tgcgatctcg gataaatccg gcgtcgatcg ccttggccat cagtgtctca agaggggccg 28560 
cgaattctgg cggtgtcacc tgacgttcgc aaagatctaa acaattgaec gttattttac 28620 
cggcttcctc tcgcgtaaga gaatcaaact tggagacgac ccatgtaggg tctggtccgt 28680 
aaacggcgat tgacccccta cgcctcccca gtgcgcacag agagacgtgc tgaaacgcgg 28740 
gtgggtgatt ggggtcttgc agctcttctg ggtggaggga cgagagggag agctccggga 28800 
gccatgtgtg taacatctcc aacagggggt cccgaggcat ggggtttagc tggatctggg 28860 
ccgctcgcga gtcccccgcg agccgtgacg ccagggccaa gaaatcgtgg ctcgttagct 28920 
tgcccaggca aaatatcccg tgtgttactt ggaagagacc gattgtgtga cctacgtggg 28980 
cttcggggac gtgctctatg tttatcggta gagggtccgt atacgggagc agggtcgcca 29040 
cgattgaggg ctctaaatac agctcctttt ctatctttgg taggctgacc acgtccacgt 29100 
atcccccaac gtacacggga gtcatgttgg gtcggtgtct gcgtgagccg aaggagatga 29160 
gcgccggtct tcgggggctg atgtggcgcg ttatccgttg tgagaattta aatacgtttc 29220 
tccctggcga gctcaggttt cttcacctgg tgctgtgcga aatgtacaac tacggtctta 292 80 
acgtgtacct tctgaaggag gcaattgcca ataccggaac gagggacgat attgtcctcg 29340 
ggcgtaaggt tccggtggag ttctggaaaa tcgtttacga tgggttaaag gagatggggg 29400 
tttctgacgc aacgttgctc tcagaaacca aaaggggggc tctgtggctt tacttcaacg 29460 
ggcgtccgtg cctgttaaaa ggtctaggcg actacgtttt ttgccaactg gggttaagcc 29520 
acagcgtacg cgtggttccg gaaaacctaa cggatgggaa ttatctctat aacctaggca 29580 
gcgtcatacc atgtcgcctg cttgtggcac tgagctactg tttggcgttt tgggggcacg 29640 
ctgaccacga gccctgggtc aggttgttcg ccggcaagat ttttatactg tacttaataa 29700 
tttccggcca cattatgcca cggaagtcaa tcttagaaca ggtgggaacg tccgggtacg 29760 
gtggcttcgt cgaggcagtg tgtcgcgatg ttcgcgcggt acacgggatt ccggcgtggg 2 982 0 
attttgcgtc cgcggctccg gcgttaacgt ctcaacagac ggactacttg tttgcgttca 29880 
ataacagcgt agtgtagggt cgcctcacac taccgacgtg gcgaccggca ggtatcccag 2 9940 
gaccaaaaaa taaactctgt cgtagtcgtc gtaaactccg aattggcttt tcattatctc 30000 
gtgtgccaga ccggggggcg atgggtgcga aaggatggcg gccaggcctt cctcggccgt 30060 
taaacgtagc gacgtttgag cctgaatcag gctgccagcg tccctgaacg tgagcttttg 30120 
gttcaggatg tcaaagattt cgttaaatct ggaaccggtc aggttgacga acggacgagt 30180 
gtgttttaga tgggatatgg cctttgcctc cagcgccgct agatacgcac ccgggaacaa 30240 
aaacgacgtg ggggcgtctg gcgcgcgcga aagaattggg gagatgtaat tttctgtcag 303 00 
gaaatcaaaa agttgtccct ttttaaggta ttgtttggac ccggacgggt cttctacctt 30360 
ggaaaatctc ccacccagga gcgaggccag gttgaggcgc cggtggggcg atctgggtgt 3 0420 
gatttgggcc agctccaata gcacgtaaac caaaactaaa cagcctagtg ccgagaggcc 30480 
ggctcggagg ttccattggt acgcttgaaa gggaggcacg tcttcgctga acaaagtggt 30540 
cgcaaggatg tatacgagat ctggtccagg tggctgaccg gtaccggggt tcttggaacc 30600 
gggtagcagt ccgctctgtg ccaacgtcgc gtatatcacg tggtttgtaa agaagtccgc 30660 
gttgtacttt gtttctttgt ttaggggcgc tagagaggcg cgttgcctcg gattcgaata 30720 
tgtaaactcg aaacccaggc ccttcgtgtt ttgttttagg tgagagacaa gcgtcgccag 30780 
ctttgtgccg aggttatccc agaggccgtt tcgagttggc ggcaacgggg tacctgtccg 30840 
gtgacagtag tacccgttga ggaggcacag cgcggcctgc gtgtccgata cgttgcggac 30900 
atccgcgtaa aagtcttgtg tgatcgaggc ggctgttgag attacggcag acattagctt 30960 
tagagacgtg gagttttgaa agtttgttgt gcctcgcaaa tctctgggga acaccctgcg 31020 
ttgcatggcg ttatctgtga ggctactgta ccacggcccg tattgtggca gccattggtt 31080 
ttggttcatg tacaggcttg gcaggagctc cagacggaga tcctggttta ccgtgaatcc 31140 
cgggtctccg ggggctattg tgacaacgaa cgcgtctcca tgtgtctgcg gggcgatttt 31200 
gggttgcggg tttagtgatg tttgggcatt ggcggtttcc ctgcccgggg accgcagatc 31260 
cccgaccatg ttt tgaattg tttctaggtc gcgcatgacc acttctgttt cccgttggtg 31320 
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cgtttgcagc 
tagcgccagc 
tttgtgaata 
cgctggtttt 
taaaaaaccc 
actcagtatg 
accagttttc 
ggtgtaagcc 
ttaggcacgt 
gcacacaatc 
ttttggttgc 
ggtgtttgat 
aggccggaag 
aaggagggtc 

gggacgagtc 

atgatgtgtt 
acgtacccac 
cacatccgag 
tgtccgcgtt 
cgcggcagct 
cggagcgtac 
gtaggggtgc 
gtaagactaa 
atgctaaggt 
ctatgacgga 
aaacaacgct 
ccgagcccat 
tagttaagca 
gcaagtttgc 
ggctggtggg 
tgtccgcaac 
atctgtttca 
cgctcaacag 
gaatcacgca 
gggtgatgat 
ccatagagga 
aactacatga 
acgtgacctt 
taattgttga 
ggcaggtgat 
gctacattca 
tttactataa 
gagttggaat 
acatctattg 
actttggaaa 
tatacatttt 
agttctttgc 
gttttattaa 
gcggaaaaaa 
tccggcatta 
aatatgcaat 
tatgtatctt 
gcgccgaaat 
tttgtttcag 
gctattgaag 
ttttttgaag 
gttaacgttg 
aggct ctgct 
gcagagtccc 
caagaggtcg 
attctaaacg 



acgttgtcta 
tgttgatttc 
aacagaacgt 
cgtgggtatc 
gttaagcgcc 
aattgtttta 
gtgtcctctt 
ctgccatcgc 
cttgagttca 
tgctattcgc 
cctgagtgtt 
taaccgtcgg 
ggcttcgagg 
tggctttggc 
ttccgacgat 
ccccgtggta 
gtctccgtcc 
gtgcccgcct 
ctcggctaga 
taccggggct 
tactacacgc 
aatctcggga 
aacgtcgctg 
tgatgtaccc 
atatagaaat 
attgaattca 
gaaattttgg 
aggcacccac 
gctaccgttt 
aaacggtagc 
ggtggttttt 

aatcctatct 
ttcggaggcg 
aaactacttg 
gcaatatctc 
catatgcaat 
acagagtatg 
aatggaggtc 
cgcaggagaa 
gtcaaatgag 
aacgctaacc 
taaggtgttc 
ttaatggaac 
taatggaaga 
aaagaaaaag 
gtaaatcttc 
caggttttac 
cgaaacgtga 
taaatgcaga 
taacagaaaa 
gtgttgcact 
atcgtgattt 
cttataatac 
ttatagatag 
caaagggtgc 
ttactgcagt 
ggtgtgttgt 
acccaacgtt 
aggtttttaa 
tattttcaat 
aaattgtata 



gttctagccg 
gcatcacaaa 
tgtttggatg 
tcatacagca 
gccgacgtgg 
agtgttttct 
ggcgcggctc 
tgtgatattt 
attacgtagc 
ctacctaggg 
gggcgaattg 
tatactgcta 
gtcgcgacgt 
gacgagcttg 
actgatgacg 
gatacgcacg 
ggaacgccgt 
aaaagggcgg 
cttcaatatg 
cagttttctt 
ccggttgaaa 
ccctcgtcga 
agtgttagtc 
aggccggtta 
gcctttttaa 
atgactggca 
acgtgtgttt 
gggaaattga 
cgtgcgacag 
acaaagccag 
ccgctggttc 
cttttctcgg 
cacaggcgca 
cgacaggtag 
acaccggagc 
atgaatt^cca 
atacccatgg 
tgtttgggac 
catctagatg 
gctattaaac 
aaattggaag 
tgtaacggac 
aatttaccaa 
tgcgtggtac 
tttgctgcga 
aagtgaccat 
tgggaatttt 
attatttacc 
aatgttcaag 
ttggatatta 
gttgtttggt 
attggtcgtt 
tcttgatttg 
tccgcttaat 
ctgtcaaaat 
aaatttttta 
acggcatgtt 
taagttatat 
cttgccggca 
gtttaaaatt 
tattactaat 



gagcaggttt 
cgcgtttctt 
gggggcccaa 
tttttctgac 
tgttgataaa 
ggttcttaaa 
tgcctatata 
ttattgcgtt 
acgttttttg 
aaacctcgaa 
aaagaaactt 
agtgggcgcg 
gcttacattg 
tccgtcagat 
ttgacaccga 
gcttaatgag 
gggaattact 
tggtgccggg 
ttggacgaca 
ccgagagcga 
gcggtgacaa 
caaaacctag 
ttaaaaactt 
ccgtgccggt 
tctatctaga 
tggtgccgca 
attcaaattg 
tcacttctgc 
ccgccggcat 
cgaattggat 
acgtaaagta 
cacacgacgg 
ttcaaagtcg 
cgtgggcgta 
aaatggttca 
gattgactca 
tggcagaaat 
tctttaaaga 
atgcgtgtgg 
cacgagcagt 
gcaatggcgc 
aaatacgtgt 
attaattgga 
gattctttac 
tcaagtatag 
gtaattaacc 
aatgtaatga 
aactcaacaa 
attacttttg 
gtatcagtta 
gttcctagca 
aaaaattcaa 
gcatttttac 
gccattgatt 
ccttcaaatg 
tttataaaaa 
gcagcgttag 
gagctaaatt 
aatagcatgt 
gtatataata 
tacatgtaca 



tttcgcagcg 
ctttcttcta 
aaagttgcgg 
gtgcgcatct 
tgtaggcgga 
tagcagccag 
tgccacggaa 
gccgggaaat 
atgttcgcta 
aaaaatccta 
taaaatatca 
agttccagcc 
atttgcaaac 
gcgggatcgg 
aagcaccgat 
ccctggaagt 
gcacccggac 
cggtggcgcg 
gagttttggg 
gcacgaatat 
gagaaatttc 
tcacggtgcg 
gctgcggata 
tcatctcatg 
gggggtcatg 
ggaaaacgtc 
ccttaaagaa 
tcgcgtatac 
cggtcgcaac 
tgtttttgac 
caacaggcta 
cgatgtggtc 
aggccgtaag 
ccatgccgtg 
gctgtgtgta 
tcggtttcta 
gttagtatcg 
gctacgaaag 
cctctgggga 
gaactggcca 
gtattagttt 
atgatgaaaa 
gaaacgtaag 
tgctggaacc 
ttaatgtgca 
ttacggtgga 
cacatgcttt 
acattatgga 
attattccaa 
caaattcaag 
catttccagc 
actatgctct 
caaaaaattt 
atctcaaagg 
aaaatgatat 
atcttcaaaa 
aaagtttaat 
tggaaacgtt 
tatcgttatc 
cacctaaggt 
ccaaatacag 



tgtgaaccct 
tcagagatgg 
taatttgccg 
tcccattggc 
aacgctgatg 
ggccttaccg 
tctgtcagtt 
acagccgcgc 
ttaacggtaa 
aatttaacgt 
tcaagcgacc 
cgtttgcgtg 
gccatggccg 
aagcctagat 
cttgagtatg 
caaaactatg 
gccctgtatg 
cgtcccaagg 
gatagagaga 
gcggaaatac 
acctccgggc 
gggttaacac 
aaggacgatg 
cagccgcatc 
ggtgttggaa 
ttaagctgtc 
cagcgcagca 
gcgtgtcaga 
ctgcaaccgt 
agacacctac 
acgccggatc 
gttttgctaa 
gaagaaaaag 
ttctgtacgt 
caaactgtgt 
accctaacaa 
gttaaagaac 
cttcaaattt 
aatatttatg 
gctcttgaaa 
tatatttttt 
atcaaatgta 
caaagaat ta 
attgtctgtg 
taataatgac 
ttttaattac 
aacacaaggc 
tcttttttac 
tgtaataatt 
tgtaaaatca 
gttaaaaggc 
gggtgtaatt 
tacagaaatg 
aaaacttttg 
cttatctttc 
gcagcaatta 
gcatttatta 
gtcgcatatt 
agtaaatgat 
tggtggtaaa 
tgaaaattat 



31380 
31440 
3150.0 
31560 
31620 
31680 
31740 
31800 
31860 
31920 
31980 
32040 
32100 
32160 
32220 
32280 
32340 
32400 
32460 
32520 
32580 
32640 
32700 
32760 
32820 
32880 
32940 
33000 
33060 
33120 
33180 
33240 
33300 
33360 
33420 
33480 
33540 
33600 
33660 
33720 
33780 
33840 
33900 
33960 
34020 
34080 
34140 
34200 
34260 
34320 
34380 
34440 
34500 
34560 
34620 
34680 
34740 
34800 
34860 
34920 
34980 
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cagctaacca 
aaattaaacg 
aataatgtag 
tttcgcgtgt 
cgatccgacg 
ggtttttggc 
aattgtacca 
tacgtcataa 
cttaacagcg 
ggtaccgccc 
ctgtgtgatt 
gtgtcaaacc 
gataattttc 
ggattgtaca 
atgtctgctc 
gcgtaaaact 
ttagcgagaa 
acttgaatgg 
gttaataatc 
tctatggagt 
gattcgaagt 
ttaaagtgaa 
aggtgttgga 
tgcataattg 
gaacccatgg 
gtcgctggcg 
cataaggtcg 
gttatctgtt 
aatggcctgc 
attggtgtgt 
ctctcgcctt 
accataattc 
gaatagatgg 
ccgaccattg 
agctgcaccg 
cctttggtag 
tctccagtga 
aaacgcgtga 
taaccgtatt 
cagaatagcg 
cgtaccaagg 
ttccaccaaa 
ctccagcgtt 
tattgggttc 
tgtcgcagtt 
agggaaatta 
aagcccgtgg 
caccggtaaa 
tatgttaacg 
gaatgacggt 
gtcgataatt 
catattggta 
gaacatgtct 
gttttgatcg 
cgaaaacatg 
atacagcctg 
gaatattggt 
taccttgtac 
aagcgattga 
ctgaaatttg 
aataattacc 



acacatttag 
tcacagat tc 
agatatcata 
tttcgccttg 
caccgcagac 
ggctgttgca 
ggctggcatg 
gctccgtgcg 
cgatgtttgt 
tacacattcc 
cgattgtttt 
ccacagttca 
atatccatta 
gaagacacgc 
tgtattttct 
ttaaaaggtt 
cctcggatag 
ccagctctcg 
gctgcggagc 
ttaaacaaaa 
attctctctg 
acagaatatt 
taggctttag 
tcacggtgtt 
caaaaaatat 
gttgcttgag 
gtggcgcatt 
tgttatttgt 
agtgcaaaaa 
cctcaaagtc 
cgtccgttcc 
gcagcattga 
tgctgaattt 
cggcaagtgc 
cccccccaac 
tttcaggaac 
agtcccaaac 
ccggtcgtag 
tttttacttc 
cattcgtgtt 
cgggcggcgc 
cactcgataa 
gtgaatattg 
aattcatccg 
gccgccttaa 
gtggtccaga 
aggtctaacg 
aagcacctga 
ttaattgcag 
tctttcacgg 
gactcgaagg 
ttgaaaactg 
ggaagaggtt 
gcgttaaaaa 
gcctctagga 
tcgcacgaat 
acctcgggtg 
ggttttttgc 
tttactatgt 
ttttgctttg 
gtttggacgc 



gcggaatgta 
ttctgtgttt 
tatgattaat 
ttttttaagc 
gtcaaaaaga 
cgcttttcac 
gaagcaagta 
ccctgatcac 
ctcggccgtt 
gatattgtat 
gagctatgat 
agccaacctg 
tttgtggtta 
actcagtgct 
gtttaagctg 
gtttcccgtt 
aggcgtgccg 
attttgggga 
ccgcgacaaa 
caccatgtct 
atacgggccc 
ttccaaaccc 
ttttaaatta 
aaacgcggta 
gtctgccagt 
gatggatcga 
tggatcgtat 
aagagttacc 
aagagagtta 
agaacttcta 
cacgcacaag 
taattccgga 
tttaaacaac 
cgacgagtac 
taagcgagcc 
cgcagcggcc 
gtctaatagt 
ctatgaaaga 
gcacgtatat 
tgcttttggt 
ggttcagggc 
aacgcttgtg 
tgtttttcac 
aaagtttttt 
accaggaaac 
aaacgtctcg 
tagagttcca 
ggtttcccat 
ccccgttgca 
agaatttttt 
taccggcgag 
agattttgaa 
cggtctggtg 
taaaattgta 
cgcgaggcgt 
ggcacacgtg 
gaaacgttag 
cgggatcct t 
tttggtagtc 
agaggttttt 
gatgcagttc 



atgaacatgt 
tatccgtata 
caaatagcaa 
ttaaggtttg 
accggctcgg 
gccacgcgta 
accgcgctga 
gctcgcgttt 
tatccaaatt 
aacttttccg 
gagaaccagg 
ttttccccgt 
atgaacaacg 
attgcattag 
ttttccatcc 
tctttttgga 
tcacatattg 
ggaaacatta 
actcgaaaat 
tccaggcgct 
gccgccttac 
aaacgctgca 
ggtaaaaaca 
atcgggctct 
aaacagtccg 
agttttttat 
tcttgcgtgg 
ggtggggtct 
ctcgcaatca 
attactgggt 
cagctgttta 
cgttttgggg 

aggggaaggg 

acgacatgac 
tttaaatctg 
ttggcttcgc 
gtgtcgaatt 
cataaaattg 
ctttggaatg 
gtgactaatg 
tcccagatcc 
tagagtttgt 
ctggtctatt 
aagtaagggc 
gttgcatccg 
ccgcccccgt 
gttcgcaatc 
agcgaacata 
caggttgtcg 
cattacaaag 
agaccagcac 
gttgtgtaaa 
gtcgtgacgc 
atctctatct 
ttcgcctggc 
ttccgccaca 
atagtacgtt 
tttgttaatt 
tatgattaat 
taaaagtacc 
gggatgcgaa 



atgaagtttt 
tattgttcac 
agcccgatga 
atttagatga 
aactggcaca 
taaacgagtt 
tgccgcttac 
acgaggtttc 
gttctcattt 
ccccgaggat 
gactacagac 
attctccttt 
gtaccgttgt 
tatttgcttt 
tggcctaaac 
tggttacgcg 
attcaatcac 
aaaaatattt 
ccttctgaaa 
gttttacgcg 
acgtattcgc 
ataggacgtg 
ttatagagtt 
cgcctatttg 
actgtacatt 
tatccggcgg 
tgtttgccag 
tgacggtcgc 
ctaattttga 
ctctgaacaa 
gcgcgtaggg 
cgtacaatgg 
gtacggccgg 
agacgtactc 
agatttttat 
ggttatattt 
acctgcggtt 
gcgttgcgtc 
aggttgcatt 
atgcgttttg 
aaacgcaaaa 
atttgcgctt 
ctagcatatg 
ctaccgtata 
tctttactag 
ctaactatgt 
gaaaccggga 
aagtgcagca 
gagtagaaca 
ttattagtga 
gtgattttgg 
ctcatgtgct 
gcttgttcat 
tttggtgcgt 
cccagaaact 
gacatgcaca 
gactttggcg 
ttcacctgta 
tcgctcgcca 
ggattggatt 
atccaatccg 



aacaacaatt 
atccatgtgt 
catcaccata 
aaacaagcta 
aggagcgtct 
ttcagtcata 
gaacatcacg 
ggaggtgttc 
tacaccgccc 
aggatgtccg 
catgatgtac 
ctttgacaac 
ggaaattcgg 
tattggaaca 
tgttaataaa 
gggactgaga 
ctctgcgcca 
gtcggtcgcg 
gtcgtgggac 
cgagagcatg 
gtatagcatt 
ctttaaaaaa 
gttggcgcgc 
tggctggcgc 
tgactcgctg 
aagcgcgcta 
ctcgtttatg 
cgggacggaa 
gtttacggta 
tgatacgcct 
atggggtgac 
catcatgttt 
tcccggaagt 
gccgttggat 
cattgatttt 
ttgctttatg 
ttaagtttga 
ctaactcgtc 
ttttatatcc 
aaaagtcaaa 
acgaacaaca 
tatttcttgt 
gaacgcttaa 
ttttagaaat 
aaaccacaga 
attggttttc 
aaatgtgctt 
tgtccccgct 
cactcacggt 
gcctggggat 
tattaattac 
taactggatc 
atcgatctgg 
attgaattcg 
cggtgttagg 
ttgcgacgcg 
tcacaacaac 
gatttacgtc 
ctttgatgtc 
cgatgttgtg 
aagtaaaacc 



35040 
35100 
35160 
35220 
35280 
35340 
35400 
35460 
35520 
35580 
35640 
35700 
35760 
35820 
35880 
35940 
36000 
36060 
36120 
36180 
36240 
36300 
36360 
36420 
36480 
36540 
36600 
36660 
36720 
36780 
36840 
36900 
36960 
37020 
37080 
37140 
37200 
37260 
37320 
37380 
37440 
37500 
37560 
37620 
37680 
37740 
37800 
37860 
37920 
37980 
38040 
38100 
38160 
38220 
38280 
38340 
38400 
38460 
38520 
38580 
38640 
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JL ii.J IJ ?y 



caaaatgtta 
tagcgtgagg 
cagatgt tec 
aaggtaacga 
cgaacagtac 
ttcgtaggtg 
gatagcatca 
accaccttgg 
acggcacgcg 
gttttcaaga 
ateggaattg 
aacagggccc 
ggecaeggag 
aagttttcaa 
tttgggcgtg 
tgcgtgcgtt 
taaggtatct 
atacattatc 
tgatgacatc 
eggggcegta 
cctggtagat 
aacactatca 
ggtgtccatg 
cgccgtcaag 
tcacgaaacg 
eggagtcatg 
tgaeggggge 
cgtaaccgcc 
agtggaccgt 
gcaggactcc 

ggtggctttg 

caggegtatg 
ttccacgtcg 
gattgtaaat 
ctgtcatccg 
agatccegga 
ataegggctg 
ectaaaggeg 
tttggaggtt 
gtatagggcg 
gtttcaaaac 
ccagtcaacg 
ttatgtcatc 
aattgeatta 
cttccctatg 
gtacgcccat 
aatcgeggge 
ggaccccaat 
gtcctcgcgg 
ggataegtae 
ttacgagacc 
taatgaaaac 
agcctgtacc 
cctgacgtac 
ccacctggag 
tgacatgatt 
ccgtgttgtg 
cgtgagccag 
tgacgttgcc 
ggtggcggcc 
cgtgccggtc 



aattttaagt 

attttgaatg 

ggcagtgtgc 

cagaggattc 

agegacegta 

aaaacggtgg 

tcaattatta 

cggtgtataa 

taaaaggtgg 

atctcattat 

gccggtgacg 

tgeaacagea 

gccaatctgc 

ctgctgttgg 

tataccaacg 

aacaccgagt 

gtaccgacca 

atgaaggect 

gagctgetat 

aagacgatta 

acegtattga 

gacccggtct 

ttcaaaagct 

ggaaagcagt 

gtttttaagg 

gagaccaegg 

atgtccggtc 

gtgacgtacg 

cccaacgcgc 

ctcgccaaaa 

gaaagtttac 

cactatactt 

gecacaatta 

aaaaacaacg 

cgcatgcaca 

cacgttcaca 

gtgtacaact 

acgatgacca 

cacccgatgt 

acccacagga 

ageagaggee 

atggaaatta 

gaatgectea 

accattgaaa 

ctgegattta 

taccgaaaag 

cacgaggtgg 

cttttgccgc 

caccccatgg 

ateaaegtea 

agaaacaacg 

gaacagcaca 

aacggtcacg 

aacggcccag 

aaegggaege 

cgaactct tt 

actcageggg 

actgtcctcg 

gagaccatgt 

acgctccacc 

gcgtttaacg 



aattcagatg 
gctctaaaag 
gegtaaagge 
gaaacatget 
ggattcgttc 
gtaaaaacgt 
aacccggtcc 
aagggtatag 
tctctgtttg 
tggtggccag 
gggcgttcgc 
taatggaggc 
ttcgtcaaat 
gaaaagaege 
taattcagtt 
ttaaggacct 
tegegtatgg 
gcaataagca 
tcattgacag 
ccgcctctct 
atgttaagct 
acaccgaacg 
accttatgga 
acgtgctgtc 
ggacgaatac 
aaaacgtaat 
eggegtctta 
gtcgegtcat 
agcccagcgt 
caccgatcgc 
agcgaatgta 
attttttccc 
agggggtcga 
tactgetgag 
acccgatgcc 
ggtaeggaca 
attaccaggg 
cagatgagtt 
ttgatttttt 
ctatggtggg 
tgcagtttga 
tccaagatac 
ttcaeggaca 
cttactggaa 
tetgegttea 
tttttggcga 
tggggcgcag 
ectttgetta 
tactcatagg 
gaggaaaaat 
eggatcatga 
tggctgtgct 
tetgeggcat 
tgtttgctga 
tecgegagat 
geaegtegtt 
accccgcgca 
ttaacgggtt 
tttacccggt 
cgctggtcgc 
tccccccggc 



cagttttgea 
tttaaagtaa 
agcaaagcat 
ttgataattt 
gtcgctttga 
atgegtcteg 
cgcgtactcg 
tccttctccc 
catagccctg 
gcagcaaaaa 
tgccaccgta 
ggctctggag 
gaaagagtcg 
cagggaggga 
tgtgaagttt 
caaacgaatg 
ggaeggcagg 
tcacattggt 
agaaacccca 
ccagtttggc 
taggtccgcc 
gggtctaaag 
taactcgttt 
ggttctctcc 
gtatctgtct 
gcgaaaactg 
cgccaattac 
gcgaacgttt 
cgacgatgat 
ggcggctgtt 
caacgagacc 
cattgggttg 
gcacccagcg 
tttcaattat 
gtgtgggcaa 
aagatctgag 
caaaaaegtg 
gctgcaccca 
cgttcatcag 
taacattcca 
cagageggeg 
ggcgtttgac 
ggaagacaaa 
caatgeegga 
cctgggcaac 
actcgtggtt 
gcccgcgtcc 
caatgaegtt 
cgacgagggg 
ggaggaccta 
cggccgccac 
ggaaaagctt 
gggegtcgat 
cgtcgtgaac 
gctcgaggct 
tctcacctgc 
actgttgacc 
cgcggcgtt t 
gccgttcacc 
aaattacgtg 
cctcatggcc 



caegggggeg 
ttctgataac 
tcctcgaatt 
acgccacata 
accaaaaccg 
atatgggaaa 
ttaceggtet 
aataccagac 
tccaaggcct 
attgecagtt 
gcaggacacg 
gtacgaccct 
gcagccagcg 
ggagttcaat 
ctggaaacgt 
acggatggaa 
cggcccacaa 
gecgagatag 
ctcgattaca 
gtggacgcgc 
ccgccgatgt 
aaggctgtta 
ttcctcgaca 
gacatggtgg 
geategggag 
ttaaacatgc 
gttgtcaggg 
gaccagttca 
egggacgegg 
atccagattg 
cagtttccct 
cacatgcccc 
gaacaatccg 
caaaacgcgc 
gctttgggtc 
caccccccga 
gcacacgttc 
acctcgcacg 
cagcctggtg 
caacccctgg 
gccgtggctc 
acgtcgtacc 
tttttgatta 
aaactggcgt 
ggtagtattt 
ttgeagcagg 
gagctgatta 
tttaccaacc 
tacgaaaegg 
gtcggtgaca 
gtccttgacg 
ttttattacg 
tttgacaacg 
cccgacgatg 
teggatatae 
ccgtttgtta 
actcacgacg 
getategcag 
aagctgtaca 
acgcgcctgc 
gagtacgagg 



gacaccgagc 
aegtgecgat 
cccatatttg 
ccattaatgg 
tgtcggacag 
atgtacatcc 
tgecaacgat 
gccggagcgc 
etagegagae 
gagttttaaa 

c ggg ca g aa g 

ttccatatat 
gactttttaa 
ttgagggtct 
cgctagcggt 
aaattcagtt 
aacaaaaaca 
agctgtcgac 
cagaataege 
tggagagggg 
ttattctaaa 
agtcagacat 
aatcagacat 
gggcggtgtg 
agecaattge 
taggtcaggt 
gegaaaatet 
tgaaaegcat 
tggeggaegg 
gagataaact 
tcccgttaaa 
gtcctcagta 
tegaaaegtg 
tcaagtccat 
aggegtttec 
acatgaacct 
eggatgtege 
agaegctteg 
cgcaagccgc 
cgcccaacga 
acgtgctgga 
cactactctg 
attctccttt 
ttattaacag 
etaaggaegt 
cgctctcgaa 
actgtcttca 
tgetcaggea 
aaaatgacag 
tggttaacat 
tcggtccctt 

tggttctgcc 

tggccctggc 
agattttgga 
accccaccgt 
cccaggcctc 
aegggagata 
ataggtctcg 
gcgatcccct 
cggcccagcg 
agtggcacaa 



38700 
38760 
38820 
38880 
38940 
39000 
39060 
39120 
39180 
39240 
39300 
39360 
39420 
39480 
39540 
39600 
39660 
39720 
39780 
39840 
39900 
39960 
40020 
40080 
40140 
40200 
40260 
40320 
40380 
40440 
40500 
40560 
40620 
40680 
40740 
40800 
40860 
40920 
40980 
41040 
41100 
41160 
41220 
41280 
41340 
41400 
41460 
41520 
41580 
41640 
41700 
41760 
41820 
41880 
41940 
42000 
42060 
42120 
42180 
42240 
42300 
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gtctccaatg ctggcctacg ctaacacctg cccgatgacg cccacgtcgt tgagcaccct 42360 
ggcgagcatg cacatgaagc tgtccgcgcc ggggttcatc tgccacgcaa agcacaagat 42420 
tcacccgggc tttgcgatga ccgccgtccg aaccgatgag gtgttggcgg agaacttgct 42480 
atttagtgcc agggcctcga cgtccatgtt tttagggcag ccatcggtta tgcgtcggga 42540 
agtcagggcg gacgcagtca cgtttgaggt gaatcatgag ttggcatcgc tggacatggc 42600 
gctcggttat tcttccacca tcacgcccgc ccacgttgcg gcgattacct cggacatggg 42660 
cgttcactgt caggacatgt ttctcatgtt tcccggggac tcgtaccagg acaggaccct 42720 
caacgactac gttaaacaaa aagccggatg ccaacgattc ggtggtcctg gccagattcg 42780 
tgagcccgtc gcttacgttg cgggggtgcc gcactcggac aacataccgg gtctcagcca 4284 0 
cggacagctg gccacgtgtg agattgtttt gacgcccgtt actgcagacg ttacctattt 42900 
tcaaaccccc aacagtcccc ggggacgggc atcctgcgtg atctcgtgtg acgcgtacaa 42960 
caacgaaagc gcggaacgtt tgctctttga ccactccatc ccggattctg cctacgaata 43020 
ccgcactacg gttaacccat gggcgtcgca gcagggctcc ctcggagacg tgctgtacaa 43080 
ctcaacctcg cgccaggtcg cagtgccagg gatgtacagt ccgtgtcgcc agtttttcca 43140 
caaggacgct attttgcgta acaatcgggg cctgaacaca ctagtcacgg aatacgcggc 43200 
ccgcctcacg ggaacgccgg cgaccagcgc gacggacctg cagtacgtgg tggtcaacgg 43260 
aacggatgtg tttctagaac aaccgtgcca gtttctacaa gaagcgtttc ccacgctcgc 43320 
cgccagtcac aggtctctgc tggacgaata tatgtcgaat aagctcacgc acgcccctgt 43380 
gcacatggga cattatatga ttgaggaagt ggcccctatg aaaagactat taaagatcgg 43440 
aaacaaggtc gcctattagt ttagctcaga cggtctggag ctaacgagag atggccctcg 43500 
ataagagcat cgttgtctcg gtgacgtcta gattattcgc cgacgagata gcaaatcttc 43560 
agtcaaagat aggatgcatt ttgcctctca gagacgccca ccgtctgcag aatatacagg 43620 
cgctgggtct ggggaacctg tgctctaggg attccgcggt ggattttatt caggcatatc 43680 
actatttgga caaatgcact ctcgccgtgt tggaagaggt cggtcccaac agtttacggc 43740 
taacgcgcat tgatcccatg gacaattatc aaataaaaaa cgcgtaccaa ccggccttcc 43800 
attgggataa ctactcagaa ttggtagtta taccaccggt ctttgggcgc aaagatgcga 43860 
ccgtctcact ggagtctaac gggtttgatg tggttttccc tgccgtggtg ccagaaccac 43920 
tggctcaaac agtgcttcag aagctgctgc tgtataacat atactacaga gtggcggaga 43980 
cgacgcccac cgacgtcaac ctagccgagg tgacgctgta cacgaccaat atcacttaca 44040 
tgggtcgcaa ctacgccctg gacgtggacc ccgttgggtc gagctcagct atgcggatgc 44100 
tggacgacct gtccatttac ctgtgcgttt tgtccgcgtt aattccgcgc gggtgcgtaa 44160 
ggctactgac ctcattggtg cgccacaaca aacacgaatt agtcgagatt ttcgaggggg 44220 
tggtgccacc tgaggtacag gccctggatc tcaacaacgt aagcgtggcc gacgacataa 44280 
cgcgcatggg tgccctcata acctatctac gaagtctcag ttctatattt aatctgggcc 44340 
gcagatttca cgtttacgcg ttctcatcgg acacgaatac cgcttcctgt tggtgtgcat 44400 
ataactagaa acgggcctcc ctgtgctttg acatgtcgat ccccaaaatt atgacggtgt 44460 
ccagagacaa cgagggtacg gtgtgtgaag tcgcggtgga caacggacga cacagagcga 44520 
tgatttatta ccctaagacc accaacttag caaacgagcg cgcggacgtt gttaaggaag 44580 
cttttgatac cgaaacccca gtggacattg taaagcaaat tgttaacgag ggcctagcta 44640 
tatccaaaaa aaattgcgtc cgtttggcgt tgtatttata tttttatttg cagtacgtgt 44700 
gctttgctct gctcctcact tggcagttaa acccgtacat ggacccaccg ggtctggtgt 44760 
ttgcggttaa ccccatgggt ccaaaacatg tcacgaaact accgcacccg gctattgttg 44820 
cggtaggttg tggggcagac gccatctgta agaactgtag cgtccccgat atcaaaacgg 44880 
agcttggaat ggtttaccac aacgggtcta gcgattctgg tcagcgcgca cactatgggc 44940 
tggccctgtt aaaggcggcc tggcttgtca tgggaaatgt gtgtccggaa ccagtagtgc 4 5000 
ggcaaggcgc tgcattactt ggtccatgga accggacgga gtggtcggat tttaaatcgg 45060 
caatggcggc aaccacgttt tgcggatcca gaggcgttct gtggtcaccg attcatgaaa 45120 
aaaacctctg tcgccccacc tggaatgatg taattaacac atcagttttt acaaatgaat 45180 
cactctgtcc aaatatacct gtggtgcccg aaagtgtaat agtgcttaat ggtgatgcat 45240 
gagaacaata aacgtattcc cacgcacttc atgtacgttg tttttattgg tctggtaagg 45300 
tattaacagt aatgggaggt tccgctggtg cctataaagc aaaacggtca tagagtaaca 45360 
atattgtgaa tggggaaaat agtctgtaaa tagtttatcc gaacagtatg actgcacaca 45420 
cgaatggggt tttaaccacg acgggctttt caacaagtca gccggaatcg gttcaagttt 45480 
ctccatttta tcgcgtaatt acaaaacctc ccgttatggg cttgtttttt tgcgtggcta 45540 
tgtgcgttat cgcgttggta tggtacgtga tgcggagggt gtgttgtaag gggcgcgttg 45600 
ttgccgattc gtgtcgcgac ccgcgtcaac ccgcgtatga gatgttgaat gttaggttgc 45660 
gtccccacgg aaccaatcca tagagaactt tcacgtacat agccaaagcg cacgacgtgc 45720 
tctctagctc gtttattcta gcgatttaaa aacaaattta tcgttggttg ccatgtagtg 45780 
ggccatgacg gcggcgacaa gcacgtcgtc cgacatggtt ttttgtttcg cgcagtacgt 45840 
gtgacctccg tcttt tagtg gaatgcaacg tattgccttg atttgatcta ttagatacgc 45900 
tacgggatca aacgatagct taatagtgtg agacaccaca gtttggctag cgctaaacgt 4 5960 
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tcccgagttc aatgcgtaaa taaaagattc 
gtaaatgggc gaacgtataa gagtgttctt 
cgagcaaatc tcgtttaaca cggttgctat 
ttccacggcg acgttaacgt gcgtgatctg 
cgcggcggca catgacgcta tctggtacgc 
atgttccacg cctaataaaa tgcatttaat 
gatgccggtg ccggacgcgt ctgtgttgtt 
acacggcttc atcgcgccct gtatatcttg 
ttgtgacagc gaagagtcgc taacgatttt 
ccccattagc tccgtggaga atgcgccttc 
ctcatcaatc gtaatatagg caggaatatg 
gtcttgcaag ttaaaatcat ctttatgatc 
cttttcgtgg gcgtccttca aattaagtaa 
tgaggatata aaaatcagtt ttgcgtcctt 
ggcatctttt ttaatgaagt tcgcctcgtc 
gatgctctaa aggcaaagaa aaacacaaaa 
ctaaggacaa aatctcggaa gctgactttc 
ttagagatct aatttcatct ggagctgacg 
acggacataa attggaacgg attgttcttc 
cttataccac gatcgctgca aagaatgtca 
atttcataac gtctctaact tataccagtg 
cgggggacgc aactgcgtga tcgtgtacac 
aaactgcgtt ttggataatg tgcaggacgt 
acccgacgcc ttcatcgatc cgctcgtgtc 
tatactgagg tactttgaga cggtcggtgt 
gaatggacaa ttgaatggca tcataggtag 
tccggtaatg agcgacggcg aaggtggcag 
aattatctcc atatactcca ctaaaacggt 
aaataaaaaa tacgaccaca ttgtaaaaac 
aaccggcgat ccgtcgcggg tcagtgcgac 
tttatcctcg cgaacaatat tacgtgcggt 
ttcgctgcat ctggacgcac gtctaagact 
accgacccgg gccgcgggga cggtccgtcc 
cctctgttta ccacaaatgg aaaatttcat 
acgcctgact ctggaagctc gtggaccgtg 
tacagaaacg caatgcgagg tttaaaagat 
gacggtgagg ttcatcctct aaccgtctta 
acgcgacccg tgtccgcgtg caaccctcta 
gtacgggacg actacgtgac acaccaccgt 
caagtaatcg ccccgacgcg caccgagatg 
ggattccgcc ctcccaaaca agccaactat 
ttctcgtctc gcgcatatta cgtgtgcgta 
cagcggcaag atccgcggcc gttgccgcta 
gagggcgatt tgtttttagg agccaagcag 
tgcaggtgtc agaactatac cattaagcaa 
gaaacagtcg acttaattaa ggagcatttt 
gtgtcagaga ggagcgcttg ggtaaaggcg 
tggaaggatg ttttgggatt atgggagcat 
ctaccaacat cggaaccttg caatgccgac 
gatataactc ggtcaattag' cgggcagtct 
accgcctggc tggtgctccc ggggggcttt 
gaggatttaa tgtttgtggc ttcgagatat 
gtttttaaac aaggaatgca tatggttgcg 
atacaccgcg accactgcgc attctgccgt 
gatgtcactt aactttttaa atgttatgat 
gtgcgttacc gtgtacatga atggagatat 
cataagggac gtgcctggta gggcggacct 
tgcgccgagg agcgttccga tacccgggcc 
cgggctaaca aaacaggaaa ttatatacac 
agatgccata aaggggaaag agttccacca 
ggcgttttgg caactgttcc tctctgtgga 



aaacgccttg gctttctctg gccccaacat 46020 
gtccacgtgg tgtaaaaaac taagagggac 46080 
ggccactccg gcatcttgac tgctgttgcc 46140 
cgggtgaagg gtaacgatcg ctcgaattaa 46200 
ggcggtgccg gttagatctc ttagaaaaaa 46260 
caccttgtga tttaccgcaa tcaccgcacc 46320 
ggtgtacgcc gggtctatgt aaacgtggag 4 6380 
tgacgtgctc tttacgcggc acagatccaa 46440 
atgcatgctt tgagcggacg tggcagcgtc 4 6500 
caaaaaaaga ttagtcgtgc tccgtaccgt 46560 
caggcggtag cacgggcacg ccacgacagt 46620 
cggacatacg tagttcacca cgttaagcat 46680 
aaaactagtt gatttgtcag atgagttact 46740 
ttgaagcata aaacccagaa tcgcggggag 46800 
gatatatagg aggtggaacg tctgtccccg 46860 
cgagttacgc gggtatggag aacgatacgc 4 6 920 
aacagtgtca ggcgttcttt caccgtccca 46980 
ctttaaacca ctttagccta tctgaatcag 47040 
tgcttgacct ggtggggaca gaatgtctct 47100 
aatgacgcgc gtcaacagcc caatatgtcg 47160 
tttggattgt aagcgctatc acgtatgcga 47220 
tcgcgaaaat ctagtgtgtg atttaacggg 47280 
atgttcgtac ggtcctccag aacgccgcgt 47340 
acacggcacg agggaatgtc ttaaaagcga 4 7400 
gaaatctgag gcatattcta ccgttgtcaa 47460 
attaatagac gctacgttta acgagtgcct 47520 
agacctcgcg gcgagcattt acatccacat 47580 
atatgataat cttctattta aatgtacgag 47640 
tatcagagcg caatggatgc gcatggtctc 47700 
gggttgtttc acgtgatact tccgcgaggg 47760 
gaacggcagc ggttttttgc acacacttgg 47820 
ttatacgtgt ggggacgggt atttcaaaac 47880 
gggccgtggt ccggactggc gattagtctg 47940 
ccgtttgatg tagttatact caaggccgat 48000 
aagttcttgt atatgtcatt aattgcggct 4 8060 
aaagtttcgc aatgtaccga tgccgccgtt 48120 
aaagaagcgt tggtatcacc ggacactgct 48180 
cagatgttga ccggactctt acagtctagg 48240 
gcgctcgaac gcccaggtaa tgtgagggga 48300 
ccaaacggat cgccaagtcg tgtaaggctt 48360 
ccaaagacgt gggcgcaggc gcgtcacgtt 48420 
tatgataacg aagaactaga taccaagtgg 48480 
gattggtccg atccggtcgc gtacctgtta 48540 
aatgcgtttg tcgattctat agaaaagacg 48600 
ttttttccgg ttttgataaa tagggacaac 48660 
atagaggcgt gcttcgtgat tagaaaccag 48720 
gcgctgtttc gcaacgatag taacacgtat 48780 
gggcctcata agctgggtac ggctataaaa 48840 
gtaaactgga gctggctcct gtgtgacgag 4 8900 
actgtttgct tagttgtctc gcctaccctg 48960 
gttattaaag gccgctacga cctatcaagc 49020 
ggccacccag cgtcgtcaca ttcttaaatc 49080 
acacccgggt acgtccgcgt ttgttcgggt 49140 
ttttgacccg ccggtaacta gcgaaaatgc 49200 
cgtaatcatg aaaccaaaag aatttggccc 49260 
tctagatttt tgtgccacgg aatctgtcgc 49320 
gtgtttaatt cgttttggta ccctttctaa 49380 
gttgaaccca catccgcgag aaaccgtgcc 49440 
ttcgcaaaca gtgccaagag gacagatacc 49500 
aataaatccg tttttgtggt ttgacggagg 49560 
ttttatgctg ctctgtcccg cactcgacac 49620 



agttccgtcc ctggccagaa tcgttgggct tcttacacag tgcgataaga gcacgtgtaa 49680 
aatttgtacg ggggcccacg tacacgttaa cccgtatcgc ggatacacgc cacctgactc 49740 
gcaagggacc tcaccctcgt gcccctgcct tatctcgtgc ggggccaggc gcgcggcgga 49800 
tgtcctggtt accggacacg ttaatctttt gggcctgctc tttgacccca aagcctcccc 49860 
caaagtgacc aagctgcgtt taaaaagaaa cccacgcccg gtaccgatag aggacgccat 49920 
gtcgggcgtc acggccgaag ggaccgaggt gcaacccact tcgctaccgt gggccctcat 49980 
tcgcctgccg gatttagcca gtcgcgtgat gctatacggc tgccagaact taaaaagcat 5004 0 
ctgcttacgt tcttattgaa gcacgtcgca cacatcagcg aactcggccg ccgtccgggg 50100 
ctcgcgtaca cgatggttcc gttttccttc tttatgttca ggttcttcgc cggaaaccac 50160 
ctggatagcg tggcgatgac ctcggagaaa acggcattgg cgacgtgttt ttggtgggcc 5022 0 
acgtaaccca cgtgaacgtt ttcaaccgac gagagaagaa cgctgataat ggcgaccacg 50280 
atccatgttt tcccgtgacg acgcgggata acaaaaacgc tggctttctg cttaaacgtc 50340 
tgtaacacgt cgtccgtggt ttcaaataaa ccaaagtgtt gcttaaacgt ggcaaataac 50400 
tggtgtgtct tttcgggtgc tttgatagac gcgataaaat aaaacgtgtg catgattaat 50460 
tgttgttgaa agggctcaag gcaacaaacc ccgggcacgt aactcccatt taaaaatgac 50520 
gacaggttgg ctaaaaactg ccgcagttct gcataaacgg ggcactctag gaaagcttcg 50580 
tgggtccgtt gggccgattg gtattccatg tactggtcct tggcgtttgc catttggacc 50640 
cgacaacatg tttgtttaag gtagtttgtg agttcctcgg ttaaccgggg caacgtagcg 50700 
tggctagaaa ttcggggctt tcccaacgta ggggccggca tgggttcttg aactctgcgg 50760 
tacgtttgaa gataatcgtc taaaacagaa ctatatgcat taaccgcgtg aataactccc 50820 
aagcatgggt gggccattcg ttcggtcttg tcgttgcccg atataataac gggcggttgc 50880 
cgaaaccagt tttcgcaacc accgtccgtg accacgcgca agttattttg aagacgttcg 50940 
cgatagctgg ttaacaacat gttcccttca agttttttga ataacgggca ccccgaaacg 51000 
gaacgccgct tcgttaaagg cgttcagtta gctttagacc tgtgtgacaa cactccggga 51060 
cagtttaaac tagttgaaac acctcttaat agttttcttc tggtatccaa cgttctgccg 51120 
gaatcgcgcc cggttagaga ctgtccgcag ccggaagggt ttgactttga acacattcac 51180 
ctcccaaaac taacacgcat gcagcgtgtc ctggggcgat actgcgacca tgttaacaac 5124 0 
gacgacacgt gcgttaacgt aaaggcaagt tcctcgaatt cacagggtgc cttgttttat 51300 
ctgccgtatg gacaggacga atggaattgg gcgctcacgt taaggaaaga caagttggtt 51360 
aaaatggctg tagagggctt gtcaaatccc acgacctgga aaggtttaga gcccgtggat 51420 
cctttaccgc tcatatggct tctgttttac ggttcccggt cgttctgtcg ggaaccagag 51480 
tgcctatatg aacgcaattt tggtatgaag ggacccatac tcttaccgcc acatatgtat 51540 
gccccccaaa aggacgtaat gacttttgtc catcatgtaa ttaagtacgt taaattttta 51600 
tacgtgaacg ccggtggggg tcttgaaact gaaccgtccc cgccgttcga ggcctcgcgg 51660 
ttgcgcgcag ccatcgctcg tctcggggac gtggaagcgg atgacgcata cctgtccgca 51720 
aagtgcatgt tgtgtcacct gtacaagcaa aacgatacga tttcgattca tgaaacacac 51780 
gtgggcggag tcatcgcatt aggcggagac ggtgcgagat atataacgtc tagtgttcgg 5184 0 
gctcaacggt gcacgagtcg gggagatttt gttttaatcc ccctgtacaa cattgaaggg 51900 
ctcgtaagca tgataaggga acatggcctc ggcagcagct aaaaaaatgt taattaagtc 51960 
ggagctcgag tcggaaatca acaaaaaact gtccatctcc gtatttgaca ggtttggggc 52020 
cgacagtgcc gtgtttaacg cgcagtataa gggaaccagg gaatcgctgc ggtcgtacaa 52080 
cagcctaaaa aagaaggacg atctggcgac cgttgtcgga acgctagaaa cgtcgctgcg 52140 
tgaaaaacaa agcgaattgg gattactaaa ggggtttaac aggaaaaaaa ttgaagagtt 52200 
tgacgctgtg gcggacgcgg ttcgcgacct caaggacgag ctgtacggag aactggagat 52260 
tctaggtacg cttgacaatg aatctgttcc cgtggaagaa gagtccccca aggacgacat 52320 
tattaggtgg aaattggagc gtctgcccag agtgtgcccc aaaagccctt gatcccattc 523 80 
ccaaggttca gactgacgtc gacagaacag catcgtccca tataaccgtc attaaaacac 52440 
gtaagacgat cgcccaactg aagataccta acaactgggg ccagtgtagt caccaggcga 52500 
cggactggac cgccgtgctc ggacgcggct cgtatggtgt ggtgaggtcc atgtctctcg 52560 
gccgctgcgt taagcatttt ggcagccggc gtgagttttt ttacgagtgc atttttaacg 52620 
atatagtacg cgcctgccgg gagaaacatc ccctgaaccg cgggggtgac cgtatactat 52680 
gtttcctaga gccgtgcgta ccatgtcgcg ccctgatatt cccgcagtta acagggaatc 5274 0 
tgctaaacgc ggatcttaaa cacgtgaacc ctgaacggct ggccgttgaa ttctctgagc 52800 
tgagggaagg cgttagtttt ctaaataata tatgtggcat cgttcactgt gacatcagtc 52860 
cagaaaatat actgataaag ggggaactga caactgcgta cgggagactt atgatcggag 52 920 
atctagggtc cgcctcttta cacacgggaa ccccttggac cggagtgatg gtgacctcca 52980 
aactcgggtt cgtgcagcac acgtaccatt ttaaggcacc ggccagattt atctgtaagc 53040 
acatttaccg gccgtcgtgt ctcctctacc ggtgtctgct gtcgtgcgcc gggggcccgc 53100 
aggcgcatat gctaaatcag ccgttccaaa tcactccaca actcggtctc acaattgaca 53160 
tatcgtccct gggttatagt ttgctagcat gcctagagaa atatcttcag ccagctgacc 53220 
catttcccca gcagggagcg ttggcggacg cttcctccga atccgcccac ccattgttct 53280 
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atttgcgttg catggtgcca agagtagtca 
ttccactcga tttaggcatt gactcatctg 
cgtacaggcg gttttttgcc aaccagtgta 
cgttagaaaa cgcatcctcg cggctgtgta 
tgttggtcag ggactacttt agtcattgcg 
gatgagccga tggttcagga gatggcgctt 
tcgaaaatga gcctggccaa ctttttaaaa 
aacaaaaagg aaataagcat gccggctata 
aaggtgggag attttatcgg caacaccgac 
agggggaggc gcctgacgcg cctgtcggaa 
cacgatcgac accacgtgtg tgcattgata 
cctctgtggg acgctctgag ggacggcata 
aaacaacaga attcgtccaa aaaaattttt 
gtagcgggcc cgctcgcgtt tggactgcgt 
acgttgctgc atccaggcga ggcgcactgt 
aacggggttt ttggcgtctc cttggatttt 
ggtttggagt ttcacccaga ctgcaaaatc 
ttttccaaga tggagtgtga cccgatttac 
agcatgcaga cgcttaaggg gtttttgtac 
ggagaggaca ggctccccag tgaatcggac 
gtgtgtccgc ggaaaaagag acgcttaact 
attcacaact ctacggcgcc ttctgatgtg 
ggccaaatta atattaaagc tcatctgagc 
tattattatc aagtgttgct ccagtctctc 
ggaactaaaa atttgggaac ccagaaaaac 
cagtttcaag acccgagctg ctgcacgatc 
gagataccga cccttttaat cgtgacaccg 
ctggtgaagc aggcgaccga attctgggcg 
ccatgggatt tatcctctct gtgtgcaaac 
agcccataga tgtatccaaa gaattcgatc 
taacggcaga tgggactgcc cccgcggcgc 
ataaaaacaa taaattgtca gattttgttt 
gatggtgtgg ggttgggaag ggaatgggat 
gtattttgga tatgcatcct cgtaatcgct 
cagttccggg gagtacttaa cttttgtcgt 
gtattcgggc gactttgcgc accatttcca 
taggatgcaa gtgatcggaa tcatagagat 
agatatgaat accttgtcgt atcgaacgac 
aacgccaacg tagaatccga tctggtgctt 
gatataccaa gctattgtaa ggacaaaaaa 
aacgagcata tagaaactgt ttccaatagc 
ctccatggct attaaggaca gataaagatt 
ggtgtctagc aagctattct caggaatctg 
cgaattgcac gatatcatag cctggacgtg 
tacaaacgag gctaaaagta tatgtttgta 
ccacagcctc aaaatagaca taaaaactac 
ctggaggctg gtcattgttg tcgcgtgcac 
ccacgcatag taacacagca gatatccgtc 
catttctggt gcttccaaaa acaaagtcgg 
taggtttaac gcgctgtagt tgaccaacgt 
gaatgtggcg gtcagtggca ctaccgctga 
tagttttacc catgacgata agataaacga 
aatctgctcg cgggtgcgtt cggcaaacta 
acggatttaa ataggctccg cttttaaata 
gcaccatgaa cgcccgggag gtggcactca 
gcacgcacga gcgcgagaaa ttaataatct 
gcggaattca gggagatgcc gcatatctat 
ggggaaatat accggcgata aaccgtcacg 
atatgcagtg ggaactccgt actcgtttac 
agcccggtgc ggttaacgta agggtcagcg 
gagcgttgga tcactcgtgt tcggtgcgcg 



tcgccgagat tttttctgtt gcctgggacg 53340 

gccacgcgcc agctattccc ctgagagaag 53400 

gtttatatag ggcgcaatac aaagaggatg 53460 

actcaaaact taaactagtt ctccagaagc 53520 

gaaactgcgg agatcatgga ttttttctca 53580 

ctcgacatcg atgagcagca gcggctcctc 53640 

catgagcgag taagggcgtt ttttagcgat 53700 

cggttcgtgt ataattttta tctattcgcc 53760 

gtgtacgatt tttacgtcac ttgcgtgttc 53820 

gtgtacgacg cgtgcctaaa catgcacccg 53880 

gaacaggtca cgcgcggcca aaacatcaat 53940 

atttcgtcgt caaaatttca ctgggccata 54000 

aacccgtggc ctatagtcaa caatcacttt 54060 

tgcgaggaag tggttaaaaa gatactggcg 5412 0 

gaaaactacg gattcatgca gagtcctctc 54180 

ggaattaacg tcaggtctga cccaaaagac 54240 

tatgaaataa aatgccggtt taagtacact 54300 

gctgcgtatg ctaaacttta tcagaagccc 54360 

tccatatcta aaccggcgat cgagtttgtc 54420 

tatcttgtgg catatgacaa agaatgggag 54480 

gcagtacacc atctagttaa aaagtgcatg 54540 

tatatattgt cagatccgca ggaaaccgga 54600 

gccaacctat ttataaacgt caggcatccg 54660 

gttgtacagg agtacatcag tctctccaag 54720 

tttatagcga ctggcttttt tagaaaacgg 54780 

ggtgaatttg ccccgttgga tccacacgta 54840 

gtgtattttc ccagcgtggc caaacaccaa 54900 

gctagtgctc gtgaggcatt tcccgagcta 54 960 

gccccaccaa caccgtagat gtgaaggggg 55020 

ctattatagg agaagaaagc attgtcttgt 55080 

tgtacaaacc caaaaccaag ccatccaaac 55140 

aagcattcgt atctttattg gagtggggtg 55200 

tgaggggagg atgaaatgct agaatcatat 55260 

cacgtcttct tctgtttcat ataacgcgtt 55320 

cttctcttta gggccttgag gaaggggtgc 55380 

atcgtttctg a'taattctta agatgatggc 55440 

gttcacggct atgactttgt gcaggctggc 55500 

gggaagaagc agaatcaggt aactgacaaa 55560 

taggtacttt actagaaata gttctgtgtt 55620 

caagtttatt gaaccaaaga caatatcgga 55680 

catcatcgtt ccgagggaga atactaacat 55740 

ggcgccaatc ggttttccgt aacgcaacag 55800 

ctgctctaag acgcgtagag accaggtggc 55860 

cgtaaacgat aggcaaaaat gtatacagta 55920 

cgacagaacg tgaataaata actggatggt 55980 

actgtgggag cctatgagca cgatccatgt 56040 

gcgcttggcc ttgaatatgg ccaggatggc 56100 

caccaaaaac gcccacgaga tgtagacaaa 56160 

cgtcagatgt ttagcagaac ttctgaccgt 56220 

gttaaagtag cacggaaatc caagtcccgg 56280 

cattataaac ataagtccaa gaataaccag 56340 

gtcgctccgt gaaattttca tagttcaaag 56400 

gcagtaaaat cagcaagtgg cttacatgtt 56460 

tcacattctg tgcaaaggag ttgagccacg 56520 

cgggacatgt tttgcacata tcgctgcata 56580 

ggcaggttca tttacttgta tgtcaacaat 56640 

ttgtcaccga aacattaagc aatactgact 56700 

caccgtccat aaatgagcac ggccgtaatt 56760 

ggaatcccat cattcaattg ttaagtcgcc 56820 

agccgaatat ggtaatagtg ggctgcgaac 56880 

tgactggagc ctatcttcat tgcgatacca 56940 
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ctatggactt tagtttggat tctgttgtgt 
tgttttctca ctgtttagtt tccaacattg 
actatagggc atcgagtgcc acgcaatgcc 
ttcttgatat ctttaattgc gaatctcgtg 
tgagtatcgc caccgcaaca tttcacgttc 
gggccctggc cgaatttttt agggcgctgc 
tagcgccaac gttaacgtac gtgtgtccgg 
cctacttttc cgcgtttcca cacctcccgc 
taaccgcggt ccaagaactc ccaaaagcac 
attcaatctt aggggatcag cttctcatac 
tacccgttcg ttggccgctt tgggtggagg 
aaagcgttgc ccgcatagtc gacccacatt 
tgtgccagca cctaaaatgc caccgtgcgt 
ccatctgttc cagcgacctt cgcctcttca 
ccatttttac gtgggccgcg gcgagcgggt 
tattcatcaa ggctccccag ctcagcgcag 
gttgccgtag gaaacaatgt tacgaaggtt 
acagcaaaag gtcgtgaccc gtcttgaggc 
ccacccgcct gactggctca agtgtgaggt 
agccggggtt ttgtatgccg gactcgcaag 
gtggctggac tgtctagtag aaggcgcgac 
tggggcgctg gcgggtatct tacccaccct 
gctgctgcca cgcgcgtggg taaaatcggc 
tgttacgcct ccacagtttg tcgtgacaaa 
gccgttaccg gttgacgttg attttatgta 
tgttgtgtcc aacggggagg gtacggagag 
gtttgaaaag gtactatgtt tattttaaaa 
ggttatttga aaatatttta ataccgatgc 
gcaataaaat atgttaaaag tctccgcgtg 
acgaaaggat tccaaataaa tgggtttcag 
caacagctta tggagtttga accgcactag 
gtttaactcg tggtatgccg ccaaggtcga 
cgcggaatca ttacacgtat acctctcgtg 
gataaatgat agggggggct tatcgtagtt 
tgttacgctc gcgctcgccg gcttaaacag 
tatgtccagt gcgtcgatca tgtcctcgtg 
aaacgttgcc gactgaagcc tggtgcgcac 
tcctcgaaca cacagcttag gcgtattgga 
ttccaacatc agtcgcggaa aaactggata 
ttcaggcatt accgcctcag atcggtgctg 
catcggttcc aggctggcta atttgttcgc 
cagacagcct gctgtccaca tacagctggt 
gagagtgatt tatgaggtcc atgtgagtca 
agagaatgga aaccgtgtcc tgcgagtagg 
ccacgggtgt gaggttgtac gtttgaaaca 
tgagcatctc cataaacgga gggacgtatt 
ctttagggat gtgacattcg gataggttaa 
gcaacgtatt ctgagtgagt atattaatat 
tctgaagcaa ctgcgtttca atgcattgaa 
caaggttaga aattgtatcg aactgctcgt 
tatttgaaac ctgtttccgg agaccctggg 
tgtttttgtc cacaatgttt tgagagacgt 
tgccgctcac tgttttgttc tccgcgagct 
ggtcggtttt ttgagacgat tgctctaagc 
acgtgctacc gaccagcgct tttagaggcg 
gctccgcggt cctgtgacat gtcataatcg 
aggcgacgcg tccattgtaa attataggcc 
gtctgtaggt gccgagataa gcagccttat 
agtctggaat ataaatagac gtcacatcgg 
tgtcgacaat ccgcgatatc gtaacgtctc 
agtgtgcttg tatcctggcg atcgtgtcgc 



ccccaacccg cgaattttgg ttctcagaga 57000 

aagtttacct taaaacaacg ggcgggttat 57060 

gaaaaagggc gaaagatggc gcattgggta 57120 

aaatacaagt tgccgggcag aagtacacct 57180 

tctgggtgga cgaggcgtgt atgtggaacg 57240 

acaataagtt gttcggcgac cgggaaggcg 57300 

gggccactcc ggagggaacc cccttccccc 57360 

tcgtgtttgg aagaccgcga aggctcgacg 57420 

aaattgcggt acactggccc ccgtttaaag 57480 

ctggcatttc acctaaaaag ccaggtaccg 57540 

atgttaactt gagtctctgc gagacgacag 57600 

ctatagtaat cataaaattt tcatcactgt 57660 

ttgttaaaaa tgagttagaa tacatagcaa 57720 

tccaagagga atacaaccgg ttacttgcca 57780 

atacctgggc ggccattgat aaaacaacag 57840 

ctgtaagtgg tttctgccca tcactaaata 57900 

aaaaataaca gttcatttcc tttcacagga 57960 

gcatttggga cttcccgtac aggaaacttc 58020 

ctgctccgcg tccgtgtttt taaaaatacc 58080 

agaccccacc agggaagcaa aacgggactc 5814 0 

gttgttgctt aacaactcag tgttaccgat 58200 

ttttgccaac aggcggtgtg ttaatttttg 58260 

gcccatatgc cctcccctac cgattgactg 58320 

gcgtggacca atctgctggt acaaggaatg 58380 

ctacctacag gaggcactat gtgtttttag 58440 

tcacgcggac aatatacgac aattagagaa 58500 

cagagtctgt ctggggatgc tgtgtaggct 58560 

gttatgaagg tgtctggtaa aaattgtgcc 58620 

gccctcttct tctccaacgt tctctgttag 58680 

taagcctttc atgtcgtctg cgttttcgtc 58740 

gggcggaacg cgcaaaatgc cttggggtgg 5 8800 

cgctatcact tcgatcccat acaatacaaa 58860 

taggtacctg gcccggtcaa gcgcggacag 58920 

ttccagcata gaaagaatgc agcagagttc 58980 

ctgcacgcgg tcagaaaaaa cgtgcggata 59040 

ttcgaccacg cacgcaaaga caaacccagt 59100 

gtgcacgcct aaatcaggaa cgtttagtcg 59160 

agccatgcaa atgctatttt ttttatgtac 59220 

attatagggc cctctggacg agcattctgt 59280 

ctcccccatt agacgcttca gtatttggtc 59340 

aatatttgtg gcctatgtca tttatataca 59400 

ctgcaatatc gtgtaaactt aaggaaatga 59460 

gtttggcgat tatgtacgta aacgggccta 59520 

cgatgctcgt tcgctgcccc tgattgtcga 59580 

tttccccttc ccacaggcta gttagctctt 59640 

gtgaaaaaaa gctgtttact acggagctgc 59700 

tggttgggtc gcgcagaccg gataaagcct 59760 

cgtgtctggg cctgcctccc tctggtaaac 59820 

tcttctttac gtagtcgtct cgttctttct 59880 

tgatctggtt cgtcaggcat ttgaacacgg 59940 

tcgtactttc cgtaccggtc ccaaagagct 60000 

ctgtgacgaa atcctcaaca acgtctgtta 60060 

ttatcagcaa gtttaggagc tccctcttgg 60120 

gagccaatat gtctttatac atgtcgttgc 60180 

cggtatttag taactgacat aactttgcgt 60240 

cggtaaataa ccggtacaga gggctatcaa 60300 

gtgaaacaac ttcaactgtt atatcttttt 60360 

ctccgttaaa ctccacagac acctcggcgt 60420 

tcattatcga ggtaatctcc tgtgccagtt 60480 

cggaagcgta cgttttcctc atctgttgac 60540 

cgtcgggggt gcgcctgaca ataggcacca 60600 
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acccaagggc gaccacccag tcgacgtatc gttcgtacga taaaacagtc ctggtgtgta 60660 
gtacctgatt aatagtgttc agtaacagtt gatcgactgt tagacgaacg gtacgtgccc 60720 
acgattcaaa cgtagaccgt ttgaattccg gcgtcaggtg ctcgacaccg cgccaacggg 60780 
tcttcagggt ggtctcgaag ttggtccagt cctttaagac gttgtcgtat gtgcagctcc 60840 
ccaatgccgt cttgtacagg tgtatgaaca gctgtcttcc aaagatccct ggatttcgaa 60900 
gactggagtg tagcgtttga ccgcgaacgt acgcgtactt gccctgtagg atctcgaaaa 60960 
gcgagatgga gagttccgtc ggatggacca aacacgtcga accaaatccg gggttcatct 61020 
tgaacatgac gtccgatgcc aaagtcaggg gtgtcgtgga tcacgtcagt cgcctgtcaa 61080 
atataactac cagcccaccg gaaatgggtt ggtacgacct ggccttcgat ccggctgaag 61140 
actccgggcc gttcttgccg tttaccgttt atctaattac gggaactgct ggtgctggga 61200 
aaagtaccag catatcggcc ctgtaccaaa atttaaactg cctgatcacg ggcgcgacca 61260 
ccatagccgc acagaaccta tcgcgtcgcc taaagacgtt ctgtcccacg atcttcagcg 61320 
cttttggctt taagagccga cacatcaata tagccgtcag aaaagctcat cagaccggag 61380 
ccgtatccat agagcaaatt cagcaacagg agctatcgaa gtattggccg gttatagtgg 61440 
acattatgaa agaggttatg gcgaaaaaac ccaatggcat gtacgggact atatccaacg 61500 
cgaattttga aaccctctcg agaatgaccg gaccgtgttt atggacttcc aatattattg 61560 
taatcgacga ggccggaacc ctgtcctctt acatactcac caccgtcgtg ttcttttact 61620 
ggttcctaaa cagctggcta aatacccctc tttaccgcca gggggcggtt ccgtgcatag 61680 
tatgcgtcgg ttcgccaacg cagacaaacg cgttccagtc aacttacaac cacgggacgc 61740 
aaaagacgga gatatcgtcg tgcgagaaca tcctaacatt catgataggg aagaaggtcg 61800 
tatctgagta cgtacacttg gagagaaact gggcgctgtt tataaacaac aagcgctgca 61860 
ccgatctgca gttcggacac cttctaaaaa ttttagagta taatcttccc attcctgacg 61920 
aagtcatgag ttacgtagac agatttgtcg ttcctaaaag taagattatg gatcctttag 61980 
aatacattgg ctggacccga ctctttttgt cacatagcga ggtaaaggcg tatctaacaa 62040 
acctacacac atgtctaacg ctagggggcg ataccaggga cacgaagctc tttacctgtc 62100 
ccgtggtgtg cgaggtgttt gtgaagccgt ttgaggaata caagcgggcc gtcaacctca 62160 
ccaacctcac cgtgaccgag tgggtgacaa aaaacctctt taagttaagt aattattcgc 6222 0 
agtttgtgga ccaggacatg tccatagtcg ccacggaatc cactgaacgt tcgacccagg 62280 
ttacctttat caccaagttt gttaaaaaca gccacgtgtc tttaaatgga aaaacaaaaa 62340 
aatgcatatg tgggtttcag ggtacgtact ttgagttcaa aagaatccta gacagcgaac 62400 
tcttcgtgga gacgcattcg caagaccgtc cggaatacgt atacggtttc ttaaacacgc 62460 
tactgtacaa cgccatgtac tcgtttcacg cgtacggcgt gactaggtca cacgagaaat 62520 
atctgcaaga cctaaaattt gcacccctcc cggccgctct ggcaaccggg cgtgtagacc 62580 
ttcaaacggt tcgtgaagag ttaaacctgg aagacgacat cttctaccac gtgtgcagtc 6264 0 
ccccgccacc cgcgggtatc acctccctcc aggttttggt cgacacgtac tgcgccctaa 62700 
aggacgtgtt cgcctccaga ataaaggtgg cgtgtcgctg gtttggcggg gagtttgaga 62760 
aggaaacgtt ttccgcgttt acggttaaca tggtcgtaag ggacggagtt gactttgtct 62820 
ccccttcaga acgtctcaac gggctgttgg cgtttgcatc gaccgttgaa tcgtataaaa 62880 
ttaaggggta cacgttttta ccggtagcgt tcggtcgctg tcagggtttg ccactcagtg 62940 
atgacctcag gaagaagatg ccctccctgg tcgtgcagga ctctagcggt tttatcgcgt 63000 
gcctagagaa taacataacc aaattgaccg aaaccatgga ggacgggagc gttttccaag 63 060 
tgtgctgtgc gggggactat ggggtcagct caaatttagc catgaccatc gtaaaggcac 63120 
agggaatgtc gttggagcga gtagccgtag tatttgggtc ccacaagaac gtccagacaa 63180 
gccacgtgta tgtagcaata tcaagggctg ttaactcaaa ttatttggtc atggacagca 63240 
acccccttaa aaccctcctc agagaaccag tcgataacac ctccgccaag catatagtcc 63300 
gcgccctcca caacccaaac acaaccctca tctactaaat aaataaaaac aagcaaatgg 63360 
aaaacattgt gt ttttattc agtccaacca cggccacgga tagttgtcat ttccacacac 63420 
cgggggggtg gccgccatag tttgacgacc agaaacgccc gggccctctg ggttagaagt 63480 
agccccgccc cccgtggtcg aggtcatacg tctcttaggg ccctgggttg taccaacata 63540 
aatcacgtcg tcatcaccgt tgcagtccct gcgtgacgcg cgcctcttgc aggcggcacg 63600 
aacaccgccg cttgtaacgt gatctgcatc gtcgtcgctg tctgaagatg aacacaggtc 63660 
aatcacatca gtggcgcccg tgccctcatc ggcgtctgat tcgtcccatg tctcaacaga 63720 
gcctcggtcg tccacttcgt cgtcatctga aataataatg gtctcttgtc ctgcagttga 63780 
tccagcaaca acaggttctt cttcatcgga agaccatccg ccctctgccg gtgatctaga 63840 
tatatcacct gaccaggcgg gcgaggcccg tggcgaaaag ccgtcagctt ctccgccact 63900 
ttcatcagtg tcctcgtagt catctataaa aaaatcgtct tcgctgtctg attcagatga 63960 
ctggggtgcg cggtgttggc cagtaaccac cgcaacgggc cgcggaccac caaacgcggt 64020 
cagtggcgcg ggcgcgctcg gtggtgtggt ggcacctgtc gaagtaatga cggtaggtag 64 080 
ccggccgctc tggttaaagg actcaccgtc ccgtgcgtgc ggatctattc taggtgggga 64140 
ggagaccggc ggttcgtctt cttcatccac atcactgtag tcgctgttct ccgaatgaga 64200 
gtcggtttcg tctggcgttt ttggagtagt gggaggggaa tcttggtcaa aaacatcctc 64260 
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aggtgcagtc 

acgggacccg 

cagcatcctg 

cgaaacgttc 

gcacccctcc 

aactgcggcc 

gccttgagga 

ccccacagca 

gtgaaccaat 

atggtgttta 

gcggtgaacc 

atactgtagt 

tcctggccaa 

attctatcta 

agagccgcga 

ctaagtagca 

actattgttt 

gtgaaacttt 

cgctttcact 

attgcatata 

aaatccttat 

gatcagcttt 

aatgaaaaga 

tacaacatgg 

taaataaact 

gtacttatat 

ataatgcaaa 

ttaacccgaa 

atatgcaaca 

agctaagctc 

tgttaatgtc 

gggcttgtaa 

ttagcccatc 

gctccacaca 

taacatcagg 

cactggttga 

accacacgag 

ggctcattgc 

ataaccacga 

tcacaaaccc 

aaaccctagt 

actgctgctt 

gcctggcggc 

gctctttgca 

gcatatatat 

tttctatcgt 

acccaggatg 

ttccatgtcg 

cacccttaac 

taaccccgaa 

tactattaga 

aaggtaaaca 

aggtaattaa 

gagcatatct 

tctgtgtact 

aaaaactgcc 

ttgtctttaa 

tttaaaaaca 

aaaaattttt 

gtgtctatca 

aggcaccagc 



atgtcaatca 

gcgaacaaaa 

ggagtggccg 

acctgtaaaa 

gattctgaac 

agggagaacc 

cgagatgtct 

taaaaacgca 

cccagccaag 

aaagcagaac 

caggtacagt 

ccggagcaac 

ggatcacgac 

tcggtggata 

gcttctgttt 

acgtttgtgt 

taagccaacc 

tgtacttctg 

accgtttatg 

attcaacaat 

aaaaaacgcc 

atatcgcaat 

agcaatctca 

atttactaca 

taaggtatac 

aatatgtata 

catcaaataa 

aatacagaaa 

atatttcagg 

ctcaaagaca 

ggaaatatat 

aatatcatcc 

caaatggtga 

aacggaagga 

aaccggaccc 

attgacggtg 

ggcggaataa 

catttccaga 

caagttcaaa 

gaactctcta 

tctgagcacc 

gtgaggattc 

gctcgcgcac 

gtcgcgccat 

ctgttgggta 

tttctctgaa 

gtcgacacct 

ttcaaaggta 

tggaatggaa 

acagagctaa 

ccagggtgag 

gctcgtgtat 

acttttttta 

cgccaatggg 

ttactagtgc 

tggcctcgcc 

aaatgcaact 

cgttaaactg 

taaccaaact 

tctccttcac 

tcagggggga 



ctggggggcc 
aacgtct tgt 
gcggggtcct 
cgcactgcct 
tagatagtca 
ggttgccgcg 
ctgtccgttt 
cctctgaagc 
gt tggcgtgt 
tccccgtttt 
attggctatc 
gctaaaggct 
tttaatatcc 
aatcacggtt 
taaaaatgga 
ggagggttct 
ctccataaat 
tctgagaaat 
tgaaagtctt 
tgaagctctt 
ctaaggtgaa 
gttgctgtac 
aacggctccg 
agcccggtac 
atacttctca 
gaattatgcc 
tataacacaa 
tgactacaca 
gtaacaccat 
atttcctctt 
gagtccagga 
agtgaatcag 

gtgggtgaca 

tgcatttgat 
gtcaatgacg 
aaaaaaaatg 
aacgggtccc 
tttgttaaat 
caggaggcaa 
ggaacgcaca 
atctggttat 
agcttaggtg 
ctttgacagc 
atctcggcca 
acaaacctca 
tctatagtta 
ttaaaaaatc 
gtcacgatgg 
acagccatga 
ataccaatga 
taaccacgct 
atttcagacg 
ttcaagattc 
aatggctgat 
cgatgaacct 
tt ttagaagc 
ctggtacatc 
cgtgtcctgg 
cgccaggtac 
ctcccctttg 
ggtcggatcg 



tccgtacccg 

tctctttttc 

tgacgggtcg 

atccccgatc 

ttcgcaagtt 

tgggcggtag 

atcagcgtcg 

ttggcagaaa 

gaccccgcct 

gcccaacagt 

tctttaaaaa 

agtccggtgg 

tccggttcgc 

acgttgctca 

gaaagattta 

tctaacactt 

ccccgcgtct 

ccaacagagg 

caaactcggc 

ctctggggac 

atccatttac 

acagccctgg 

gaaatataaa 

aacatgtaat 

tgttttatac 

aatacagtca 

aatgcacgct 

aacacacctg 

gttaataaaa 

caattgatga 

taccctcgtc 

ctgctaatgt 

ctttgggaca 

cgttcacgag 

cacatatgtc 

tattcgtgtc 

cggcgctaag 

ctgttcgaac 

cgcggctggt 

tcaaagcgtt 

aaactgcggt 

gcctgcaggc 

cgcacgccag 

taggagattc 

ctccctgtaa 

catctttaag 

gcaaaagcga 

atctccagtt 

caccgtgaac 

ctgtcacccc 

atcctttaaa 

cctgtcataa 

aggagggcgt 

gaatccacac 

aaatttggcg 

tctatgcaag 

tttatgacag 

tagtcctcgt 

tgaaacgatg 

aaggttttgg 

tgagggggta 



ggcggtaccg 

ctaggtgccc 
gataagaaca 
agttaatatt 
taaaatggtt 
ccaggggaga 
ccttgtctat 
ggcagcttat 
tccctctttc 
ctaagcaccc 
tatttttgag 
cttgaccgcg 
aacaatacga 
tatccataac 
aaaattcaag 
ctggtgacat 
ggtaattact 
tggccgcgta 
taaaaactca 
actgtaattt 
acatatttct 
acataaagct 
gtcactaagt 
taaaaaaaat 
ataatatgaa 
tggtaagtat 
tccggatatc 
aaaccaattt 
tacgcagcat 
ctcatcaacc 
atcgccacaa 
caaggatgtt 
agaccccgta 
gggcagcata 
cattaagatt 
acatgcgtaa 
gccgtgaatg 
atacgagtct 
ttccacccgg 
cagacaaaat 
taagagacga 
ttgctgacag 
ttccagagcg 
gctcgcgtga 
aaggggaata 
tataaccaaa 
ccggatggct 
agactctgtc 
ttc'ctgatgt 
taccccaagc 
aacccatacg 
aatggatacc 
gtttacagtg 
ttagtgctcg 
tgtttaggtc 
ctttaatcgt 
tggtccaaaa 
atataacctc 
gtacggacag 
t tacaccgac 
acaattcgtg 



gcaaccccgt 

cgggaatcgg 

tagccatggc 

ccagtcaacg 

gcagcctaga 

cggatggcgc 

agcctttctt 

tatgtagctg 

caccgtcaga 

gtgagaagga 

gcttggaggt 

atggtaagga 

ccaccacata 

gcgcttcagc 

ccatgagtcg 

tttactccaa 

ttacgtatac 

taagcagttc 

tcaagactat 

aacttgtaca 

gtggtagagc 

aaatttgtta 

ggtattattt 

ccacatgtaa 

cgctaatgct 

attttaaggc 

cgcccagctg 

tattctcaac 

gcacattttt 

tcggtttccg 

attaattcca 

gtttctaatt 

aaattggcac 

tttattttta 

ggagtcgtga 

ctatttttta 

cggcaggcac 

aaaatcaagt 

acatccccgc 

tccgaaaccg 

gcggccacac 

gcccgtagct 

aggcagaagc 

tcttttaggt 

a ggtccgttc 

gacgctaaaa 

ctatccgcgt 

tccctgttaa 

ctcctaaaac 

cacgcccccg 

tggagtttgt 

gacgacaatc 

gtgtaggtgg 

gatcaggtgc 

gtaccaacca 

gtccaatagc 

aaaacaaaga 

ctcaacaaga 

gtcgtgaaaa 

catgtgagat 

ggcgtggggg 



64320 

64380 

64440 

64500 

64560 

64620 

64680 

64740 

64800 

64860 

64920 

64980 

65040 

65100 

65160 

65220 

65280 

65340 

65400 

65460 

65520 

65580 

65640 

65700 

65760 

65820 

65880 

65940 

66000 

66060 

66120 

66180 

66240 

66300 

66360 

66420 

66480 

66540 

66600 

66660 

66720 

66780 

66840 

66900 

66960 

67020 

67080 

67140 

67200 

67260 

67320 

67380 

67440 

67500 

67560 

67620 

67680 

67740 

67800 

67860 

67920 
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tactccagtt ttaggatctc cggcaggacg cgtacgagtt cttcgtcaaa tcctacgcag 67980 
cccgcatcca ccaggggcag tactgagtta ataagtcccc gaattctatc gttcgtaatt 68040 
ttgtggagct cctttagaca gtaaaccatg ttgctccgac actgcggtac gatgaaccgc 68100 
tctccttgtc tgtgttgcat gagcacctgg agtatcttag acaggtatag ggcccgttcc 68160 
actctaaggg ttgccgcttg gtagagcgga ttgcgggcat tgacggtcag tgtgttcatt 68220 
tcaccgacta tcatgccgag tggcggatag tggaacttgt acatatgatt taaaagatgg 68280 
tctttcccgt aatggcgact catggtcgtg ttttaacgtt cacagggtgt taaaaaactc 68340 
agaaggtcct gcgtggacaa cttttgcgga ctctccgaac cgctcaggga taagctgtac 68400 
cacgcggtgc ttcttataaa cgaagcatac gtcaaaaaac atgacccacg tgacctaaat 68460 
aagctcacag aagaagtctg tcgatgtatt gtaatggaat gtgcctccct gggtcccatc 68520 
tccgggctca tcgcggatct gaatctcttt aacctgttct gcctctaccg gggttcccgg 68580 
gttaaaaccc gcggggccgc cacctgtaac gtcccgtgcg cagaatgcgc ccagggcatc 68640 
gtgagaattc tcaccgagag ggccctttgt tgtaccgaaa agatgttcat agcgtctgcc 6800 
tgcagcgggg tcgttatacc gccccagttg gccagagttc tgcacgacgt ctacgcggaa 68760 
atgaaggcca agtgtttggg ggcgtggcgt cgtctcatat gttgcaggcg gcctatcatg 68820 
gctattgccg actcggtcct cgtcacttat aataccctgg atgccgaggg aaaactggaa 68880 
cttagactca aagcactgtg caaacttgtt tttcaaccca tctttcttca aagaatctta 68940 
gcccctatgc agctactggc caacgggaag atggttcctg acaattattt taccatcacc 69000 
ggtacggccg agaagaggcg ccctgtcgtg actggaagta ctagcgggat gacgtgtccg 69060 
ggaagcagcc ttgtccccga ttccttaatc ctgccagtat gcgaaccggg gcttctcccg 69120 
gcacccctgg ttgacctcag taatgtctta gaaaatccag aaatcatcct cagcgcccca 69180 
cccctgagtc aatttgtcat cacaaacacg caccccagtc tgcctcagtc agtcagcatt 69240 
attacgccaa cccagggcgt tgttcccggc caatgtttta tggacacgtg gaaagcggtg 69300 
tcacagagca ttcaccacca ggcacagacg cctattttgg ccgccgcact aaccggttcg 69360 
acatctgcgg cccctggccc gcatatcgca tgttccccag ttgccggcac gtctcggcag 69420 
gtggaagggt ccgcgggcgt cgattgcggg aaaccagcat gcgttccgca gcccgcgtta 69480 
ccgcccaatg tccccgccaa gaggatggaa acggtagcac agttgggaaa cgctcccgta 69540 
aaaaacgtcc acatcggagg ccgcgtatac gctccactgg ttaatatacc aataatagac 69600 
ttaacgtccc cgtcagggtc cggccagagt ccggccgata tcgccaacac tccagagtcc 69660 
cgcatggcgg ccggctctcc gcccttcgcc gaaaccgccg caacggtccc cgctaagaga 69720 
aagcagccac gcgaggacgt ggcagacaaa agactgaagg gcgacgttcg gggcgccgca 69780 
acagtaaacc accctttccc gggaccgtcc gggatgcgcg ttcgcgagca gggcttattc 69840 
gatttaatcg aaagctccac ggatgtaacc gcgaacgcat ctggacccaa aaacgacgac 69900 
gacatgctag cggctatcct acaggacctg tatggactac agtccccccc ggccatcgat 69960 
tccccctcca gcaactcgga caatgaggag atatttccag aggttagtcc gccatctagc 70020 
ggccacggat cgccttgaaa gatgacccca agacgcaaga tcacaccacc gagacagtcc 70080 
cacgcaaccc ggcggatgtc taaccgtaaa aatatcaccg gttggaaagt cccttggtgc 70140 
gacaaaaatg agcttcaaac ttacccattt tctgtgctat ttcgggacac taaaaaattg 70200 
tcgccagtgg cgacaatttt taagtctgtg taaatgtcac tctttctcaa gtttctgtcc 70260 
tagaattttc acatecagtt tcatatcata taaagacacc tgttttgttt tcgtgaatct 70320 
ggcgtgtttt tatcttgaga ggccccctaa aaacatgcct cgtgtgaaaa cacaacccaa 70380 
gagaccccaa gtgcttgaat ttatgccatt agatctccac ggtggaacac acacggagat 70440 
ggattctcaa aacctgtgtc ctgacggcca ggatctgctc gggtcttata tctatacgga 70500 
gaataacggc ccgttttccc aaataatgca caatggacag agcaataccg ggacaggtga 70560 
aagcttcggc agctacgctg ccggcgacgg ttttctgggc ggttctgtgt cagggatgta 70620 
tggaaacaac accggagagg gcgcgtgttc taaaagaccg tccgcgtgcc gtaaacgctc 70680 
ggctgcacta attcacgcgg cgtccgaggc gtctgtggcc gagcaaggca cctcacaggg 70740 
ggcacatgcc gtatctgacc ggataggcag agacggtggc gctgacaata gactactcaa 70800 
ggtgagtgcg cggctgtcgg acaaaacaaa gagcgccctt cgcagccatc cttgcttgcg 70860 
ttgctattct ttgatgttta acacgtaatg tctgtttata ttaaccctgc catagcccga 70920 
accgaacact gtgcgatgcg attataaaac gcaacactcg gacacgtcag atggacgcgc 70980 
aaaacgcgcg cggcagtgca agacctcagg aaaaaataaa gcccaacagc tgcaacaggt 7104 0 
acaataagca aatttagtaa tttgtatttt ttaagtttcc gaggttttcc cgctggcctg 71100 
cggcaccgtg tccgtgcttt agttggaata cctctcctaa caagcacgta ttccaacagg 71160 
agctggaaag taaaacaacg caactgaccg tcctcaccgc tcaacacgaa gcccttcaaa 71220 
agcacgcatc atttttacag aagatgatga tattgatgtg caaaaacggg aacagtaaaa 71280 
aaagcacgtc atgatttaaa gccacctcgg gttactagtt taaaactgcc taattgattc 71340 
tattcaccct tcaataaaat aaaattttaa tacgtttgcg gtttgtccat ttcctgtctt 71400 
aaaactatta aacaattcat ttgggaataa ggatttatgg gattaaggga ttaaattttt 71460 
ttggatcatg ggatttggga acatacgtct gggatggagg ttatgcttca tggtctgggt 71520 
ggcgtggatt gcacggggac ggtcggtgtg cccaacctgg cacctgacag atgggaaata 71580 
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cgaggcggta tacaggcact acctcgaaga gtgccgcaaa catgaaggct cggggagcct 71640 
ggacggttcc ggacagacaa aggggtctgg aaccaaagca accaccgaag ctaatatatc 71700 
gataagacct aacgttgtca catcaggtca aaataaagag ccgcctggga cagcaccgag 71760 
ggccgaatca tcacacgacc tgccacgcat caagcaggtt aacgctctcc gattatcaac 71820 
cccggaattg gcgcaaccac tcccggtagt aaaatcgact ccgcgcgagt cacagtcagg 71880 
tgggacaccc tggaacgcgc gcccccacgc gttcattatg cacacaaacg acatgctcaa 71940 
cccatctgtg gtcctgtctt tcagagccat ccgtgcgcgg tccacacgcg ataccgagca 72000 
gtccgttcgc gatcggaaca cggtcacgac cagctatcgt acccctggcc gcccttccct 72060 
ctttcaagcc agaccctcgt ctcacggtgc gcgtctaccc ccttcgcccc gaacgatggc 72120 
aagatacgcc gagtcgcgaa caatatgcga ccaaaattga ccgcaaagaa taccacgccg 72180 
tccgttcgac gcagtgcccg tatggacgct gcacgggacc tcaagcgtcc gtgtcagacc 72240 
cattcctccc ttcgccgacg ccgtggcacc tggttcaccc cctgtgggct atggccggaa 72300 
tattgggcct cacgatcgcc ttgtacacga tatatcaaat atatgcacat agaaactaca 72360 
cgacgcttca ctgcgaataa aggtttattt attttgcaca ctagtccgcg tcgttatttc 72420 
tggtcctgga ctgggcgcgc ctcctcctga gcgccccgtc cgcgctggca gcgattccgc 72480 
ctctggtggt ctcgtccatt gacacgtcca cgcggatact aattccggcg agggcgtctt 72540 
ccatctgccc gcgcgtcacg gccttggccg tggccgccct caccttagct tcaatcttac 72600 
gggttgccac gttacataag gccgaaacgg cagctgcgat tttggcctcc ttttcggtgt 72660 
tagagatgac ggggtcggag ccggacctgg tgggatcccc agaatttatc agctttttga 72720 
gctgcttgtt ttcaacggac aatttttgca cctgagcggc taactcctcc attgtcagct 72780 
cttttttggg ggcgcgcgtc ttgggacgcg tggaagacat ggcagctact acgcattaca 7284 0 
gagtacagcg ggacgtttaa atactttcag gcggcccctg aggtcactcg tgcgccacgg 72900 
tttcaacaaa tttaaaaaag cacatatatg ttaggtacag aaatgttgca accgcgacca 72960 
gaacagagtt gatcaccgcc catatactcg aaaaactaga cagcacgggc tggtacgtgt 7302 0 
ccgccgagca atcatagtca taaaaagagg gcgttacgga gggagccgcg gaggggtttg 73080 
tagtcacagg aggcgtgccc gttcctgtag tcgaggaggt ggtgacggtg gcgcacaccg 73140 
gcagacacag atataagcac gccaacaggg ctaacgcgag cacaattgac cctgtcattt 73200 
tcaagccggc gttcggcagg caaccctcca ctttggcacg cgcgctctcc ttatatacta 73260 
ccagttaaac attatggcag aagtgaccgc ccatacggt't ccgtacgcct ttgactcctg 73320 
taagtttgaa ataataccaa aaaacaactc atctcgtata gcgttacgca acaaatttcc 73380 
cgtcgtggtc aaaccgggag aacctctagt cgtgcccctg ggactaaaaa ttatccgcgc 73440 
gccccagtgc gcattctttc tcagcggagc gccgacggat gaggtgtatt accacaccgg 73500 
acttatagat caagggtatc gtggagaaat aaaactcatc gttctcaata agactaaaca 73560 
ggtcgtcaca ctttaccggg gagaagtcaa cgtctcactc atcgcgttta tgtacgcctc 73620 
gcccgggccc ctaaaatgcc cgatactgaa cctaccacat tactctctgg acgctggatt 73680 
tgatgttacg tcgccacatg ccatgaccat acccccaacc gacagaaccc cgtttacgtt 73740 
gagtttgtat tataagagtc cacagctcag caccccccac gtcccactaa tcgtggggcg 73800 
ctctggactg gcgacaaagg ggcttaccgt cgacgcgacc aaatggaccc agtcgttggt 73860 
acatctgagg ttttataact ttaccaaaga gccaatagat ataccagcaa acagccgcat 73920 
atgtcaggta gtgtttatcc acgaagacca cgtcccaagc gggtggaaca ttcttagatc 73980 
ccgcgtacaa ctcggcagca ccctccagat atcgtgggcc aaaatcaggt ttaccgacgt 74040 
ggccacgctc cccaaaaccc acccgctcaa ctcccgccac actcaaagcc aaaccgaacc 74100 
ggagaccgcc cgcggcgcaa aggggttggg gtcgtcaggg ttataatttt aaaacatggc 74160 
attattttaa agttgtttat ttttaataaa acaatccaaa tttacgttaa ccatcagctt 74220 
gcgcatcctg tattattttg tctggcacga tattatccgc caccgccaac tggggcgtcg 74280 
gttgtcgaat caccggatgg gaggggttgg aggacgcttc ggtgtctacc agcacattat 74340 
taatttccgc agcccacgtg acagaatcat cggacgcgat cgtcgtagcc agtttttcca 74400 
tttccaggag tgacacggtc tcgtcggccc gtttattcaa aagcgcgtga aaggcgctct 74460 
cgtccaccgt aatatcccct atagccaggg acaacatggc gttccagata acgttctgta 74520 
tagactttag cttagatatt tgaatcctaa ccgtctcctg gcaccccgcg ggtaccctag 74580 
ccgcgtcaaa acgacgaaga tactcggtta ttttattgga ttgaacggcc aacagatacg 74640 
tctgccgaag ggcggcggtg cacatctctg cctgcttcac atctcccata tccactcccg 74700 
ggggcagacc catggctatt tcggtccgcc agtgagtagg acactcgtta agaggatctg 74760 
gcaggggacg atattgacat cccacgcgtc caaacggact aaacccgcaa cacgtccaaa 74820 
acgaaccagg cgcggccatt gccaaacaaa atggtagatg aaattagggc aattttctct 74880 
actagtggag atatggccga agtaattacg gatatactga ctgaaacgca agcaacggcg 74940 
tccttcttct gcgtgctcca cgatcggggc gacgcgccta taaatactcc acatgccgta 75000 
attaaactct gcctgcccgc caagcgccca ggcggcgggc caaggtgttt accgttgatg 75060 
gtgctgaacc taccggcgtg gcaggttaat ctattcctaa caggtgacgc accattgacc 75120 
tcggataaca ttaaagaccg cattgacctg gctcagaccg aggaaatact cgaacccata 75180 
ttaagcgtac tggcatgcaa acggtccgcg cagcagacca aacatgactc gtttaaatct 75240 
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aaggtggcct ggttcagggc aaagttcgtc tcggccctaa gaaaagtgta caaaatgacc 75300 
ccatccccct attggatgat aacgctgctg ggctcgttcg aggcatcgtt tgtactggcc 75360 
ggcacgtttt attttttcca gtctcatata tgcacggcgg agacgctggt gcatctaacg 75420 
aggttattca gctccagtca gggccagagc ttggtcaccg taaacaccta tgacgaatta 75480 
gggcgcgtat tcgggcggtc agatttcctt ggaattgttc caaatttttg ggcgtatcta 75540 
aaatacaaaa tgcaacagga cgacgtggag tccagggcca tcgatcaaac aatcaactcc 75600 
atacgtgggg ggttgatgct gtcacctcag gacctcgtac acttcatcta cctgtcattt 75660 
tacgagtgca tgaacgcaca gacgttcctg tcgtactctc gtaccacgtc cagtttaccg 7572 0 
acccccgcaa cggttaaccc tccgcagctg tgtcggcgct tagaagcgga ttttaaggag 75780 
cacgtgatgg catattacaa taaagcaagc tacctgagta cttacataac cattttaacc 75840 
gtacccgctc ctctcccgga cggatacgaa aactttcagg aactagcatg tcaatactgg 75900 
tgtggacaat cgagagacgt ggcggaaatc atgactagaa ttaatgacca gtatccacag 75960 
ctgaatctaa caaaagattt atccggtctc cttgacctag cggccctaga tcagtattcc 76020 
ggaggcccca aggaaaacct ttttacggtg gcttcgagaa ttcccacata caggtgcgag 76080 
tttttgaata aacagtattt tgttctaatg cacgcagact gtatagacgc gtactggaaa 76140 
caaaacatta tcgtgcccga agacgcccaa ttgcaaggcc tgacggatca agatctcacc 76200 
tccaggatat tttactgcga cctcggccta tctctaccta catttaaaca acaaattcta 76260 
gtttcacgcc acgagtattt caacccacga ctcccagttt atagatgggt gttggatttt 76320 
gatctaaagg taaccgaagg tagacggact ctaaacgata tctacaacat atgcgtaacg 76380 
ctgcggcagg tcatattaga aacgttgcag ttaataggtc cactaaaacc gaaccacccc 7644 0 
gtgtattttt ttaagtcggc ctgtccagcc gtaacctggc cggatgatat ctcagacacc 76500 
gcgttctgtc actgtgacgc aaaaataggg atgagaattg taaccccgtt tccgagcgga 76560 
tactgtttgg tggggtccgc tccactcgtg tccctgaccg acattttaaa ccgcgtggtt 76620 
aaactggaca cgcggctagc ctcagagtat cctgggatct tagaagataa aggaccattt 76680 
gactctggca tttacgccaa ggggcgatgc gtgcgcgtac ctcactgcta taaggttggc 7674 0 
ccggggggcg agctgtcacg gctcctaaaa attattatct gtcaccccga agagtcagat 76800 
aaatcggcgt atttgaaaaa cgcgtttaag gtttctaatc tgctacatca cgcccctgga 76860 
gactctgtca ccaaaaacgg ccacctggtg tacgcgatca ctgacgaaaa cgagggcttt 76920 
ctagaaagca aaaccaagaa taatcttccc aaaacaatca cggacctggc cgaaaaaatt 76980 
gagcgaacca cagaaaaacc actaattgat tgggcagcga ccgcagtgtg gccaaaacta 77040 
cacgatacca tacagcgttt ctttccggat gaccgcatcg gccaatttgc atctgtgagc 77100 
ttcatgcact ccggagacaa cattatacaa gttaaaccac aaaaaggaaa caactttttc 77160 
tgtattaatc ataagcaccg aaaccacact cagacagtcc gtgtattttt aaccctacat 77220 
tccacaaaag aaagcgaggt cacggtgacc tttatgagtc agtgctttgc tgcaaaatgt 77280 
aatcataata gcccaactgc acatttttca tttatggtac ccataaccgg cacgtaagag 7734 0 
acacgaataa aatcatccgt gatgggccag ctctccggct actttgttaa ttttaacacc 77400 
gcgtaacccc aaaaacaaca tctggacgag ttttgttcgc gaacgtgaac atggcacagg 77460 
caatggtgag cctggaatac atgaaggata ttttggacgg taagaagacc ccttgtggct 7752 0 
cttacgactc ccatttaaaa ccgcaactta tcaaacggct ctttttatac gatttacatg 77580 
cggtatgtgt ttcacgcact aatttgtttt atagggggca tatcatcgtc ggacttcgac 77640 
gactcgtcct cggacgaaat ggacgacctg tccccaacgc cggagccaga accgtccaca 77700 
acgccgaaca gctttccgga ggggcccaaa tcacaagtgg tggcgttacc taaaatccgt 77760 
aaaagatctc gatctgagac gccggtaaaa attgagcaca gatctccact taaccgctcg 77820 
cgatctcgat ccagaacgcg gtccgggtcc ggtcaacgat caaaccagtc aggcagatac 77880 
gtcaagagat tcaaaccaac ggttgatgca ccccgtcatc gagaaccgtg gcacaggggc 7794 0 
ggtaagggaa aggccccgtt tatccgcaga gacgcaatgg ctggccgcgg tcgacgcaca 78000 
tacggccacg actatcgcgg aaaagccgct ttaacgcgga gcattaaaga gtctattaaa 78060 
aagatgcacc ttccatccac catgctctct cgtgcgcacg ataaaaaggt attcgaggga 78120 
ctgttgccac gacacctggg acagtgcttt caggtgtgcc tgccggcgcc gccaccgctg 78180 
caacccgagg tgttcacgga tcgacagctt accgctatag tcaagtctgg cgggcgcaga 7824 0 
gacgcgctgg tggccaaaaa agttagcctg gctaaactaa caagcctata caaacccctg 78300 
ctcacgtttg tgacggggag aaacaaccag gcccactggt tggcgacgcg caaaaacacg 78360 
ctagcgtctg cgggactcga ggctctcgcg gcctttatcg aggaaggtct ggcgtgggcc 78420 
caggtgtgcg tatctcaaaa cagatcgttg aacgacagca acctagatat cattctagac 78480 
agtagtcaga gcgtgtgcac ctggttcatc tccaaaatca ggcatttgca catacagtgc 78540 
ttcttggaga atcagggtga ggttagcctg gtgaagcagc taacttacct cgtgtgtata 78600 
aataaccgcc tggcggaggc ggcgaacctg gcgggagagg tgaaactgaa ttttaaactc 78660 
ggaatgctga ttgggttcgc cctaaccttg ccggcgctcc ttgcagagca taagttgtct 78720 
ggcgagagcc tgtacctgtt taggtctttc ctggagaaat atagaccggg agatgtcatg 78780 
ggactcttaa attcaatagt ggtggagcac tacaccaaat gtcgtagcgc ggagtgtgtc 78840 
attacaaccc acgccatggt aggatctggc gaaaacaaca agggactttt tttctttcca 78900 
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gtgtaatata taacccatgt gtgtaagtag 
gcaataaccc acataacaaa ataaaaatgt 
gcaagggttc tgtgctcgga acaaataggc 
tagccttggt atacagcgga tgttatttac 
cgtaacagta aaccgggtcc aatcgcgcag 
tatagcggtt tatttgaaca tacaacacac 
ttcattcatt caaagtgccg atatatttca 
tgtgattgat cctcctcctc gtatagggac 
tcgggtggcc cgggatcaga gggtcgtggt 
gtatcaaaaa cgcgatgtcc caccccagat 
gtattgccac ggaaccataa attaagaaat 
caactgcaca aaacaatgcc gtcctccagg 
ataaaatcgt catccatata aggcgccggg 
atgcagacgt ggtgttcatt tttatccgtt 
gaatgggtaa aagtctgtgc ttgaaggacc 
ccccacgggt cgtcaacaga tggcgccaaa 
ggcagggcgt gttccccggg caacgcaacc 
tcttcaccgg aggcggttgc ggttgcagac 
gtaccggcgg gacgggtacg aagtagcccg 
ctgcccgcct cgtcggcgcc cttgcgctta 
gccgctaccc cagaggcccg agggcacaac 
ctgaagacca cgtttgcaaa tgaaaaactc 
taggcactct ttcttatcgc acccaatagg 
ccggacggaa gggggatgtt tctctggttg 
aggtaatcaa atcccacaga ccacggggtc 
tcgtctgccc aaaacagtcc cgaatacgtt 
cgaagtccca tggcggacgg accaccgctc 
tatgccaggt tgatactcac cgctttcagt 
cggctaccgc cgtgggcgca gatgtgtgca 
ctaattccct cagcgcccag tcagctggct 
cccatcatct tcaccgctcg ttcccgcgtc 
gcccagctct actgccgggt catgcgattg 
gaatgaacag tcggtgcgac cctctgatgt 
tgtaccgcat acagacgagg tttcgcacac 
gacgggattc ggcgactgga ccgacagaca 
ttcgtctcca gggttcgggg accttgcaag 
ctgaacgggt tgcctggatg gcaggtcctt 
catttcccgg ttctccaggt tcttcacgca 
acccttctcg cactttttgt gcataaccct 
gacccccggt gacggtaaaa acaaagtcct 
gaggacggga catggtcgac tgcagacccg 
ggtctctcca aaatatagaa acgttaattg 
acgcttgttt ttgtccatcc gactccgagc 
ttcacgtcca gacccacccc cctggtcgta 
agtttcggtg gaagacatca gattacacat 
gcgaaacagt ctatatcgag cacccggtcg 
tcgtatactt ttgtggtgtc tcatgtcgta 
gtgggacggt ttttcgccct gacatatgcc 
ctcgctattc catacggacc gatctgcagt 
cttgtcctca ttctcccaaa ccaatccgcg 
ccaggccctg atatcgactc cacggcccgc 
gctctcaaga caaatgaaac ccttgcgttc 
acggagtcat gtgccctaaa catatggcta 
tgacaaacac ctggcactca ataaaatttt 
ggttagtatt aaccccatcg ccaccatatt 
ctttattaca gatagaaggc gctcaccggc 
attaaacatt agcggcggac aacgcccgcc 
gcggtacagt gctttgcact ccgccctcgg 
cacaagcttg tggaagcgcc gctccgtccg 
caaccaagtt aaatgcacga cacgcgttgg 
attcgacgtt cccttgaacg aacgccaggg 



ttacggtata ttattcacgg gcgtttaaat 78960 
gtaaaaccaa acacgcgtca agctttttct 79020 
agagttcgcc acctagcggc acgttctcta 79080 
ggaagtgcca catgaaaccg tggccgggtg 79140 
tttacacaac acataaaata ggcgaccgtt 79200 
atttcaggag tttcggtctt tataaaaatc 79260 
cgctggtgga caacaataca tatcggcgcc 79320 
caggcagaca cccaaatctg tgcaaacgga 79380 
gactgggcta actttaacat gtattcatcg 7944 0 
ggcacgcgtc ttggttcaga ggtgtttgag 79500 
ccatagccat tcatcataat cccatactga 79560 
gcgtgtaacg cgttcacaag tctgctcaac 79620 
ggcaaccaca actgaagcgg cccgggagcc 79680 
ggtctcgtgg ataaacgaac gcccatccca 79740 
ccataataat aaacctgaat atgcataaat 79800 
cacgcagaaa gaccgctcga tgcagatagc 79860 
gcgccccctt gactattgtc ctgcctaccg 79920 
gcctcggtgg ccggggcggt catggtgaag 79980 
agcagctgct ttaccgtgta ttcagatacc 80040 
aacatccgaa acctcaagcc ccgcagttct 80100 
ctgcataaca tctcttcatc ggagcgcaac 80160 
cgtctcaaaa cgtccttttc ttctatgaaa 80220 
cgcgccttgg cctggcccag cctagacctc 80280 
cagtagtcct cgtacatctt cccatcccga 80340 
gtagtcgcca gcacgaccct ggttttttcg 80400 
gcaagatttg cgtgagtaac gatccattct 80460 
tgaggtgggc gccaggtagc catggctgta 8052 0 
tcgtgttaaa ataaccaaac tccgcccatg 80580 
gccgtcacaa aagaatccca cctgcggcaa 80640 
accggttcat ttcactcacg agcacgccac 80700 
cggatcgtcc attgaatctg gtccggagtc 80760 
gctcgtcccc tctcctggat cttccatctg 80820 
tacagtcgtc ccggggtaaa aacaaccctc 80880 
gtaacgcagt ctcatggcga tgggcgcatc 80940 
ca gggcgatc tgtgcataat ctctaggtgg 81000 
agctctaaga taatccacca tatcaaaaac 81060 
atactcctcc tcattgttag taagcacctt 81120 
aatgccagta tccgtaaggg taatcaacac 81180 
tagatcttct ctggcaaact gacagtcaac 81240 
ctcatcttgg aaacaacacg cgtggccagc 81300 
tattccagca tgaacacgtt cgaggccaac 81360 
gatggcccct ggcagccgat gctcgcgcac 81420 
caataccgtt gcggtaaaaa cccgtcttct 81480 
ttggaactca ctgattaaac caaggcactg 81540 
tgcacaactt accaccggtt ccggcaacag 81600 
cccataggcc ttattaataa atttaagctc 81660 
caagaatctc attttcctga atctgccata 81720 
tcgcatgttg cagtagtcat caaaaaactt 81780 
tactttgttc cacgggaccc ggacaacggt 81840 
gtactcgccc gattcgaccg cggcaacgag 81900 
cattccgcgt attgggagct tgacggatca 81960 
acggcaccag ttcataaatg tatctaaatg 82020 
agactactca aactgtgctt aggggaagag 82080 
gaggtcgggc attgacccgt gccttcctct 82140 
caaggaacaa aacactccag atacaaaaca 82200 
ctcacaaaac cggtcgcgcg cacaggtaac 82260 
acagacactc gtgccacagc tgaataatta 82320 
gtgtaggtac gcccccaaaa tataggtaaa 82380 
gagatgttct ggccatggct gtcatatagt 82440 
cgcacgacag gaccgcaacc gcccccgtag 82500 
aatgccccaa ccatctaatg taaatgccgg 82560 



aggggtcggc ccaaaacgcc aacccgcgca aaagatcttt atccaggaag cgcaacgcct 82620 
cggcctgaaa ctctgctaag tggcctggaa tctccggaac cagtgcctgc tcggcgatcg 82680 
gtgcggcgca tatgtgccct tgagggcgac gttcggacaa tgggagcaca cggatgccgt 82740 
tggacgattc agtcaccact tcaccaactc tgtccccata atagtaaaac attatacgta 82800 
gcatccaaca ctgttcaccc ccggctccca ccgcaccagt gtaccgaaca ccggcgccaa 82860 
ccggaccaaa cttattcacg gcctcctcgc gcaacccccg taacatagca gtagtgccgg 82920 
aggcctgatc gcaacgagca cacgacctca cggtaattgg taaaaggcga atgattcggc 82980 
atctgtccgg atatggagtc gaaaggttgt ccttagttga cacgtccgac acggtttgac 83040 
tatgccgtat ggcgctggat agccagtttt tgcattcctt agcggttaat tccctgccag 83100 
ccgcgtgcaa gatgcccctg gatcgacagt agtcaataaa aatgtttttt tccgcctcct 83160 
cgactcctct gctccctctg tcgtgattcc acggtatgcg gattaaagtt ctttcctcgt 83220 
caacccaccg cattccggga tgacgacctg tctcgcagca ttccaccagc catcctcgaa 83280 
ggctggatgg tttggtcact cggaccggcc gctccatatc ctcgccgcgt gcacggcaat 83340 
gccaaacctt gtccaagtta cctcctgaca agcacctgcg accctcacag acctacgcgt 83400 
ggcaaacggg aacttgtggt ctagtctgct gggacagtac cttagcgtta tttt tatctg 83460 
tcgtgcactc ctgcggcgga agtttgtggt ttatgagcac caccgggaag gattcatgtg 83520 
tccctgcacc agctcgtcct gacagaccac cacagaaata ggaacttgtg ggctctcttg 83580 
ggcaccaggc tgctgtgaga gatacagtgt tgccccgttg cacggcgcat ggctctctgg 83640 
tcccggagag cgggcaagct ttctcatgaa acccatgaag ttaaaaattt tctgtttaga 83700 
gaggaacatg acctctcgtt ggataatatc accattacag tcgttacccg ccgcgtacag 83760 
cgtttccctg ttcctattat cgacgtacac tccggtttcg ttgccgtgta cgtagatgcc 83820 
ctcgttatac gcgaccagca tcgtttttat ttgcgttatc tgttctgcgg tcagtttttc. 83880 
acgatctgga gatggaaacc agacagtgaa cccggtgcca tagaaacaca gatgaccggg 83 940 
atggggaacc gggcgcggat gcaaacgaac acctctccga tcttgggacg tgtgatccag 84000 
tgccagaacg ccaaaataaa acacctgaat tctcaaccgc caaaacggca caatttgatg 84060 
aggcagcggc gcttctggga ccccgagacg atgtgctcgg gcgtacgctg atctcgtcgg 84120 
a 999 accaac gtgggcaagg gggcgggcag cggcattccc ggctctaacc cgatgccaca 84 180 
cacctgctcc aaataattta taatcccttc agagttctca tcaaggatac acgcatagca 84240 
tccagaatcc atcaaaggtt ttacccggaa ggcggtccag ccgccaccgt gcgtccgcac 84300 
gaccccatgc tccacttctt gaaatccagc attctccctc aaacccctca aaaatcgtct 84360 
cttacactct agcaggtttc catctagccc aacctgcaga cctctctcca cgcagtaggc 84420 
gacaacggct ggagttccag ctgccccctt cttccaagac agcttaaagc tttttttccc 84480 
tcatcgcacc aagttaggtc tgaataggtt ttctcgttga gatggtaaag ggtccacgat 84540 
cgtagatggc tgtagttaga atctcgattc gccatgacag cgtacaccta tacaataagg 84600 
cgcgatacga gtgtactttt cccctgatga tatttgcagg gctctatctc ctcaaaacgt 84660 
gcacctggct aagcaagcag ctctatttct gccacttcct catttataaa accatattgt 84720 
ttctggtctc acgttttgca gctcacctgc aacgacacgg tgaatgatcc cgagtttcgc 84780 
tcctccagtg ccttgacaaa cactggccca gagttaaaaa gtgaagcaaa aggcatagct 84840 
ttcgatttca gtcaccggca gtgagtggtg gacattagaa agtgtttggc ccacacattc 84900 
agctgtgcac tcaagcccga ccaatatggc acgtgaccat tcccctatca caacactagt 84960 
acagaaaaaa cgaaaccaca cgcgaggtga cattttccag gttagagaaa tttattaagc 85020 
cggcgaatcc atctaacaaa acagcaaatg tttattcaaa gtgcctatag atttctgttt 85080 
gataaacaat aatagatagt ggagccctcg aagagtttcg ctcctcgccc agggaacaac 85140 
ccgaaaccca gagtttaaca aaggccgcgg ggggacccgt gtcgcgcggg gagggggacc 852 00 
tggccatatc aagcaaataa cgatcagtgt caaatgccca cacgtgcatc ccggtgtagg 85260 
cacgatgtgg caaactgtac ggttccacgg tatttccctg aaaccacacg ttaggacccc 85320 
cggtaatcgt gaacataatt cccctgctgg tgctgcagaa aattattcca tccacgagcc 85380 
atttcaaagc gtcctcgagt ctctgcacca cagacgactc agtcgtctgt ggtatcggag 85440 
gaaaccacgc ctgaaggggt ccatccaaaa cacacacgtg gtgtcccgcc ttatttgttg 85500 
gttttgaaga tagcctgatt ccctgcccgg tggctgtcaa cagctcagcc tgcaattgcc 85560 
catagtaata aatctttata tgcatgtgac cccatggatc gccggcagat ggcgcagcac 85620 
acgccataag tctggcgcgc agaaggcccg aagaatacgc tgcatcgtcg tcctgcgcgt 85680 
cgtgtccatc ttgatgacct tcccgatacc gtctctcttg atcacttacg gacggtcccg 85740 
aaa 9ggcgtg aagcattccg cgcccgagca tggcaaaccg catattgcgc aactcaaggc 85800 
ggagcgcgca cacgcgcgga cacaccggac aagacatctc ttcggatgaa cgcaacctaa 85860 
agaccagcgc cgtgaacgga aattgatcgg tagggaattc cttatcttcc tcaaagtact 85920 
tgctctttct tacggctccc agcaggcgcc ccctggcttg acacagcctg ctccttccag 85980 
acggcagtgg gatgtgtctt aactcacagt aggcatcata gtgttgaccg tcgcgttgat 86040 
aatcataatt gggaaacgac ggtggtgtag ctgccaacac cagcctcgtc ttttgctcgt 86100 
ctgcccaaaa caggcccggg tacagaccac tatcggcttt gaacgtcagc cattccttga 86160 
gggccatgcc ataacgagat gggccccttc tcgat tcccc agcggccatg attcagtcgg 86220 
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caacgtttaa aacccatgta aagtttcagt tcaatttaaa atgacacaac tccgcccatg 86280 
ctgtgacaaa cataaagaaa gtggcacacg tgtgcgcttt ggccgtcctg tcagatgaac 86340 
cccgccggta acatttactt cctcccatac ggtgcgaggg tagatacaga tgtgcttccc 86400 
gttgttcccg aagcaccgac acccgctcca gacggtccct cgtccccgct gtcatctccc 86460 
gcggcaccgt tgctgacatc atctgacaca tcatcattgc acacgtagcg aagtttcagg 86520 
gcaatgggag catcccatgt attttcggac tgagtagacg ggcacatgaa aatccacgca 86580 
cagtcacgcg gcgggacatc tcctggagtt ggcgatctcg ccagctccct caaataatgc 86640 
aagagatcga acgcctggag cggctgcgaa ggccgcagct catagtaatt ttcagaatag 86700 
ttggtcaatg ccgtcatgtt gcgttcttcc aggtttttca cacagattcc agactccgtc 86760 
ataacacaga tcaaacctct ctcgcatgtt ttctgcaggg cctgcaaatc tgccctggca 86820 
aactcgcagg gaatcaactg ggacgacggc aagaaacacg tcaatggatt atcggcacaa 86880 
cacaggtgtc cgggacgttt gggatcagga aggccacata tacgcacacc cgagccggct 86940 
cgcaaaattt gaactccgac gttctctcca aagtaaaaaa aggatacctc aacagcccct 87000 
agcacaggtt gcgcagaaga gtcctggagt cgtttggatt tccgtttata aacggagact 87060 
ctcagtggcg ccttagccct cggccgccgt gtcctctcga aaacatcctc tcgaacgcaa 87120 
ctgcccatgg aacagctatg caacgatgtg gtcgcgatga ggttacacag atcgcagccc 87180 
accgtaacct caggaagcaa ttgaaagagt ctgtatcttt ccccagagac gcccccggcc 87240 
ttgctcggcg gtgtcagctc cctcaaatac ctgtgagatc tcactgcata taacatccta 87300 
attttcttaa atttctgggc atgtgaaccg gtttgacaca cgccacgcgc tgagcaaaaa 87360 
tcgtcaaata tcttgttata atcttcatcg acgcgactgt cggtgcaccg gttccacggg 87420 
acacgtataa tggtcttgtc ctcgtcatcc cattccactc cggggtatcg cttagactcc 87480 
acggcctcga tgaaccaagc ttttaaatcc atatcgcgtt ccgccatctc gcgtattcaa 87540 
tacctctgac cgagcggctc agttaaaata ccgagccccc agaaccccag gcctccgtaa 87600 
caaaaaggat ctagatctga aagaagccta actatatacg cccacgggaa aactctgtaa 87660 
caaatgaaaa aaatgttctc agaaaacgaa gagaggaaaa ctgacgacac gcaatctagt 8772 0 
caaaccagaa cctaaaggcc gtcactgacc catatctttc cccttcccga gcgagtgcgg 87780 
tttctcacat gccgtgagca aacgaaaacg ctacacataa taagacacgt gtgagaggaa 87840 
aactttattg cagggacagg gcaaaagcaa gctgtgcacg gtaacagtat gtgtcactgg 87900 
gccgcatccc cgcacgcccg ccacagacac tcgtgccaaa tttgaatgga cagggggatt 87 960 
gtgggacacg gctcgccttc cggagtggga acgcctccta ggtggagaga cgcataggcg 8802 0 
gctggaaatc gcgttccgtg tggcggggtg gctgacaccg cactcagata ttgggcgagc 88080 
gaaaacacgc gcagaactcg cgcgttgtta attatatcca ccggctccag cgaacctccg 88140 
ttacccagca ggaaggctgg aacgtgaccc atataccgaa tgtaaatccc agacccgtct 88200 
gcgtaaatca tcaacccgcg tatgatctcc ttccccagcc actttagcat gctcgtctgg 88260 
tgaggaaaca tatcctgtgg ggtatgcggc aacagggcct gttccggccg aatgcccgca 8 832 0 
caaacgtgtc cttgcagtgg tcgctttggc gccggcaaca ggcgaacgcc gttgggcgac 88380 
ccgacctcca tgttcccaac caaccgccca tagtaataaa caaccagccg tagcatccaa 88440 
cactgctccc cgtcgcctcc cacggcgcct tcgtactcca tcccggcacc gagataccgg 88500 
aatctggcgg ttacctcgtt gcgtacgtcc aggagcatgc cgccggtggc atccgcttcg 88560 
ttacagagcg ggcaagatcg tacaaagatc ggcaacaggc gaataacgcg gcacctgtta 88620 
ggcgccggcg ccgacaggtt ctcctcagtg gatacatctt cgacggtctg actgtggcgt 88680 
atagcgctgg ccaaccagtt cttacattcc ctgacgctca atcgtctgtt accggtctgc 88740 
cgtatgcctt taaattgaca atagtccaga tacatgtttc tctcatcgtc agagacgcca 88800 
cccgcgccct ttaaatgatt ccaaggtaaa cgaaccagag tcttttcctc atccagccaa 88860 
tgcattccag gatgcttatc attatcgcaa caatcgagca accatgccct caggccagac 88920 
ggccggttta cgcggatcga ccctgcccgt ccttccgcca tctctgaggg tccgctcgca 88980 
taaactggct taaaaatcta tccccccccg tgcacctgta ctttacagtc acaccttcgg 89040 
gcacacgccg tagggctggc agggacggca cccagcagct attttcatct ctagcgccgc 89100 
ctactagcgg aaatctgcgg tctacaaaat ccaccgagcc gcccgaatgt caccatgagt 89160 
caattcatcc tgacagatca ccaaagtgac agggacctgg ggtcttttac tggagccagg 89220 
gcgaccccca aggtaaagag tgactgcgtt gctgggaaca gtgtcgccgg gaacgggtga 89280 
ccgggcgagt ctgcgcagta gacccataac atagaagatt tgagatttga cagaggatcg 89340 
cacagcccgt tgtgcgtggt tcccctctgc gtcgttccct gcgcagcgaa ccgtgtgcct 89400 
ggttctgtta tccacataca tccccagctc attgccatgc aaaaagacgc cctcctcaca 894 60 
cccgtttata atctcacaca cggtttctct ctgctccggc gttagcggcc catccaactg 89520 
aggttttgga agccacatct tatacgctgt cccatagaaa cactcgtgac ccggtttggg 89580 
gtcttggcgc ttatgcaatc gtacccctct acggtcctgg gacgtgtgtt ccgctaccag 89640 
cgacccaaaa taaaaaactc gtatacgtag ccgccaaaat ggagtgatct ggtctggatg 89700 
c ggtggtgca accgtcgcca acctggcggc cctggcatat acggaccgcg ccgcaccgtc 89760 
agcctccgcc ggagcgggag gaggcagggg cgtccctggc tcaagcccat ggacgcaggc 89820 
cgcctctaag taatttagga cggtctccga atgactatct aagatgcagg cataacacag 89880 
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cggatccacg gcgggtctca agtaaaaaaa tctttctccg ccaaaacgcg ttgttgttgc 89940 
ccgggcatcg tcctgttcaa acccagcgtt ttcccgaagc acccttagca gtcttttccg 90000 
gcactccgac acgttgtgct ggcgtccaca ttccagatcc ctatccaggc agtacgcaac 90060 
cacaggttgc atgccactca tacctcggtg ccatgatatc ctaaagctcc tcttttcctc 90120 
gtcgcaccaa cacaagtgtt cgggaaacgt gttcgcttcc aaattggaaa taatccatct 90180 
tcgcaggtga ttgtgagtga tttctatctc ggtcattgtg aacttggtcc acgaaaacta 90240 
actcaaaagc cctcaccact cccctaaact gtagctgaat tttatgtcca ccttataggg 90300 
tgcacgtgta cgtatatgcc ccgtgaaaac ctcgcacttc ctcataaatt atccccaaag 90360 
cgtttcgggt cgctcaccta aactaactcg agcattacac ggttcatagt gtcccgggtt 90420 
tcgccactcc cccaccgcga cagagaaacg aaacgttcaa tttaaagggc accaagctca 904 80 
actttattta aggagaaaaa aaccaccgcc atcagaaggt ttccgcaggc acacaccccc 90540 
aagactattc ttggcgagtt cgcggcacgc tgtagttttt ttctgattat ttgtaattcg 90600 
catccaatcg ccccaaacat agtcaataaa aaaaccgtaa acacacacac tctggtgaac 90660 
aaattatgcg ttcctggccc caagacgtgc gtgagcgtca gcagcacagc ggctgctaaa 9072 0 
acataaacca gggctaaaac gtttgaccga aacacgagtc cgaggcaaca aaaaagccca 90780 
acgtggaaga caaaaaaata gtgtacaagt ccgaggagca gaggcgtcgt caggccaaca 9084 0 
gacagcgtac ccgcggccgc ccctccaatc aaaacagtca cgagataaaa gtcccatcca 90900 
cacacctcaa gcgcttgctt tatagagtat gtgatgtacc gcctagaggt gagcgaaaaa 90960 
attgcatggc gtcgctgcag gcccgcctca taaagcggat gtgaccgcag cgctctgagt 91020 
gaacaaacgc cacctactat aaaggccacc aggggcagca aaaacaccgg ggctaaaaaa 91080 
aggttcttaa gaactaaaaa ataatacacc gagaacgctg ctaaaaatcc aagttcgtag 91140 
tatctgtggg caatcgtggg gcacaaatat accacgtcat tcgccgcaac aacacagagc 91200 
agcaggcaca ggcacgaccc cacaaaaatg gaaacgtgaa cccatggaag gtcaacgccg 91260 
agggtggatg ccatagaacc ggcggtaata agaaacgccg agatggtgca cggtacatcc 91320 
acctgccgcg agagccaggc cggtagccca aatatacaga atagcaaaag cagcccatgt 91380 
aagccaaccg cgccccagga gaatataatg ttctcaacgc taatcagggg ctctacaaaa 91440 
tacagcgagt gataaataaa acacagaata aacggggaag ccgagaccgt cccgctcata 91500 
aaagatagcc aggcgaggga cgcctccgac gtgtacgtgc ccatggcgcg gcgtttaaaa 91560 
caacgggttg aacgtcgcgc gaggcgtctt ggtttttcct ttcccgcgac tgggtgagct 91620 
gctgtgtttt cgcttgccgt gttttttttc cggtcgagag ccgctcgtcc gcagtgattt 91680 
ggcgcccccg tctgtcggct cggcaccgtc ctcggcaatt gggacgcact gctcttttaa 91740 
gtccaggtca ggcgttctcg gaggggacgg gagctcagga accccctcgg gatcggcacc 91800 
ttccgagaga ttgcccggga cggcattatt ccggtccgtc ggatctacct ttaatagcat 91860 
gacgcccaac ctgacgtcag caaggctatc tgtcagcggc aggccctcaa cttccagaat 91920 
ctcattggcg tgccagcgga aaccaggcaa aaaaaatccc ggaatcttac atgcgttcaa 91980 
tgcggccacc agagactcta aactcacatg agtcgtcagg tcgctagaaa ccacacccac 92040 
gtccagcttt cttcctctag cgcaagtcca cttggcgtcc ggggtgtcgg gtttaaaatc 92100 
aataatttta ctaaccccgt cgaccgtaaa cgtggcaact gccagcgttt cgcttaacga 92160 
gacggtgacc acctgcttgg ccctggtttt ctgctgcctt agccacttta acaacaaagt 92220 
cagcgtctta accgtaagga acaggcgatt ccccgtcccg gcctgctcta gcctgtcgtg 92280 
gaacgagggt aggttgggtt cgtccacgac gctcttgtgc acggtgctcg tgtgattgtc 92 340 
cccgtaggcg atggtcgtcc taacaaactg cggctttcct tccgcgtccg actcgtcccc 92400 
gtggtggtag aacgccacgt tcatttccgc aatgttaact ccaaacatct cacgggtgtg 92460 
tgtgaacgtg ttcccaaggg aggtgtttcg aaagctaaag ctaacctcgt ccgtcatgtc 92520 
agaatatacc atcagcggcg ccagtgcatt tttaagcctg agaccgagaa cgcccgcgtc 92580 
ccccacgctg gagagcacgc taaggacggg ggccctgccc tggccgctga tctgtataac 92640 
gcctttctta acgatgccct taatgtggtc gtaaaccctg ctaatgctgc caagcgcatc 92700 
gacgtccacc ctggcgccgt aatgaaacga gacaggcatg gctaaggcga aattaacacc 92761 
ctaaaagacg agaagagaat ggcaaacacg acagacacag cgttcaatgt tcaacctttt 92820 
aatagagcct ctggcccgga aaatgagtaa catgggacac atggggacta cagatcatta 92880 
tgaacggatg tgacatattc agaatt-atca cgctcaaaaa agttcacact cttggcactc 92940 
gtatacgaca agggacacgc gtttgggggg cgggcaccga acaggggcgg tatgtttatt 93 000 
gactttagaa tcctgtcggc cgtggcctgc aaaaactgct cgatgtctcc gacgttcacc 93060 
atggtcacgt ctcggctctt ggaacgaata aacagagtct ctatctccac ggcttcgcga 93120 
aagaggccgt ggatgtatgc gacggagggg gattcgtctc gggagatcat ggtgttgtac 93180 
agcaagctgg cggccctggt gtgtagcagc tcgtccctgg agatgtagtc gttagccagg 93240 
cacaccccgc gcatgattcc gcgaaggcga aaaagtccta tagagtaaaa tgaacttata 93300 
aagtatatcc cttcgataac caggaacagc agcaccttct cggcccgcgt ggtggccttg 93360 
cgcactcggg aatgcagcca gtcaatcttc ttcgccaggg tagcgtcctc gactattttt 93420 
ttggcataaa tcattaagtc cccccgatta ccgccaaaaa ataagttcag tatgttggca 93480 
tacactttgc cgtggatgtt ttccatggct acctgttcgg cgtaatagtg ggtcacgtcg 93540 
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tgattgctga actctttaac taggtcttcg atgttaaagt ttaccagact ctctgccata 93600 
cccaaaaatg tgaagagaaa ctggtaaaac tccctatcgg tgggactcaa tagtcgcacc 93660 
tcatcggagt cctcgtgcag ggaaatttga ctggggaacc atctgttctg ccatgtctcc 93720 
tgggtgagtg ctaaaaagcc agggtggtcg ctcgcgtaca gaaaccgatc tggggactcc 93780 
atggcggcgg ttactattga caggccaaac ataccttggt ccgctcaccg tcggcctttt 9384 0 
gtacgcagtt agactccgcc ccctctgaag agatgttgtc acccacgacc tgatctaagc 93900 
actgcagggc attaagtttt gtctgtttct gaattcgaca ataatacagc atggttttta 93960 
accctagcct gtatccggtc aggagcaagt ctcgaatata actcgcccgc gtggcctgct 94020 
cctcctttaa aaagaaatta agcgactgac tctggtctac aaatggagcc cgcgcaccgg 94080 
cccgctccaa ctgcttaatt gggcagtaat caaaggccgt caaaaaaatc ttgtatctgt 94140 
ccttcagggg ttctggaaat gaagccacgt , cacctccgta acggcgcacc gttcgcaggt 94200 
caccaggctt cactctctta aaaaaagtca cgtttggctt caagatctct tctttgctgg 94260 
tgaccttcga tgcgatattg gcaaagaacg gataaaatgc ttcggtgtac cccgtaagct 94320 
gagaggttcc ggccgttggc atcagcgcca aaaactggct gttgaaaatt ccatgctggg 94380 
caatgctgcg ccccagctgc tcccatctct ccaagggagg gtgggacggc ttaacgccgt 9444 0 
cccacgtttg ccagtgaaac acaccctgag ccaatcgact ccgctcccat ccacgaaacg 94500 
gagtcccctc gccgagtaaa acaatctcat gactggtttg caccgccgta aaatacattg 94560 
cctgaaaaat ctccacgtcc agcttggcgc tctcggcgtc aaggtagcca aagcccagtt 9462 0 
cggcaaacac gtcggccagt ccttggacgc caatgcccat tgatcgttcc tcctggccgc 94680 
gcctaacgct ctcggtaggc gccgtcccgc ccaaaatgca cgcattgaca atgataactg 9474 0 
cggcctccac ggcatcatcg agcagttcaa aaccaaatgt tacatccccc ttcccaggct 94800 
ccacaccaga ctctcccctg tggggtctga ggcactttgg taggctaatg tttgccaagt 94860 
tacacacgga agcctgaccc tcgggttgct gcacgatttc cgcacagaga ttagaacagt 94 92 0 
ttatggcgct gccttgcgtc tcacaccagt ggtgtttgtt gagcgcctcc tttagcagga 94980 
cgtagggact gccggtctta atgacagtgt taataagggc atacatcatc gattttaacg 95040 
gcaacgagct agagtgtttg ccagcggcca ctagtctgtt gtattcaatc tcaaattcgg 95100 
caccgtagag ttttaggaga tttggcgcca cctctggcgc aaacaggtgc cactggccat 95160 
ctgggtttgt ttcgtacagt cggaaaaaaa gctccggcac acacacgccc tgaaacaggt 9522 0 
tgtgacaccg ctcctgattc tccggcatct tcgcgttcaa aaaatcacaa atctgatgat 95280 
gccatagttc catgtaggcg ctggcaccaa caggacggat attgttatcg ttgaaatacc 95340 
caacgtgggc gtttattaat tttaaacagc tggtgatgtt cttgtgttcc gcaaaagacg 95400 
agacatctat ccccacgcct gacttgctag cgagaagggg ggacatttcc tcatgaagtg 95460 
ctttgagggt tttgtcttcg gtcgccatgg acggctttaa aataaaacag ctagaaagct 95520 
gacctccgcg aagcccggct gaccttaata caggcgttgc gcagcacaca atctgtgacg 95580 
agatgtaatg gaacgcgtaa ccaaccaggt acatctcatc cagctccgtt tcgctctcca 95640 
ccaggtgtct gagggtctcc cgcaaacacg gaaattttat acactgacag gccacaaaaa 95700 
cagccaccct cataaacatc tgggccacgc tttcaaaaat gggtgaagaa ccctgggttc 95760 
tcagcacgta cgtatcgtaa aacctaacgg cagacaggta gccgcagtta acgaaatttg 95820 
tgtatgcctt gctctgctta aagtcctgta aaagaccgtc aagggccgct tcgtgttttg 95880 
acataaacgc gcgaacctcg tcgcttagcc tttccccgaa taccgcgaga tagtctcgca 95940 
ccgtaaccag gcacctgtcc tccattattc tgtgccaaag gagaccggtt agtgagttgg 96000 
cctctatgtc ccatccaagt gttgccttta aagaattcac gagtgattcg gcgcatcgtt 96060 
cagggtcaag gcttgagcta caccccccag ttcccgcgtc ggtaactagg gttaaaggtt 96120 
tggcagattt tgccgcagag aaagaagttt cggtgttcat tgttaaagaa atagcttcca 96180 
agaccccgga ttggaaacaa actccgtaaa ttttaacacc ggtaaagcag cgcctttaaa 96240 
gtgaaggctt tgaaaagatg gttgtaaacc ggaaggcacg cttccaagtc tgcaaactac 963 00 
gccgaacgca agcctattta tatacaggtc attctgcagc tgaatgtatt tggtgcgaat 96360 
cacgccgctg taaaaatccc tcaattgggc agctatttca caatatcctt taccagactt 96420 
aaaaaaccca agtcctagcg ccggtctatt atttacccta atgtaggtta agcaaacata 96480 
aatagacgag tgggcgggca aaggctcgtc gcgcaggctg gacgggcatg agtcatctaa 96540 
atccacaaac atgtcactag gaagcgtaag gccaatatgt gtaacaacgg gctctctggc 96600 
gactacgttg ccctttaacg cagacgtcac cttggtgaca aacgtactgt ggaccgtttg 96660 
aaccaacggc ccgaccggcg caagaaactg atgaagcgag ccggtttcca acaattcttc 96720 
aaaattgggt atggcgtcaa gtagaccgct ctcgtggccg taccaaacac acgctattct 96780 
gttggtctgg ggggcagagt ccgcgtccat cctagacagt cgcgccagcg acgtaggcgt 96840 
gaataacatg tcaatggagg acccagtgtc agtctgttta aaggaaaaca ggtaggtgcc 96900 
ccgaggttcc tgtgaactca tggtctgaga ataaatcaaa aaatctc.cat acgtttgaca 96960 
tgtaggcgag acagataaaa atccatcttt gatggcctcc accccagtgg tggtcgacac 97020 
cacatattta gagagcagat cacgaacacc cttagaaaag tcgcgaccgc gagataacga 97080 
aacccggtga ggaggcggcg gcagtagacg catcaaacta tcattcagct tgttcacgtt 97140 
tgcgtccctg gtcttcatgg cgtcgtcaat tcctgccgca cgggctgata acggtgacga 97200 
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gaacaccggc gggctttaca aattaactga 
acagcttaaa ctcctgatgg agttccaact 
catgcccacc gtgacccggt ttttaaatac 
atttattcaa aaacaccctg tgtttttttt 
cattacagac caccaaagcg cagaaacgtc 
tcttaataca gctattcgaa aaccacacga 
cgacaaccgt gccatattgg ccatgattac 
gccctcgggg atcccaccga caccaccaat 
aattttacac gaaacccacc ggcaatactg 
cgacatcggc gaagtcgcgt ctccgcttca 
actgcaactg gcaccaccgc ccctattccc 
taccggccac cacgaattgt ttgtccccct 
gccagtatca aaacagcgag cgtttgagat 
agttgacggc acaccaattt tagcgttcac 
actgttattc ctgtacgact tcgtaatcga 
ttcgcgcaac gcaattgagc attttattaa 
ggcgttcatt gaaaaaacgt gtggctatcg 
cgttaaggcc aggctcgcga gctgtggcct 
aatgatactg atgacaccgc acgagacgac 
agaaatggta aaccaattaa cactatacgg 
cagccccact agcatatcgc tggccaaaat 
acagagcgat cgcgccctgt ggcgcactcc 
attgaacaac gtgctggcgt tttttaagcc 
ctacaaggca ataccgtcgt acctaatgag 
gtggggtaac atcgcgctcg cagagagcgc 
agaccagggt cccgtgtccg cccaggttat 
cgcgacagat tatgacgcag ccatcgtctc 
aacaaaactg tatccgatcc ttagcgaggt 
gctattccaa attagatggc tcatagtctt 
tatcagacgc tcgctggccc tggcgtattt 
ttctccggag tcgttttaca acctgctgga 
acaggtgata ccggaagcca cgtgcccaca 
aaacattcca atagcagcga gcttcattca 
aaacggcatc cccggtatac tggaccttgt 
actactgtac gattcaacgc tagacaccgt 
cgtcgtgagc atgttcgtat tcaaatccac 
agccgttatc gcgttaactc aaacgtccga 
gcaacacctc acgtacatgc aaaagatcgc 
atccgtgttt cacgaaatac acgagggata 
catgttacac gttacgggaa gctgctgcta 
tcaacccccc ctaatacccg ataccatcgt 
ggacacaacc gacgacatct taaagagcct 
ggctgaaaac gaaagtagca gtcgactctc 
gtacgacgac tttcggaccg cctccacaaa 
ttcgggtaat tataacgaaa cacaagtgtc 
atacaccgca cccgacgata cgttaaaatt 
catgtttgca gaataaacga tggaattgcc 
cgtggcaacg acgcaccagg ccgactgcag 
aagcaactgc gttatctacc tcgcacaaag 
cactaacgac ctcgacgacg ttttacggca 
gtccgggacg ctcggctata accaatatgc 
gaccaacgag tggaccgcgg ccatctttca 
tgacgcggcc atccgcgaac ctttcatcga 
cgccggcacc gttcaatact ttctgtttat 
caaaaacaaa acgttttacc tgttcgatcc 
ggcacacgta atcagttcgt cagaccccac 
cagagaatac actggcagtt ttctatacat 
ctacatcacc aaccactata gaactataac 
tatatccacc ggcatagagc cgtgcaccat 
ggatgtgacg tcaaaatcat ccaacctcgc 
gagcgccaaa cccccaccgg cgacgctatc 



caacctccta acgtgcaccg gatcgctaca 97260 
aaaaccacta ccaaccgcac accttttaag 97320 
tgcattcaaa atagacaacc ccctggtttc 97380 
aatgagagtc gccaggcttc cggagccagt 97440 
aacaggcata ctatccgagg ttgtgaatgt 97500 
gtccccagcg gctaaagaca acgattatct 97560 
agaatacatc catcacgtaa cttcacgtac 97620 
gggtatcagc catctaccgt gcgtagagca 97680 
gaacctaacc ctcccggagt cgctatttat 97740 
gacgtggcta atcctatcgt attgtaaaaa 97800 
acctgtcgac gaactggccc gtcgtctggt 97860 
atcgacgtca ctggaaacgt acatcaccat 97920 
atacagcgtc tttgccaaat caaaaaacat 97980 
cgacacagaa cttacaacgt tcacccccga 98040 
gtccctgtgc aaaaaccagg catacgggtg 98100 
gaaaggtatc gattttatgg cggagttggg 98160 
atcaaccgta agcctgtcca acgtgagagc 98220 
atccaaagag gcgtgtgaag atttccgcgc 98280 
gccaaagtgg gaaaatttta cggacttttt 98340 
gttctatttt tacgagtgtc ttaaccagta 98400 
tcaaaatatc ttaaaccgag tcgatgccga 98460 
attaatcggg tctttcccgt tcccctggaa 98520 
tagcaccccc gtggcaacgc tacaaaaaat 98580 
gtctttgttc gaaatagcgg ccaacaaatc 98640 
cccgctaacc gacatacaga ccgccgaacc 98700 
cgcaaaatat tgcagccgtc tccaaataag 98760 
cagcccggga ttcgccgccg agttcattaa 98820 
gctccgaaac acgtccaaaa aaaatcgatc 98880 
cgcggcagag gacgccagag acctagcccc 98940 
tcaaataatg gacattttgg aagaaaagca 99000 
ctatcttcag gaaacattta ggtgcatacg 99060 
agaatttcta caatacttgt ttacttttca 99120 
aacatccatg acctttgtag acgacctgaa 99180 
ctccttaggt gccgcgtttt ataacatgaa 99240 
cgagattcca acggaagaag ggcaacccat 99300 
gattcgcgtc ctcgagaagc tcttacagga 99360 
accgatgtac gcagcgcaca tccgtctgat 99420 
cggacacgaa ataatgacca cacaacttcc 99480 
tttgcagtgt tttaagcgtt ttaaacgtct 99540 
ctcactgacg cgatactttg gattcctata 99600 
acaaaaaatt ttaaacttta acgacaaaac 99660 
gtcacagccc gtgagacaag gacctctatc .99720 
aaaaaacaac gttgagctgc ttcaaaaact 99780 
caataacccc acctctatta aacttgaata 99840 
cgtagattgg agcacatata acctggtgac 99900 
caccccggtt aacacggagg cactagatcg 99960 
gccaattttt tcgaaattta aaatagaggg 100020 
gttcggacaa tacgccggct cgcagtgctt 100080 
ttatttcaat agagagtccc ccgtgacgga 100140 
aggggcgacg ttggacttta tactgagacg 100200 
ccaactgcac cacataccca gctttatcaa 100260 
atctcaggag tactttggac taattggact 100320 
atccctaaaa tcgatcctaa cgcgaaatta 100380 
atgcggcgac aaagccgggg ccgtaattat 100440 
ccattgcgta ccccacgtac caaatagtcc 100500 
cgccatacta gagtacgtgt caccgccaga 100560 
tatgcccagc gaatatgtca atccagagca 100620 
gttcgccaaa gtacacggcc cgcacataga 100680 
cgaagacatc ccaagcccgc cgcgatcgcc 100740 
acgcgtgccg agaaccacca ccgacacgtc 100800 
cggtttacgg ggcgcggaac caccgacaag 100860 
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ctacccagac 
aaccgccgtc 
actgtcggca 
cgacgaagcg 
atccgccgag 
tccatcgttt 
cgcgacagag 
gttctctgcg 
agaacagtta 
aaacgggcga 
tatcatactt 
aaacgttctt 
caagcaggtc 
aaaattcaaa 
gtcaatgagg 
aggctcacag 
cgccaaccac 
acgggataaa 
ggaagaactg 
aacggcattt 
ggcgtccata 
gaccccgccc 
aaatatcata 
aacgcccatg 
ctttaattta 
gcgggcgaaa 
ccaaatactg 
gaacccggtc 
cctgataggc 
ggcatcatcg 
agacaatctg 
cactgttagc 
gctaaataaa 
actggggcac 
cagcgcacaa 
cctaaacgag 
actgttaatg 
ggaaacacgg 
agacttttta 
gacgctgatc 
catggactac 
aaagggaaag 
catcgacgct 
tcctggtacg 
cgacatcctc 
agcgtttgac 
cctgcccatg 
cgcggtgcaa 
ggccgaatac 
cgtaagatcg 
tcccccgaaa 
cgtcgaggac 
tgcgctccag 
ccgccaatcc 
accccgcgcc 
tttgagttca 
gggattcgcg 
aacgcaacag 
tctacacgat 
gctagacgaa 
acaaaaactg 



ccggcaacca 
gaccaccccg 
tccaggggtc 
tcgtctgacg 
gtgatctgga 
gacgacgtgt 
gacacggatg 
ttcgacgacc 
atcaagtacg 
gccgtccgag 
gaacacggtc 
caatttttca 
ctagaactcg 
cagggggcgt 
gccacgcacg 
atttcgtcca 
ctgagtccag 
attcaggacc 
ttttttgatg 
ccaatggaga 
accaccaggt 
gaccaaggaa 
tccacgttaa 
gcaacgtaca 
gagtggccat 
gcaaaaatgg 
ggagacgccg 
cgcgccatgt 
agttctcgaa 
gagtcgttca 
gccaagatcg 
aaaacgctcc 
aggagcccct 
gcgtccgttc 
aaggaatccg 
ctgaagctcg 
ataataaccc 
tggcaagaaa 
cagtcggcac 
acccaattca 
acgccgacgc 
gccgcatgga 
tcggattggc 
gttggaccaa 
acgcagaagc 
tggatcgcgc 
gtgcgcaacc 
tccgcagacc 
tgtcgaatac 
gaggccgcgg 
cccaacctag 
tttccagatt 
agagcggaat 
acccgtgacg 
cccgtcgcaa 
acggtatacg 
tggctggaaa 
ttaaacgtac 
t tggaactgt 
ctcgcgcccc 
gaaagcatag 



acgacgcgga 
aattccaaac 
ggaaacgcaa 
acgaaggggc 
tggacgacga 
tcgacagccc 
gcgcgtttct 
ttataacgga 
aagcgccctc 
aagcggcggc 
taattacaga 
tcctgtgggg 
atctgcaact 
tcaaaaaaca 
cagatgcgca 
gcgaagcgaa 
gctttttagc 
taaaaacagg 
ccatgctcac 
tctttccgca 
taaccgagga 
cgcacattac 
aaatcctaac 
ggcgtcagct 
acgaaaccgt 
aatcggtaac 
aaacgttact 
ccataccgat 
accagcggtt 
taataatgct 
acggcatcct 
aaacgctggg 
cttccctcaa 
cagatcacga 
ctggcgatga 
taactaccca 
gtgacctcaa 
acgtgctaaa 
cgtcagcaaa 
acggccacga 
ccatacccac 
ataaaattca 
aagagatggc 
agctggtgcg 
tggcatctct 
cctatcaaac 
tggcggacaa 
ttcaacaggc 
tctctgacat 
cgtacacgga 
aagcccccaa 
tcctaaaaac 
ttcagcaact 
agatcgcagg 
tatcatcgag 
acaaaatcct 
tcgcgacaaa 
tgctgagcga 
cggcgaaaaa 
tcagggtaaa 
aatccctgct 



cacaaaactc 
aacacctgga 
gctt tccagc 
cccacgtagg 
tatatctcct 
cccaatgagc 
agaacagatc 
ccacgacttt 
gcagcacctg 
cctccaggcg 
cgcgcaggcc 
agaaaaactc 
gataccgtta 
cctaacaact 
aaaaaaactg 
aatatcagtc 
ggtctactcc 
catcgagcag 
agccctggac 
ccgtaaaacc 
cgccaccgaa 
caacattcca 
gtacgcagaa. 
gttgtacctc 
gccaccggtt 
aaaaatggaa 
tgacacaata 
actggagacc 
cgaaaaactc 
gctaaacaac 
gaccaacaac 
aggcagccta 
caacgcgcgt 
gacgctgacg 
tccagatagg 
atcgcgtgtc 
ggaggcggag 
gtttcaaccg 
ggcccgaaaa 
gcgaccgccg 
gccacaggcc 
acaggccttt 
atcagaatac 
cttcatggag 
gcttccaaac 
acgcgtaaac 
gatccatcac 
cacggtggga 
gcaagtcgcg 
cgcaatcaac 
gaagctaata 
gtcaatcctt 
agaggccagc 
caagatggca 
accgttgaac 
ggacaaggag 
atccgtaatg 
ggtagaaaaa 
cacggacgac 
gggcggaaaa 
tcgcgccacg 



ttaactcccg 
gccacgctac 
cttcagcgat 
cgcgtacacg 
ctatattcac 
ccggagttta 
gctcgagacg 
tcttcccttg 
ccaaacatct 
atggacaaaa 
cggggaccgt 
aacataccaa 
catacggcta 
aaaatcaacc 
gcgtcggctt 
cgggcgctga 
gcggacgagg 
cgcaacaaag 
acgttccaac 
gaagttatgc 
gccctcaaca 
aacttttcat 
aacgacatgc 
ggaggcgagt 
caagaactgc 
aagaaccaac 
accgcaacat 
tacattacaa 
aaggccgcca 
acgcggctcg 
acacgttttc 
atacgcgaat 
ctcctcgcgg 
cgaatcgttt 
tggacgcgag 
gacaaagcca 
gtgtctcagg 
tcgacgtcca 
ttcgcagaaa 
tccgaggcca 
gtttctacgg 
caggatttca 
tccagacacg 
agcatctcaa 
gggcccgcgt 
gcgtttctaa 
caatgccaaa 
acaagtttag 
ttcaacgacc 
tcgccggcca 
acggcaactg 
caacaggagc 
atctcggcgg 
accgctatca 
ttatcaaaac 
ccttacgaga 
gtctacagtc 
cagagcaccg 
gtaaaggtgc 
accaccgtag 
aggacggcag 



ctccagcgca 
ttctttcaga 
attcggattc 
acgatgcgat 
cgtcggcgac 
catacgaaga 
cggaaacgcc 
ataaaaaaat 
cggacaaaca 
ttatgattaa 
ccgcgtgcaa 
taagcgacgc 
tcagcgaagg 
ggtgtctggc 
tcaacgtcga 
aggaacagat 
taaaacatct 
aaatccaaca 
cccctccgaa 
tcgaccacct 
attacctgga 
ccatcgtggc 
aattaaacgt 
tagcaaccat 
ccctcgtggc 
aggctctcga 
ccggagatga 
acgcaggcgc 
tccacgacct 
ataacatatc 
tttcaaacgc 
gcgtagaagc 
ttcaaaccat 
ccggcgtcgc 
taaccggtca 
ccaggcgcaa 
aaacggtcct 
aagaaatcga 
aacacctacg 
ccgccgttcc 
ctaccgcgga 
actttcacct 
gctcgtccct 
acaccctgga 
tcagaccccc 
aaaccatagg 
ctgtcagtca 
aacgacccgc 
acggaatcgc 
acgtcgtgac 
acgccctaac 
agcgactcat 
ccgaacggct 
cgcaactctt 
ctatagactt 
cagccatagc 
aacaaaacga 
tcgcgcagcg 
tgaagcaggc 
acgcgtggaa 
gcgaaatatc 



100920 
10 0 98 0 
101040 
101100 
101160 
101220 
101280 
101340 
101400 
101460 
101520 
101580 
101640 
101700 
101760 
101820 
101880 
101940 
102000 
102060 
102120 
102180 
102240 
102300 
102360 
102420 
102480 
102540 
102600 
102660 
102720 
102780 
102840 
102900 
102960 
103020 
103080 
103140 
103200 
103260 
103320 
103380 
103440 
103500 
103560 
103620 
103680 
103740 
103800 
103860 
103920 
103980 
104040 
104100 
104160 
104220 
104280 
104340 
104400 
104460 
104520 
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atcggagctt gaacgcatcg gcacacaggc 
aacgctctcc gatcaatgcc gggaagccgc 
aggcttctcg gacataggca caaaactcag 
acagtttctg gagcattttg aaacaaccca 
ccaaaacata accgaaaacg ttccggcgcg 
caatcacctt cacgtgcgcg gcagcgcgcc 
gaccgtcgat ccggaaaaac caactcacgt 
ccagatcacg tactcaaacg tcctagaggc 
cccggttccg acggcccccg gtctggaaat 
cgcaacgtgg atggacagac agtggcccga 
cacgtacgaa cacaccaccg cccacgcaaa 
gatgtgcgtt tttacgcaaa tcatcagggg 
accgagcacc gccgtcgatt ttccagaaga gatcgttcta 
actggtcacc gccatgtggc ccaccctggc ggccgcaatc 
ggaagcccta ggactaatga gtcgtttcct 
aacgctagag gcccaggtca aaaacggccc acataacacc 
tgccaaaaca gaggcaattc cgtatttccc ggcgcaatgg 
gagcctgtgg ggacagacgg actttttgca 
ggtggcggcc gtcacctggg cgctcacgac 
gtccacattt aaacccatga cagccgcgtc 
cctacacctg accacctttg gcccgcgcgg 



ggttggcacc atcaccgtcc gcgatttagg 104580 
aaatttcctc agacaggcca gtctacccga 104640 
cgagcttcag gcgtacatta agtacaaaaa 104700 
gcctaacgtc tttcaacgct tcccgctatc 104760 
cccggcgatg gactcggtgg ccagactgac 104820 
ccactttaca acgtggatag aaacgctacc 104880 
cccggcgcac ggaggagccc ctctgcaccg 104940 
gttgttttca ttatgttcca ccacgctaac 105000 
cgcaaccagg gcacgccgcg gggcagaggc 105060 
catcgctcag acgctccaag acgttctcga 105120 
ccgggacgcg gcatttaaca cattcttggc 105180 
cgctagcaga gccgtgacgc tcccgaagtt 105240 
acacccaggg aatgcacaac 105300 
ttacgattaa aatcctactc 105360 
cccgctaatg ttccaggcgc tgccgcacct 105420 
ccgcctcagt tgagatgctt 105480 
cagtcagcga acctagagca 105540 
aacaatcaac gcaaggccag 105600 
gtggttttgg accaactgtg 105660 
tacgtcgacc tagtcgagac 105720 
aggcgagaaa cgaccaccga 105780 



aatctgcgat 
gatagacggc 
agacgacacg 
tccaacgccg 



gcacccgccg tacgagtacg gacagcccac gggctactgc atctcgggtc aatcgacgac 105840 



gccggtccag gcttcaaaca caccggtatc 
gtttcacgta ccgatcagaa tatttttggc 
cggcggcatg gggctcctca cacccatcct 
aagcctgtat aacgcacccc gcaaacccgt 
cccgcacgac gagcgacagg tctttctgag 
accacacgaa gccgcccggt cgtcgactcc 
cctcgtaacg gcaacgtact ccagtggcgg 
cgtgatgccc ggaccatacc ccccagactg 
gtccgtgacg cacctcagcc acgacgagat 
acacgggacc gtgcaaggca gggatatctt 
acaaaccgcc aatcctccgg catgggaaac 
cgccaaaaaa cctcatataa ttcctgcgtc 
aaccacgacc caccattcac 
gcccggtcaa attacttcac 
atcttcaccg gaaaaattca 
aacggcccat ataaaccagc 
gacgtataaa ccccgagtgc 
tcgccaacca gaggcaacca 
ggtctttata gggaccgcgt 
acccgacgca catcagtcga 
caccccacac ctcgaagagg 
cataaacgtc ccctctagcc 
tcaaacagca actcccagcg 
gctccccaga acggacggcc 
cgaggaggta aaacgatcgc 
ctccaggtgg accgccccgc 
aattctcaaa aatataacat 
cactaacatc tggatacccc 
caagaatgtg ttaatgcgat 
cgctctatcc gaggctattg 
ttgctttttt tattttgaga 
aacgaagtgg accctatgcc 
gggcccatcg agcctatcaa 
ccgagggtgg aggccaccgg 
cccgagatac tagaactagc 
agaattccat tcttcttttg 



agataattac gtttcgcgat 
ttcatctcgg caaccagcgg 
ggctccttaa cccgcaagga 



aagggcaagc 
acgcgtcacg 
acacgcaaac 
cagcctccgg 
tcaccgaacc 
ccacggtccc 
ccaaactctc 
agatagatcg 
ggccacggga 
aaggtcaaaa 
ccccccaacc 
ggttgcttca 
caaaaacaca 
tcggtccaac 
cactcaccgt 
tatcccacgt 
ttattcaaaa 
ccagaataaa 
cgagcgtctt 
agaagacgcc 
tgaatggggc 
ggcaacgctc 
gtcctttttg 
catatacatg 
ggcatactgg 
gtgatttggg 
cgacatctct 



cgctttcgag 
ggccacgccc 
ggaatgcgtc 
gcccattgaa 
acaggcacag 
gccgcttctg 
gcctgcaaat 
gccaaaaacg 
atgtaacctc 
cgcagcggct 
ggataaccga 
tcctaaagcg 
gtcgcaacac 
taacacaccg 
ggtgcctcga 



gcggtgctcg gagcaatggt 
aagcgccttg gccaggcgcg 
cccgacgtcg agcccttcaa 



acgctacccg catccctcca 
tggctatcct accgattcac 
gtggtcatag accctgaaaa 
ttcgagagca ggccgtttta 
ctgtcggtaa catcaaacac 
tttactacgc tatcccgaga 
ccgacaaacg tcacaccgga 
ttaataacgc aaacagaaac 
cggacagatc caccggtgga 
gcaaacagca acgtaaacca 
tcaaccgcac ctcaggcctc 
ctaatatctc aaacgtcgga 
ccaggtcacc gaaccaaagg gaatctttgg 
cgccaaaccc gcaaacgccg gcgtagcctc 
attaatccac ccaccgctag 
gaagagagcc acggcgccac 
gccgagagtc ggcctcgccg 
aacactccaa ccagcgcaca 
gggcgcgaaa atcccggcct 
aacccgcgca ttcaatacac 
gaggtggaat cgaccccaac 
cacgggcccg aaccggacga 



caagttaccg 
gccagccgtc 
ggaaaaatac 
gcctcacgtc 
aacagtacac 
aaccgcatca 
cgacgaatcg 
agatgtgtct 
catagagatt 
ccccactccc 
caacatccaa 
cgtacgaaga 
gtttttatat 
gtgtccagag 
acgctcccgg 
ccgctggaga 

ggggcggccg 

ccgcgaaggc 
tcgtaatgat 
gcgggagaca 
taatcgtgct 
tttaact tct 



catcgactgg aacaccccca 
agagtcaccc caatccctcc 



cacgaagaaa 
aaacttcaag 
ctgtaacgcg 
tagttgcgtc 
gtccgccgcc 
tcgtagtaaa 
agccggaccc 
cccggaggtg 
gtttggccgt 
tgtcaccctg 
caaggcgtcc 
attacacttt 



tcacacgagc 
cgtcgtctga 
cccatctcac 
gctaggtgat 
accgggcgcg 
gctatttgac 
ggacaaaacg 
gtctttgcgc 
taaaaacacc 
gggaaggt tg 
ctgaacgat t 
ccacaggaca 



105900 

105960 

106020 

106080 

106140 

106200 

106260 

106320 

106380 

106440 

106500 

106560 

106620 

106680 

106740 

106800 

106860 

106920 

106980 

107040 

107100 

107160 

107220 

107280 

107340 

107400 

107460 

107520 

107580 

107640 

107700 

107760 

107820 

107880 

107940 

108000 

108060 

108120 

108180 



gggacgatat agacgaggtc aaataaaaca gctcggccac acgcaaatgc tttaataggc 108240 
cggtcgcgca gtccggcgac gccaaacagg gcacgacgac gctaaccagg gagttcgcgt 108300 
ctcgtatggc atgtgccgcg ttgttggcga gcacgcaccg taaataggga tctccaacac .108360 
acgtgatctc gaatagagat ataacccgca tgtgcgatcc gccacaataa gagcaatata 108420 
cgcgcccggt ggtagcacag atcgaaagct gcttctcttt ttggtcgcga ctgaaaaaca 108480 
cgttggtggg tgggaaattt acggtttcaa atttaccccg tccgaaattc aaacagtaac 108540 
cgcactcgag gcacaccacc accctcggag ctggcacggt cttctccagt acgctcctgg 108600 
ccaccacctg ggaccaaaca ggtagagaga tacacggaaa cagtacgtta tacgccaata 108660 
ctttttgacc caggtcgcgg gatatctccg tctcggtcga ctcccctatg ggcaacacaa 108720 
cacgggacat gctcagcagg gccctaaacg tcaggctcct cagaagggcg ttaaacgggt 108780 
tgccgcacgg gacggtcggc gccagttctc gcagcgaggc cagaagtccc gcgtccgaag 108840 
ggcccgggac actctcattc aggttagctc ccagacgtct ggaaatggac ggaacgttca 108900 
actgcatcga gacacaaccg cccccgttcc atttcttccg caaaacgggc agatccaacg 108960 
cgtgctgtgg caaacaggta accaggggaa accgctggcg acagttaagg gttttgcaca 109020 
cgagacaaca cgccctctga aacgacacaa caagatacct ggaccacgcg ctcctgggaa 109080 
cggccgttaa acttaaacct tcgtcgctgg gacagcccac gccggtgcag atacacctca 109140 
gcacccacgc gtacgcctct aacaacgacc ggccgatatc gtgcagccgc gatctcacgt 109200 
cgccgttttt aggtgggtta tccggcctcc agccggtagc aatctcgttc agggcggtct 109260 
gaaaggatgg ggcagaaatt aacgtgcaag cccattttgg gggtcgtccg tcccaggcac 109320 
cgagcccgta cgtcacaaaa cacacgtagc attcggggca tagtccgatt gagcgtatag 109380 
aggccgcgag atctaagccc agccgagaac catccagcca acgatgggca ggataagaac 10 9440 
cgcgcccatg acaggcagct tcgtcttcag ccaggttagg caagcggccc gaggccatcc 109500 
cccaattgta ccgattgaat tggttaattg gtcatcgccg cacgctctcc gcgcccaata 109560 
tccttcaact ccgaccccga agggcgggcc acggagcccg ttcgcctcaa cgcaccgagg 109620 
ccgtcggagt gacccgtcag ggcaaaaacc gttctaagaa gggtttttaa ccgtttagcg 109680 
ctctttggag tcacgaccaa aaactgtaaa acctgtcggt gctccgtaaa gtaggtgcgg 109740 
catatgacca tggagctgta aacgtttagg tctccggaga aaaccagacg tgccttaaat 109800 
ttcataaaat cgtcctggcc cagggacacg gacgagttcc tctcaagata cacgtccgaa 109860 
tttataggca ggttttttcc aaactgggca tcggcgtcac gtggcttaca caaaaaacat 109920 
ttcagcgtgg tggccaaacc gttgttgata attacaaaac acggggcaaa cgggtaggcc 109980 
agtctctcta gtttgtggag ccaaaactta tacacaaact cgagatgata gacgcagccg 110040 
tgctgcaggc gcacggtgca cacggggacc gccccgcctt tagcgtatac gggagccccg 110100 
tcctgacacc tctccaagtc cagggagatt ccagagggtc ccaggtaaga gacaactaaa 110160 
tcgcacagct cgtcaactaa acgttttccg gaactcatcg ttataaagat cctttaggtg 110220 
ctgtgcgtgg ctcccgtaaa aaccgcgtcc gtgctaacga ttttgtgaat gacctgtttt 110280 
acggcgttta ccttggcgtc caggaccatg cagtgctcac agtgagctga ccgcgtctga 110340 
gcacgatgac agaggaaagt ttttaaatac tgacagtagt taatggcgtt gagcctggaa 110400 
tatatggtgg gaaacataat tttcatgtca tcgggcagca gggactcgaa cgccaataaa 110460 
tcgtcaccga acatcacgtg agacagaggt aaaagatgct caccgccggt accgcgtaac 110520 
acgcgaccag ccaccccttc aaatatttta gccttaaaaa gcgggccccc taaagtcgtc 110580 
caactcagct taaaaactcc tacccattat gttcgtcccg tggcaactgg agaccctcat 110640 
gagacactgg ccctctctgc ggggactcgt agaacaatcc ttcctccccg gtaccccgga 110700 
cggagctttt aacagcccgg tattaatcca cactcaggac tctctacaac ccgcctcatc 110760 
gtgcagggtg tgtagcctcc tgttcactct ggtccggaca ttcccacccc ccgactcttt 110820 
cttcgaagac tacggctggt tgtgcctcac ctgcctatac gccccccgat catggacggc 110880 
taccctcatg gtggctgccg accttttgga actaacgcac gtgtacttcc cgcaatgcgt 110940 
gaaagatggg ccagtataca ccgcccaaag catcctcgga atcgacgtcc agctgcactt 111000 
cttcgcaacc cgctgcttcc gacccatcga cagagaacaa atactccaca catctcattt 111060 
aaatttttta caaaccgagt ttattagggg catgttagaa ggcacgattc cgggatcgtt 111120 
ctgttttaaa acgtcctggc cgcgcacaga aaaggacgac caacaaccta ccgttgcgtg 111180 
ttgttccgtt ggccgcggaa gtcacaccaa ccgggataac cgcctacccg aggacctgga 111240 
agaggcgttc aactccacga acgccgagga aaagcccagc ctcctcggcg tcttttcggc 111300 
aacgtgggca gaatcccagc ttcttggctc cgacacacaa caggcagata cccatttaca 111360 
accctccgcc ttcccaaccc cagaagatgc tgaccaatca cagggcccct gcctgatgca 111420 
cccaacgctc aacctaaaaa caaaaaacca caccgcatcc atatgcgttc tatgcgagtg 111480 
tctggccgca cacccggacg ccggtccggt tctgaaagat ctgcgtcgcg acattctgga 111540 
aaacatggaa aacaacgtta agctcgtcaa tcgcatatcg tacatcctaa acgatccgga 111600 
ctcactgtca cacgtgcgcg acgaacatct gcgcggccta attaaacggt gctcggcaca 111660 
agaaatccac aagcattttt tttgcgaccc ggtgtgcgtc ctgaacacgt actcgcactg 111720 
tcccgcggtt ttatttaaat gcccacctcc cgaaaagtat aagaagctca aagctcgtct 111780 
ggcaaccgga gagttcctag actgcaacag aatatttgac tgcgagacct tacagaccct 111840 
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ggccgtcctc tttaaggggt ctcaactggc caaaatcggc aaaaccacgt cgctcgagat 111900 
aatccgtgaa ctcggatttc aactgcgtcg acacaacatt caaatcaccc acccgtttca 111960 
aacctccaac ctatacattt aatcttcaga agcgcaccag acaatgccaa aacagcccag 112020 
aagtcgattg gcgtctcgag cgccgtacgc .acctagcgtc ^ggcgaccgg acgggcccca 112080 
gtccacgcga ccggcatcca ggcacggcag ctgcaaaagc gaaatcatgc agtggaaaaa 112140 
gttagtttca gacacgcagt ttttttctgc cctaacgcgc cgccacgagc tgggggtgga 112200 
ctttttaaga gaaatgggga ccccgatatg cacctcaaag tccgttatgt tgccgttaaa 112260 
cctaaaaacc atcgccccgg gtcggtgcgt ctctctctca tcattcggac actcgtcaaa 112320 
catggggttc aactgttcgt cgtgcacgcc aactgacagg tcagcggtgt ctctggacgc 112380 
aaacgcgctc ggcgaagatt ccgccaggaa aaacagcgag ctgtgttcag tggcgttaac 112440 
cttttaccac cacgccgaaa aggtcgtgca gcacaagggc ttttacctgt ctctgctcag 112500 
ccactccatg gaagtcgtta ggaaaagctt cacgcaaccc gggttgctct acgcccacct 112560 
agtcctaaaa acctttggcc acgatccttt acctattttt acagtcgatg ccgatgagag 112620 
actcgcactc tgggcggtgt tccacactag agacctacac ctgggggaaa ccagtctgcg 112680 
actcattatg gacaaccttc caaattatga cataacggtg gactgcatca agcaaacgta 112740 
cataatgaag tttacaccct cgcgaccgga caacgcaacc gtgacggttc ctgtcaacag 112800 
catttgcgag gccgtggcca ccctagactg caccgacgag tttcgagaag aaattcaaag 112860 
gggcacggcc atcataaact cccaggggct attgtaacct ttccctagac ggaaacagat 112920 
gtaattcacc tattcaaatg ttaagttttt gttttgaaca tatcaccaat aaaaacaact 112980 
tatgtttaca caaattgaag agcgtttcaa ttttaccata aacatagcaa aaaccacggt 113040 
aactaaaact caatagcgat attcggatgg gatccctaga gagcaatcga cgtgcgccgg 113100 
tattaattga cgggaacgca cgttaactgc tctccacgaa tcgcaaactc cgcgttttta 113160 
ggaccattcg acgccgttaa taattggcag taaactgcga aagggcgtac atcgcagtag 113220 
tgattttacg atgtgcacac ctttccgcgg ttcaccgcca acaacaaaag cgggtcacca 113280 
ggcacctaaa tcgcccgggc cttttatgcc aaagtacaaa agggaccggc gttctacgtt 113340 
cacgaacaac actgttgatc gcaaatacaa tagtgttgtt ggtgaccgca aactgccaaa 113400 
tcctaaaatg catggacata gtcctcaaaa tctagaacga acttccaaaa acatgcggca 113460 
ataaacaacc ccttgcagtt aatgcacatc gtaacgagac ggaatccaaa gaacacgccc 113520 
gatcttaaga ccacgggcac gtgtctttgg gttccggggg cgttacgatg gaagttcatc 113580 
accaagtatc ccatcaacct gttacaaaag cgtaaagata atcggtttac aataaacaat 113640 
aaaaaatagt ggacgaagtt acattccgac atacaagcgg acccaaaaaa acaccccaac 113700 
gcaaaacaac tggagactgt gtctttgggt cccttccacg tcgcaatgag atttctggtg 113760 
accgtgtgct gcttaacgcg ttacagttgt gttttatggc tacgtaaaca caccaatcta 113820 
gaatgctaac acgtgcaccg cgcttaggcg cgtttgtgca actaactgct gtcaggttac 113880 
tactcttatt tttaacaagt agcataccaa ctgcaattaa ttgcactaaa cccagtttag 113940 
tctttttagt cgagtcggcc catagtcttc atggagaacc atcattcatc tccgtcaacg 114000 
gcgtcggtgt taccgttgtt attgcgggca aaacaggctc gggggaaaaa agagcaacat 114 060 
atcgtcagtt taaaacaact caagttctaa aatctcaccc actcgtaaac aaaaaacatc 114120 
gcagttaagg tgtatgtacc gaacataaca acacaagttt tttaaaacac agctgcggta 114180 
agtaaccccc attgccacgc gtgtcgcgtg ctaagtgttt ttaaaattac attgtgcgtt 114240 
ttacacaccg agcagtaatc tcaggagggc ggttaacgag cgatatacat attccctaaa 114300 
cacgggaacg cgcgctgacc gccctcccca aatcacaaca cgggactaca aagcctagtg 114360 
ttaatataat caaattaaaa aaccacagaa acctttagtc gtgcgcaaac actagcaaag 114420 
gtacctagag ctttccccta tacttcaaaa aacagcggtg ggttatttga ccacacgtta 114480 
agtaaacacc cgtaagaatt attcccgttt atcaaaatgg aaaaattaag gctttgcgtt 114540 
aaaatctgct aacgcaaagg gcacttaatt tttccagttt ggactcggaa ctttaccgct 114600 
aacgttaaaa tttaataatg caaaaggcac ataatttttt ttgtgtaacc tcaaaactct 114660 
acagctaaag ttaataacgg cgactttggg cagcgaatca gtgtctgtcg caaaccctgt 114720 
taaatttaaa cacaacaggg agcgcgacta aacacggact aactgctcat cgggccctgt 114780 
tgaagaagtt ggtgttgcaa tgcattaggg aatctaaaaa cgaaatcctg ctcggtaaca 114840 
cggaaattgt ctttagttcc ctacagcatc acaacaattg taaaccataa acgtacgcgc 114900 
gtggggttgg tgtggtgtga aatgcttggt gcaactgtta catgggcgga ttgtaaatgt 114960 
ggtgcttggt gcaacggtgg tggggtgcaa gtcccccggt ggggtgcaag tcccccggtg 115020 
gggtgcaggt cccccggtgg ggtgcaggtc ccccggtggg gtgcaagtcc cccggtgggg 115080 
tgcaagtccc ccggtggggt gcaagtcccc cggtggggtg caagtccccc ggtggggtgc 115140 
aagtcccccg gtggggtgca agtcccccgg tggggtgcaa gtcccccggt ggggtgcaag 115200 
tcccccggtg gggtgcacgt cccccggtgg ggtgcaagtc ccccggtggg gtgcaagtcc 115260 
cccggtgggg tgcaagtccc ccggtggggt gcaagtcccc cggtggggtt caagtccccc 115320 
ggtggggtgc aagtcccccg gtggggtgca agtcccccgg tggggtgcag gtcccccggt 115380 
gggagcggct cggctccggg gtggctccgg gtgggggcgg ctcggctccg gggtggctcc 115440 
gggtgggggc ggctcggctc cggggtggct ccgggtgggg gcggctcggc tccggggtgg 115500 
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ctccgggtgg gggcggctcg gctccggggt ggctccgggt gggggcggct cggctccggg 115560 
gtggctccgg gtgggggcgg ctcggctccg gggtggctcc gggtgggggc ggctcggctc 115620 
cggggtggct ccgggtgggg gcggctcggc tccggggtgg ctccgggtgg gggcggctcg 115680 
gctccggggt ggctccgggt gggggcggct cggctccggg gtggctccgg gtgggggcgg 115740 
ctcggctccg gggtggctcc gggtgggggc ggctcggctc cggggtggct ccgggtgggg 115800 
gcggctcggc tccggggtgg ctccgggtgg gggcggctcg gctccggggt ggctccgggt 115860 
gggggcggct cggctccggg gtggctccgg gtgggggcgg ctcggctccg gggtggctcc 115920 
gggtgggggc ggctcggctc cggggtggct ccgggtgggg gcggctcggc tccggggtgg 115980 
ctccgggtgg gggcggctcg gctccggggt ggctccgggt gggggcggct cggctccggg 116040 
gtggctccgg gtgggggcgg ctcggctccg gggtggctcc gggtgggggc ggctcggctc 116100 
cggggtggct ccgggtgggg gcggctcggc tccggggtgg ctccgggtgg gggcggctcg 116160 
gctccggggt ggctccgggt gggggcggct cggctccggg gtggctccgg gtgggggcgg 116220 
ctcggctccg gggtggctcc gggtgggggc ggctcggctc cggggtggct ccgggtgggg 116280 
gcggctcggc tccggggtgg ctccgggtgg gggcggctcg gctccggggt ggctccgggt 116340 
gggggcggct cggctccggg gtggctccgg gtgggggcgg ctcggctccg gggtggctcc 116400 
gggtgggggc ggcctaaaat ccttaccggt aaatttagca gtaaatccaa cgcagtaaat 116460 
ccgcaagcta gccgcacaga ggtgcgactg cctgccaagg ctcctggcgc ctcttttata 116520 
gcgctaaatg ccctccccaa atggttacta tggtttagtt atgcaaccat aataccaata 116580 
aatgaatcac caagaatacg gccaacgcgc cagcgtccaa ccgccctcca accgtgcagc 116640 
gccgtatatt cgaattgcct tccgcacgaa tacggggcgc agcatggtcg cgggaggctg 116700 
gcattgcgca atattcccag cattccaacg ggcacgcaag ggtccaggct accggggatt 116760 
ccccaaacat catccagcgc atagaggttg tggctgagct gcctatctgg ggattcccca 116820 
aacccgagac ccactgagat gctattcagt gggactgcag gctgggtcct cccaggacag 116880 
gtgcctgcgt gtgtggtaag ggagttcccc taattaaaat attatattaa taaataaacc 116940 
aggctaagct gtaattaatg aggcacaggc tgccgaccta ggtacctggg gaacccccta 117000 
acgggatgta attaattcaa acctatataa attccaccct gttaggggga tccccacgtg 117060 
tacctgaatt aatgcgggat acaggctgta ggtttggtaa tgggtaccag gctggagtga 117120 
ctcataccat gggattaaat accagggata caagctgtag gttcggtaac gggtaccagg 117180 
ctggagtgac tcataccatg ggattaaata ccagggatac aggctgtagg ttcggtaatg 11724 0 
ggtaccaggc tggagtgact cataccatgg gattaaatac cagggataca ggctgtaggt 117300 
tcggtaatgg gtaccaggct ggagtgactc ataccatggg attaaatacc agggatacag 117360 
gctgtaggtt cggtaattta aacctatata aattttaccc tgttggggga atccccgttt 117420 
gtacctgaat taatatagga atacaggctg tagaatcggt aatggaaacc aggctgtagg 117480 
actaggctgg agtgactcat accatggaat tcaattaaca aggaaaatat aatagaatat 117540 
atatatatat agggaacttg taaacaaaac ccaactcgcg gattggctgc ctgtttgggc 117600 
caaccagcag cgagaattcc ctgattgaca ggcggactgg ccaatggttt gcgagcattt 117660 
ttgattgaca gaacggccgg ccaatcgcaa ccgagaactc ggcagcgaag caaaagacag 11772 0 
acggccgcgg cgaccaatgg ccgccgcggg ttagtttgat tgacggcttg gccggccaat 11778 0 
gggaaacgtc cctgccggcg gcccctaatc cccctggctt aagggattac ggccgtaaac 117840 
atcgctggcg cggtgccacc gccgccgaac ccccgcgccc ggttcggcgt gacctcgccg 117900 
cgaccccgcc gccctcatgc gcaccatggg tgcgatgggc accggcggca acattattcg 117960 
ctgaccgccg gacgaccccg gcgccaaaga gcggccgatg ggtacgcgcg gcgaccatcg 118020 
aaacattttt caataacccc agtctgaccc cgcggcccaa atctggggcc catggtggcg 118080 
cgcggccccc accgtcccca tttcccacgg tttgctcatg ggcaccgacg gtgcccgtgg 118140 
tgatatgttt cagtgaccct cggatgaccc gggcgacgtt ttgcgcacca tggtcccgaa 118200 
ttcccccatg agtacatgtt tcggtgaccc gccggcgacc ccgggcggca aaagccgccc 118260 
catgggcacc ggacggcacc atgtgcacat ctttcagtga cccgcgggcg accccggcgg 118320 
cggcaccgcg ggcgctgggg gtgatatgtt ctgatgaccg ccgccttacg tttgcagcgg 118380 
tgtcagcgga acatattctc cccggctccc gagcccgggg cccgcggacc cgagttgggt 118440 
tattaaagtt acgttaggga ttaattttaa tccactaccc cccttaatgt gcggctcggc 118500 
ctgcggcgac gctgacgccg gcgggttcga cccattgacc gcatcctcgg tggccgggca 118560 
gaggccggcc ggccaaaggt gcgatcgcgg ggtctgaccc acccgtggga gaaacccctc 118620 
cgttccgttg tcttctgcct gcggtccctg gccccctggc ccaccgggga tgtcttgtga 118680 
gcactttccc cggtgagaat agacaggaga gtgcccgcaa gacatcgacg gggttgccgg 118740 
tcggtgaacg ggaggcgcgc cgcttgcgtg cgtgaggtgt cgccggtggg ccgtatgacc 118800 
cccgcaacgc gaggggccgc tggcgcggaa atttccccag aaccggccgc gccgcttggc 118860 
ggcgcggctt cctcccccaa cgacccctaa tttggttttt aagttgttat ttaaagctcc 118920 
gctgtgtttg tggcacgcgc ttaagcgttg ccacctgttg cggagatccc ccgtgcgcat 118980 
cgccgccctc ctcgcctcgg tgcacgctgc cgagcgtgcc acctgtttgg ttgtgtcatc 119040 
ccatgtgcgc atccccccat cttatttccc cgaccaggca acataaacgt cacgtcgttt 119100 
gtaagaaata attgcttttt ttattttcgt taaccccgca accgcgatac agtctctcgc 119160 
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ctttcacccg ttgttttgtt accagcccct cttggcggcc ggcaccggcg ttaaccgggt 119220 
gcgttggcgg ccacgacctt ggccacgtcg cgcctggaga cggcctgcag caggtcactt 119280 
aaaaccatgt agtttgaggg gctgaccagg gccgcctttt ccatttgcga gagccaccga 11934 0 
aaaaaggtgg gtgactggtt gcgcctgcga ccaagctggt cgcccgttag aaaaactaaa 119400 
tttttcacgt ccctttctga gagctggcgg tcaatggaga gcatcaggtg tttgtatgga 119460 
ctaaagtaac tggggctgga ccggagcctc tcgataataa atcccacgtc tagtaaaaaa 119520 
aaggtggaga ccaggtccaa tcgcccaacg agaaacacgg cctccagcaa caccgggagg 119580 
ggaataccgg gttctccagt tgaggggcac agcccgtttg caaacccctc gggcgtatcg 119640 
tcggacgccg gccggtcaaa aagccacagc acggcctctc tgtcgtctgc ctctaggtgg 119700 
cgaccgaagt ccactaaccg cttatgcggg aacatgggtc cggtgcggcg ctcgccgaca 119760 
ccctcttcta cctacactag agcgtttgat aaatcacgac gaaccgcggg cgttacttaa 119820 
aatgtggtct tcgtcaaatt cccgcaaggt ggtggcaacc gactcggcgg cggccgcgag 119880 
ggttgcggcc gaaacgccga cggcctccgc caatctggcc ggaggcggcg cagcggcacc 119940 
gaggctaaac agcgcgcagg cggccgccac cagggacggg ggcagaccgc cggtggccgg 12 0000 
gttcacgatg gctttgtgaa cagactccac cacctggctg tgaagggcaa agagctgctc 120060 
ttttgtaaag ccgcttttaa acagggtggg cccgatggcg tccgttggta agacggcctc 120120 
tagtttccac cttaaagcct taaggattga tttttcctgt ttcagtaaat ccgccacgga 120180 
aaaatcctcc gccgcgcaaa agcataaaaa ggcggccttg aaaggattca ggtctctaat 120240 
cttgccggct aaaaatagac aggcggcccc cagcctctga aaccgccgac gggggatgct 120300 
gcgacactct agatagcgat ccagaatact aacggccagt gggaaaacgc tagcatccgc 120360 
ctggtgggcc ctggcaacag accgcatcca cgtccccaag ataactcgca tgcccaccgt 12 0420 
cacctccgtt tgtatcgtcc catagcaggc cgtggatgtt acaaaggacg cttcgtgggc 120480 
caggaggtta gagagggccc ggtcctggta caggacgggg tcaatggtcc cggtgggcac 12 054 0 
tgggccaaca gaagccattg tcaaggcaag gtaaagaaag cgatataagt cctacctgcg 12 0600 
gctcacggag gcttaatacg aacgtccggc ttaacacaca atcttctttg cgctctatgt 102660 
ccgctccaca ctggtcccat gtcttctgca gttcagccct gcctgttccc ggaaagcggc 120720 
ggactagatg caacgcggaa gcggctgttc atttaaggtg agtcacaggc ggggcaattt 12 0780 
gccaaccagt gacgcgtttc ctgcagtctg ggttttcacc tggcaagcgc cactggttct 120840 
tggcagccgg ccggtgggga tttttttagt gctgaattgg cagtcctctg tccatgcgca 120900 
ctatgctggc ctggaagtgg ggggcggatg ggtgatgtcg ttcccaccgg cgacgggcct 12 0960 
ttttgacggc cttggcggtt ccgcggcttt gacagaacac cgtaatgcaa aaagccggga 12102 0 
tgacgggact gccaggtgcg tccggcaaaa cgcggacgct ggatgctcta atgccgggaa 121080 
actgtaggca ccttgccagg cgatacagtg aataggggtc tgagttacaa aagatgactc 121140 
cgtgacacca cgggcccggc cgaggggggt cgatcctggg tcccaatacc tgacgataaa 121200 
atcttctctg tgggcggtat tttctgggta cctcttcctc tgatgaggga tacggtatgg 121260 
gaatggggta tcgcggcggg cgcggggggc gcggggggcg cgccggttgc tgctgccgct 121320 
cctccccctc ttgctcctgt tgtctttcag gagcttcgtc ctcgtgggag ttgtgactcg 121380 
tgcatctgag acgtaaacaa ggaactcctc cggcacgcgt gggtggaccc caacccccta 121440 
cggtgtacga gtgggatgta tggccaggaa acgggggtcc tggccgagac ggcccggtgg 121500 
ggtggtctgt gtggctcacc gcctgttgcg tgttagggga tggggcctgt tgaggcgttg 121560 
gcgattgggg tggtcccggt gagtcgggcg gggattgcgg agggggcgag gacgtcggcg 121620 
ttggcgattg tggcggtcca ggtgagtcgg gagggggcaa ggacgtcgga ggctcaggcg 121680 
gttcaggtga gtcgtggtgc gatggttgca aaaacgacga aggctgctga ttgggcgatg 121740 
gattatgcga cgacggcggt gcaggtgagt gggggccggg gtgtgacggg ggtgatggtg 121800 
ggggcgaatg cgatgcgggg gatgccgggg acgaccctga cgtgttttga ggtgacaggc 121860 
ctgaagctaa caccggcgca gacccgcttg gcgacggcgt tggagatccc aacacgatga 121920 
cggactcaga ccctgaccct gatggcgacc cttgggatga cccgtgcctg gattcggtgg 121980 
ccgtttcggt gtccgatccg tgcccttgtt ctgcaaccgc ccgcacttgc ggccctgatg 122040 
ttgttggagc ggggggcgcg ggtgcggcgg ctggcgcggc cgtgccacgc cccctgccgc 122100 
gcctcccggc ctgttctcta gtgccgccgc gtcttccgca atgtcccctg caggaacttc 122160 
gcaacccgtg tcccgatact atgccggaac gatgttgccg gctgccccac atgttcg.cgg 122220 
tttactgcgt gtaaaaaacg gcagattaag tagattgccc tttccacgac ccgcagccac 122280 
ctgagcgcgc gcacgcagaa cctatgttta gtgcgattgc gattatgtcc gctaggtggc 122340 
gggctatttt gatcctgccg cgcacgcatt gaccgttagg tggcgcagag cgccgttaac 122400 
cgcagcgagc accgcacggc ggttttagtt tcgttactat aacgtttagt taaaatccgt 122460 
ggccgatatt gggtggcagt gtgtaagtca cgcgcgccgc ggccccctta ccttgtcgcc 122520 
ggcgaaccgc ggaaagtctg ttccgccgcg cagcgcgcgg ccgctgccga aaccaacgcc 122580 
gcttgaattt cgcgggcggc tgggcttggt gtcagtgatt cacacagatg cgatgtgatt 122640 
gggcgtggtg gtcaacgtca cttgttaacc cgtaaatctg gaaattcttg tccgccgcgc 122700 
cggtgagccg cggccctaag ccttatagtg ctgtgcagcg atcctgaaac tcgaaagctg 122760 
gccatataaa ccccagtatg gcgttgcgtg tcggtggcaa cctttttgaa aaagatttat 122820 
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tgccgccggg tgttaagcac caacaccgac cgtgcgtttt caattatgtc gggaggaatt 122880 
acattaacgc tgctgctggc gacgcttgcc acggttcggt gcgctcttca aacgcactat 122940 
gcggcggtcc ccgtgcactc taccgcgtcc ctggggtgcg tgttaacgac accccacgac 123000 
gttcttatcg ttacctggca aaaacaggaa tcgcctagtc ccgttaacgt ggccacatat 123060 
agttccgaag cgggcacggt ggttcagccc ccgttcgccg gtagggttga cattcccgaa 123120 
cacaagttga ccagaacgac cctgaagttt tttaatgcca ccctggagga cgaggggtgc 123180 
tacctgtgta tctttaacgc gtttggagtg ggaaagctgt cgggaaccgc ctgcttgacg '123240 
gtttacgtcc ccctgtccat gtccgtcacg ttttaccccc cgattaaccc gacgcagctc 123300 
gtctgtcggg ccgaggccag tcccgcaccg tcggtcaact ggaccggcgt gccgcccgag 123360 
ctgtgcagcg aacctgaagt gtttccccgg cccaacggaa caaccctggt tgtgggtcgc 123420 
tgcaacgtaa cgtcggtgga ccccgaagac cttgagaacg ccacgtgcct ggtcactcat 123480 
ataggcggtt tggccgcggc gcggcccctg gaccccgtgt tttcggatcc cctggaaggg 123540 
acgagccact acgtggtggg tgtggtggca gcggccgccg ttttaggcat ttttttaacg 123600 
ggtgtttttt tgtataggtc tatgtgagcg cgcgtgtccc ccgtgtctag tgttttgttc 123660 
cccagtgagt gtctccatga caaatacaaa tttgaggctg gctttttagg gtggtttctt 123720 
gtgcgacgct tcctgtgtaa ctgcatacac cggggtgtcg ccaggaaacc gcgtctcccc 123780 
tttatgtccg ctcgccctcc cagagcgaaa gtgagaatgg ttcctggggc gttttggcgt 123840 
tgagagagtc gggcgatgtt gccgtagcgg cgtctgcaaa ggctcacccg cttctgtttt 123900 
ttttcttttt gtcagacaac aacatggacg ccttgaacaa taaccttaac ctgctgatgg 123960 
attttctgtc taactattcg aatagctaca gtagttatga cgacaatatg tcttacacct 124020 
tagacacgga atccacgctg tgtcggctga cggtggtttt cccacctacc gtttatgcga 124080 
ttatatgctt ttttattttt tgcattacgc tgtttgggaa cgcgttggtg ctatatattt 124140 
tttttaaatt taaagcgctc gccaactctg tggatgtact gatggctggg ttgtgttgta 124200 
actccctgtt tctgtgcgcg tcgtttttgt tcagctggct gctgtacgtc gcgccacaga 124260 
tgctcacgtc cgcgacgtgc aaggtggaaa tctttttctt ttacctgtac acgtactttg 124320 
gcgtgtacat tgtggtgtgt atcagcctta tcaggtgcct gttagttgtg ttt'tcccgcc 124380 
gcccgtgggt caagcacggg gcctccggct ttctctgcgt gtgtgtgtct ttaatcgtgg 124440 
cgctggcgct gtctgccaac gcgagcctct ataggacggc cctgcgtcac ccagagacca 124500 
gcgagtggat atgctacgaa gatgccgggg aagataccgt caactggaag ctgagaatca 124560 
gaaccaccag cgcgatctgc gggtttttgg ttccgtttgg gctgatggtg ctcttttacg 124620 
gacttacgtg gtgtatggtt aaaagcacga agctggccag aaagggagcc gttaggggtg 124680 
taattgtgac ggtggtggtg ctgtttttaa ttttttgcct gccctatcac ctgtgcaact 124740 
tttttgacac cctgttgagg accggttttc tggccgaaac gtgctacctc agggacgtga 124800 
tcagcgtggc catgcacata tgctccctgc tacagagcat gtatagcgcg ttcgtgccag 124860 
tcgtgtattc tggtcttggg tctctgttta ggagaagggt tagggatacc tggtccgtgt 124920 
ttaggtgttt ttccacttca ggtagtttat gagacactca cgcgacactt ggttggattg 124980 
tttgtgtaca tttattttca ttttgtgtac atttattttc attaaagcga tctgacctgc 125040 
agaccttacc tgacgtttac tgtctgtttc ttatgcacca gaggaacagg gactggaagg 125100 
ccagcgccca cggggaaact gtcatgtccg caagctcggg cggcacgtac gcccactgcc 125160 
aggggtggaa cgtcacggcg gggtcgacca gggaggccag gtggcgcccg tccggtgacg 12522 0 
tgatggcggc cacggccgag ttggcagacg ggttgcgtgg gtagtgctgc gcgaacatcc 125280 
tcgggtcggc gttgcccgtg tggtagttca gggcgatgcg ctgctgctgg ttgagatggt 125340 
actccatggc gtcgcgcggg tatctcacgc ccaggtaccg gccgttggcc caccctggga 125400 
ggacgaggcc ccgaagaacc ctaaacatga tgctgatggt ggtctggggg atgtggaggt 125460 
ttagccagag gcactcgtgg ttccctgatg cgttctcctc caggtggatg tcccactggt 125520 
cgggggtttt gggtccgggc gtgtcgtgag gggtctctct aagaagaccg agcgccccca 125580 
ggagctggaa cccaaactcc ccgcagcata gtgaaaatgt atccgctcgg cggaagaagg 12564 0 
ccataaggcc cccatagcac ccagggtcgt tgagaagacc catgatcgcg catcgggccc 125700 
ccacgtagct gtcttcgatg cccacggttc caccgatggt cagaccggag aatccccgga 125760 
ggatgttccc tcctctaagg tcgtcggtgg agacggccgc gacgtcgaac ccgacgttgg 125820 
tgaaggcggc catcagcgcc ctgggaagcg gggcaccggg ggtgaccaag gcggccactg 125880 
ctggcggcct cgacggcgtt gcaaacagag tcagttcgct gtttctgcaa acctcggcga 125940 
ggtggcccag gttgtgttgg ttgcacccgt agtcctttct gtagagttcc tgcgcgggcg 126000 
tgaagctggg cccccatgag taccactgtt cgtccgagaa cgaggtccag tttgccgcca 126060 
ccgaagtgag ggtctgtgaa tagacttcat cgttgttgtg tgagatgacg attctttctg 126120 
acagaccctc ctgacccacg gtgccgcaca cggtggcccg gcagtcaaag ttttggcacg 126180 
cctggcgcac ctcgtcgacg tgctggggct ggatttcgaa gatgacgccc ggggtttcgg 126240 
acaccagcca ctgcaggggc gtttcctctg acgggatgcg aatctgtagg cctcggttac 126300 
cggccagggc catctcgata gcggtaacca ccatgcctcc gtcgctgacg tcgtggccgg 126360 
acaccaccag accccgggag agcagggcct ctgttagcat gaagaggttg gccaggtgtg 126420 
tcgcctgaac gtccggtagg gtgggactcg gcaagagaca caggtgctca aaggtcgatc 126480 




cctgggtcag gtgcggcacg gggaaggaca ccagcacgat gaggttaccg gtggccttca 126540 
ggtcaggtgt gacgcgttgc ctggacgact tcacctcggc ggtggccgtg atgacgactg 126600 
cgttaaacgg cacgggggct accgtctgat gctgttgggt ggcgctgatc agttcttccg 126660 
acaggcacgg ggtgctgccg gccgacgtga tggcgaagtt gatgctcagg tccctgcata 126720 
gttccttgca ggcaaacagt gtgtgctgca ggagccaggc ctggtcgtcc tccgggttcc 126780 
aaccgaccga cgccgtcagc gtgatgtcgc ttagacgccg cacgtgcgct agcatgatgt 126840 
tggtgaccgc ctcgcagatg gcgtatctcg cgcccactgc cgcgttgatg cccatcttgt 126900 
atgcctgttc tccataagcc acgcactgtc cgacccaccg gttcctgccc ctgccctggg 126960 
tgacccaggt ttctgggcta accaggcgtt ccgtcgcgtc ctgcctggtg atcggctggg 127020 
ggcgatatat gggcacccgc gccattctgt ccgtaaacac cgacgtgtgg ttgacaatgt 127080 
ggtaatctga cagcggccgg cccaggggtc ccacttcgca ctgctgagcc acgagtccgt 127140 
tggagcatct gtccacgtgt cgcgtgacaa actctttgct gccaaccgtc ggacatcgca 127200 
gaagctgatc gaccacggag gccaggcgga ataggcccca gtcggtcggc gagggtgcct 12 7260 
gttcgccacc atcctcaggt tcttctgggc gcaccagcca gttggaggaa acgggctgct 127320 
cgtcgaaccc aaaagttgcg atgacttcgg ccgatgttgg ctgtctgatc ggccacgctc 127380 
ccgggttcca cagctccagg tcgttcacga actgaattcc aagctcttcg caggtttccc 127440 
caacgacggc gaaagggcat ccgcagatgc gtgcggccct cctgatggca tccaacgggc 127500 
ccggttccgg ctccgtgtcg cgcgggagga cggcaaagac cacgcaggca gacacgttga 127560 
gaaacttgtt aactaggatg tcgttcacgg tttcccgtga gatgtctggc gtggcggtca 127620 
gttggctcag aatggttggt ggaagctgag agacaaagag gcgcaggccg ccccgcgaga 127680 
ccagggctcc caggtggccg atgacagagg cgggttcatg ggcgcgatta atgctggtaa 127740 
ataggggccc ttccaaaatc tcgtaagcca gcgacaacgt ttgggtgatg cgggtaatac 127800 
ctgcggcgct ccccaggtaa agcgagcgct ggttgccatc gccgtcaacg ggttcaaaag 127860 
acccaagaca gatgataggc tgactgaata aagacctcct gaggaatgtg tatgcttgag 127920 
gcacgcacga taacagcgat gtggtggtaa taacgttcgc ctcgctggtg gtggcgctgt 127980 
ggacggtttt aagaaagcca ccgaggacag gcgtgccggt gcgcgccagc tgctggcagt 128040 
ggagtgtcgc tgttgtgtta atagctctgt ggcgtgcgcg agaaaatgcg cccactgacg 128100 
gcgtagtggc gtaaaatcca cacagggccg atctgatttc gcccccgggc agcagcgtgg 128160 
catggaacag aatgttttgc agcgcacctg cacctgtctg aatggcgctg tgcaggtgtg 128220 
taaatgagtc agctatgtgg gttacgtttt cgcacgggag gcgctggaaa acgcagctgt 128280 
agctcctgcg ggccatgacg gcgtatctgg acaccgtata gctgttgccg ttggaggcct 128340 
gcacggccag cggtaatatg ttccgttgta acgggagcat caccgcggcg catattgggt 128400 
ctctctcggg caacggtctc caggcggtgg attgcaatgt caccgggagc tggcgctgca 128460 
cgcgcacatc atattctact agtgtgccaa gtgtcagggc ggccaaggtg cccgggggta 128520 
cccgatcggc gacaaaccgt ctcgtctggc agctcagatg tctgcccacc tctaaccgca 128580 
acgttctatc gaggccgctg tctctaaggc actgcactat ttcgctagat cttgtggtag 128640 
gttggcgcat caggtctggg ccatagggca gggatatagt atgatggccc tggaggtgcg 128700 
ccgaccgagt ttccagaggt gcataattaa ccagaggcgc cagaagatgc cagataacgc 128760 
ccagcgctgg gtcgtcatcg tcgttgtcaa agttatcaga ggggcgtgtt acaagccaaa 128820 
gtaaaaattc actaaagcac tgttctggca caagcgccaa aacactttct gcgtccgaat 128880 
tgtcatgaat aaaagcctcc tcttctggac taagggctgc agcggcccag cgagggttcg 128940 
tgcgttgggc catatttgct aagtctagcc ggcgcaaaag caagcgggtg ggactttaat 12 9000 
agggcgtata ggggttgtta ggtatgcggt ggggagccgt ccatgttaca tggacttcaa 129060 
tattaattta tgttagtttt gggcaccttg ggatgttaca cttataactt cccccatgtc 129120 
acagtaatca cttgacagat ccccttctaa ctcataacaa ctgtttcggg gtgttgaggt 129180 
tacaggtaag ggtgggaggg gaatgttagg tggcgaacca aagcataacg tcgcggggcg 12924 0 
ggagtgggag cctgtaaccc cagagatgtc acttacgctt gttaggcgat cgacaactac 129300 
ctccgaaagt gtgtcctgtt ggatgttttg tggattaggt aaaacagaaa gttgtggaga 129360 
aagtaacctt atagatgacc tcctgctggc gcgtggctgg gtgacagctg gttgttgatg 129420 
aatgaacact tcctcgggcg ttcctggggt ggtgggtgtg gtctcactaa ggacggccaa 129480 
ctcaacacta acttcctccc gttcagaccg tgagcggttt cgtctgtagc gtcttgtgga 129540 
gggggttgcc tgaacccttg ttggtgtaaa caatccaggt aaccggtaaa tgcacgcggt 129600 
cagggcgatt tgtaacaaac ctaaaaacgc ataataagct tacaaacgca tttttggtaa 129660 
cagtttgtat aatttagccc caaagtttat acaaagcgaa actacttacc gattagtaaa 129720 
atgcaaaccg caagaaggat aacactcact gattctgttc caattggttc gaatttaaga 129780 
atggagagta aatttgcccc gctgagaatg cctggaaata agaggcgcgt attttacatg 129840 
tggtacaggg tagaaaacct tgtaggtttc agttgagtca ccctgccacg taaacccgcc 129900 
tagagtgctg cacgcggtgt tggttgcagg aagtacatcc taattttaac tctaaaaggg 129960 
gttccttgta aaattattta agttatggtg tacttacata atatccaatg agtagttaca 130020 
ctccaaaaag atactcgata aaatgcgtga tatgtgaagt ttgcaacaaa tgcagcggtg 130080 
agtaggccca agaaaatgat aaaccctata tgaggaaatt ttattagtac gtttgtgtaa 130140 
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ctgcaaaata aaattttaaa taaatacttt atattcatgt taccaagttt aatttgtggt 130200 
ggttgttctt gaattatatg tactcctagt tctgaggcaa ctgtatgtag tgcaagcacg 130260 
ctgccttgca acataaccaa taggcatgtt ccatgattcg atattcgcca tgccatttta 130320 
agtgccacgc agattgcagc aatagctata aaaaaacaaa agtatgttaa tagtctataa 130380 
agtataaaac attaaaacaa aattaaaaaa ttaaatacct gtatttgccg taactataga 130440 
gtatgataat agtaaggtgt aagactgtgt tgttgtagtt tctgttaatg cataaagtcc 130500 
cacaagtgct aaaataaaaa aataaatgtt agttatttaa tttttaaaag tttatactat 130560 
tcaagtaatt caaagttgtg tcatacctgc gtaaattgca aaccaagatc caattattct 130620 
tagtatgttt gtttttccac atagccacac ataaccataa aacacaatgg gcattaaaat 130680 
tactggcaac agcgcagata ctttgtagtt tggagtccac ataagaaatg ccgtaagaat 130740 
aacagtgcct atccatgttg gtcccacaag aagacatgat gttgttttac tagaaacaat 130800 
tttcggtggt tttataacaa atattaatat gcctgaaaaa tataaacaaa cgattaattt 130860 
aaggccaata tcatagtaaa acaaacttta aacaggtaaa aagttacgtc ttacctatgt 130920 
aaaattgaga tacgcatgca atagcactac acgtctgtac caaaatactt gcgttgtcca 130980 
aggccaatgc cgatagtaaa acacaacatg ctgcaaacaa gtacacaaaa tataagttta 131040 
atttgctaat taaaattaat atttcaaatg tattataaaa tcatacctgt aatccatggg 131100 
aaaaaaattg aaggcaacgg agttaattga cctgctctgc ttcgattaca atacaaacta 131160 
atagttttcc ataaaagaga acaacacgta acaaatgaac aaataattgt gactgcttga 131220 
ttagaagtgt ccctggcatg ttgtaccaag gctattatcc aaaacgttaa tgtaagaagg 131280 
tttgtaagta accaactact ccaaataaaa aaggtatagc tacacgccat tttaaaataa 131340 
tcaaaactgt attatcaagt taagggggag tttgagcttt aaaaggttgg ttacatcgtt 131400 
tccaggcaaa cctcacactg agtcacacaa aggtttctgt gtgcccctgt acgggaatct 131460 
gccttattgc tgattttgtt ggcaaccaac caaagaaatc atggaaaagc agagtgggaa 131520 
tatttgtttt tgttcgttgg cctggatgct tggaaacatt cctgttattt tgatgttagg 131580 
ccaaactgtg gaaaacatta agagttgtta ttgaaaatat tttttggtaa caaaaccatg 13164 0 
ttttaggtcc caaaaggaac ataaaaatgt ttaatagaaa catgcctagt acagaacatt 131700 
ttgtttggta cgtaaactaa aaatatgtat ggtaagggct aggggtaagg gcaaggggta 131760 
agggctaggg gtaagggcaa ggggtaaggg caaggggtaa gggcaagggg taagggcaag 13182 0 
gggtaagggc aaggggtaag ggcaaggggt aagggcaagg ggtaagggca aggggtaagg 131880 
gctaggggta agggctaggg gtaagggcta ggggtaaggg ctagggtgag ctagggtgag 13194 0 
ctagggttat agttatagtg taaaataata ggcctacgat aaaatgtcaa catatatttt 132000 
ttaatttgtg tttcattaac agccatggtg tattagagta aaggttaacg cttcaataca 132060 
atatacaagt aaaagtaaac cacacgaatt tataacatat ttacaaaagc gaaccaagct 132120 
ggcacatgta acttcacgct accctaaacg tacaatactg ggactagaac ccagaggtag 132180 
ttagaatata cggtagttac agaactttgc agttccctta ggccagcagg gctctgcggt 132240 
taattaaaca aagtttaaag ataactgaaa ctttaggaag tgcgtatggg tgcaatgtgt 132300 
tccaaatagg gcaagggtta cataaactgt tgcctagcgg ccgggcccgg aggcgcccgg 1323 60 
ccggcgccgc cgccgggccg cggccgccat cttgcgcccg gggcgagggt cccccgcgcg 132420 
ccccccgggc ccgcgcgccg ggcgccgccg gcctcccccg gctcccggcc ctccgccccg 132480 
ccggcccggg cccgcggccg ccgcccccgg ccgccccgcg ccgcgcgccg ggccgctttc 13254 0 
ggttcgcggg gccgggggtc ccgcgggggg ccggggccgc ccccgggtgc cgctccgccg 132600 
ggcccggccg actcccggga gcgccccggt ccggccgagc ccggcggcgc cgcgccccga 132660 
cgccccccgg gcccggggcc ccacaagccg cggcgcgcgg ctcccgatgc cgggcggccg 13272 0 
ccgcccggca tggcggtcct ccgccggcct cccctccccc acgccgcccc gaaaggtggt 132780 
ctccgcgccg ccgggagggg ggccggggcc cggggcgcct cggcggggcc cggcgcgggg 132 84 0 
cgcgaccgag ggccccggga gaacggggga tcgggaaaac gcgaggggag cgggggacag 132900 
gggacggcgt gtgcgtgctt gtgagacacc gggtacggct gcctgcctgc tcgctggcct 132960 
gcttgctgag gggacagtag gcctgcttgc tcgctggcct gcttgctgag gggacagtag 133020 
gcctgcttgc tgaggggaca gtaggcctgc ttgctcgctg gcctgcttgc tgaggggaca 133080 
gtagggctgc tggcttgcta gtagggctgc tcgctggcct gcttgcttgc tcgctggcct 133140 
gcttgcttgc tcgctggcct gcttgcttgc tcgctggcct gcttgcttgc tcgctggcct 133200 
gcttgcttgc tcgctggcct gcttgcttgc tcgctggcct gcttgcttgc tcgctggcct 133260 
gcttgctgag gggacagtag ggctgcttgc ttgctaaggg gacggtacgc ctgcctgatg 133320 
gcttgatagt agggctgctg ggctgctagt agggctgctg ggctgctagt agggctgctg 133380 
ggctgctagt agggctgctg ggctgctagt agggctcctg ggctgctagt agggctgctg 133440 
ggctgctagt agggctcctg ggctgctagt agggctgctg ggctgctagt agggctgctg 133500 
ggctgctagt agggctgctg ggctgctagt agggctgcct gctggct_tgc ttgcttgctt 133560 
gctagtgggg ccgcttgcct gctactaggg ctgctgtgca gctgggagaa cagagtaggg 133620 
ctgccggcca gctgcgtgcg agggcgtccg agggccagac gaggacacgg gacccgggcc 133680 
tctcccccgc ccggaccgcc gggcacccgg cccagatct 133719 
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<210> 2 

<211> 1272 

<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . - (1272) 

<400> 2 

atg ttt gtg ttg gtt tta ttt atg tta ttg caa cct gta teg gtg gag 48 

Met Phe Val Leu Val Leu Phe Met Leu Leu Gin Pro Val Ser Val Glu 
15 10 15 

ctt ttg cct gca aaa tta act tct gtt cca acg tgg tgt cca cca cat 96 
Leu Leu Pro Ala Lys Leu Thr Ser Val Pro Thr Trp Cys Pro Pro His 
20 25 30 

ccc gga gat acc tac ttg eta acc tgc cgc ggg acg tct acg gee aga 144 
Pro Gly Asp Thr Tyr Leu Leu Thr Cys Arg Gly Thr Ser Thr Ala Arg 
35 40 45 

gac cag cga age aca caa tgg ttt cgc aac aac acg ctt atg cgt ggg 192 
Asp Gin Arg Ser Thr Gin Trp Phe Arg Asn Asn Thr Leu Met Arg Gly 
50 55 60 

agt aat ttc tac ggc aga ctg gta tct gtg act ccc aat get acg ata 240 
Ser Asn Phe Tyr Gly Arg Leu Val Ser Val Thr Pro Asn Ala Thr lie 
65 70 75 80 

tct gac egg tat gcg tgt caa aca aaa aca aca acg egg agt aac aac 288 
Ser Asp Arg Tyr Ala Cys Gin Thr Lys Thr Thr Thr Arg Ser Asn Asn 
85 90 95 

ate gat ttt egg gta age tea teg cgc etc acg etc caa gaa egg tgc 336 
lie Asp Phe Arg Val Ser Ser Ser Arg Leu Thr Leu Gin Glu Arg Cys 
100 105 110 

tct tea tac ggc tat act tac gcg aat aac aca agg gta ttg agg tgt 384 
Ser Ser Tyr Gly Tyr Thr Tyr Ala Asn Asn Thr Arg Val Leu Arg Cys 
115 120 125 

tac tct ggt gga aac gta act tta aga aac gtt gtc ttt cat tta aac 432 
Tyr Ser Gly Gly Asn Val Thr Leu Arg Asn Val Val Phe His Leu Asn 
130 135 140 

ggt aca gcg gtc ate aac ggt act aca aca aac ata cat aca ttt gtg 480 
Gly Thr Ala Val He Asn Gly Thr Thr Thr Asn He His Thr Phe Val 
145 150 155 160 

tta aca gaa aag aca gga ggg acg tat ttc tgt tct gcg ttt att ggg 528 
Leu Thr Glu Lys Thr Gly Gly Thr Tyr Phe Cys Ser Ala Phe He Gly 
165 170 175 

aat gaa aaa ttc tat tct cag aca att aat gtg ttt ttt act tea ttt 576 
Asn Glu Lys Phe Tyr Ser Gin Thr He Asn Val Phe Phe Thr Ser Phe 
180 185 190 

acc ttt aaa cct aca aac gac att ccc aat gag tea cat ttt aat aaa 624 
Thr Phe Lys Pro Thr Asn Asp He Pro Asn Glu Ser His Phe Asn Lys 
195 200 205 
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act ggg caa ata caa caa aca get agt gta caa cat cct gaa aac tac 672 
Thr Gly Gin lie Gin Gin Thr Ala Ser Val Gin His Pro Glu Asn Tyr 
210 215 220 

gtt gtg ttc tct gtt ccc gtt ttt tct att ggc gtt tta aca ggt att 720 
Val Val Phe Ser Val Pro Val Phe Ser He Gly Val Leu Thr Gly He 
225 230 235 240 

gca ata teg ttg att atg tgt tgg tta ttt aca ata cgc tgc aac gag 768 
Ala He Ser Leu He Met Cys Trp Leu Phe Thr He Arg Cys Asn Glu 
245 250 255 

aac tct gaa tea tea act aat agt tat gca age cag aca age tac att 816 
Asn Ser Glu Ser Ser Thr Asn Ser Tyr Ala Ser Gin Thr Ser Tyr He 
260 265 270 

caa ccc tec cat aat cag cgt tec aat act aat gaa tgt agt cgc cat 864 
Gin Pro Ser His Asn Gin Arg Ser Asn Thr Asn Glu Cys Ser Arg His 
275 280 285 

ace tac aga aat get cat caa gaa gag agt att gaa gaa eta cca aac 912 
Thr Tyr Arg Asn Ala His Gin Glu Glu Ser He Glu Glu Leu Pro Asn 
290 295 300 

caa cac aca agt gaa act gat tct tgc tgt caa tta gtt tta ctt gaa 960 
Gin His Thr Ser Glu Thr Asp Ser Cys Cys Gin Leu Val Leu Leu Glu 
305 310 315 320 

gtg aaa aat gta gee tac gat gga ccg cag gaa aac aca att aac gaa 1008 
Val Lys Asn Val Ala Tyr Asp Gly Pro Gin Glu Asn Thr He Asn Glu 
325 330 335 

gtt atg gaa cag tat gat gat gtg gtt gta aaa aat ata gaa caa aca 1056 
Val Met Glu Gin Tyr Asp Asp Val Val Val Lys Asn He Glu Gin Thr 
340 345 350 

tea tat gag gat aat gtt gag cac atg gac tat agt gat act ata aat 1104 
Ser Tyr Glu Asp Asn Val Glu His Met Asp Tyr Ser Asp Thr He Asn 
355 360 365 

ccc aat ttt aat tac tac agt gga eta ata ttg gaa gaa gta gat gaa 1152 
Pro Asn Phe Asn Tyr Tyr Ser Gly Leu He Leu Glu Glu Val Asp Glu 
370 375 380 

gtt ttt tac aat gaa eta gaa aat caa tat cat gga tta ata ctg gag 1200 
Val Phe Tyr Asn Glu Leu Glu Asn Gin Tyr His Gly Leu He Leu Glu 
385 390 395 400 

aat tta gat cac aat gag tac aat cat tta aat gaa tta aac atg ata 1248 
Asn Leu Asp His Asn Glu Tyr Asn His Leu Asn Glu Leu Asn Met He 
405 410 415 

gaa caa tat gat tgg tta gaa taa 1272 
Glu Gin Tyr Asp Trp Leu Glu 
420 



<210> 3 
<211> 423 
<212> PRT 
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<213> Macaca mulatta rhadinovirus 17577 
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<211> 567 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (567) 

<400> 4 

atg gac att aca gtt aac tgc ate gtt gca gtt gat gaa caa tta gga 48 

Met Asp lie Thr Val Asn Cys lie Val Ala Val Asp Glu Gin Leu Gly 

15 10 15 

att ggt aaa aac gga aca atg cct tgg ccg tac ctt aga aac gaa atg 96 
lie Gly Lys Asn Gly Thr Met Pro Trp Pro Tyr Leu Arg Asn Glu Met 
20 25 30 

atg tat ttt cag aaa atg acg tct acg cca teg gtt gta ggt gaa aag 144 
Met Tyr Phe Gin Lys Met Thr Ser Thr Pro Ser Val Val Gly Glu Lys 
35 40 45 

aat gtg gtt att atg ggg aaa aga aca tgg ttt tec ate cct gaa aag 192 
Asn Val Val lie Met Gly Lys Arg Thr Trp Phe Ser lie Pro Glu Lys 
50 55 60 

aag cgc cct ctg gtg aac aga att aat att att ttg age aga gaa tta 240 
Lys Arg Pro Leu Val Asn Arg lie Asn lie lie Leu Ser Arg Glu Leu 
65 70 75 80 

cgt gaa cca ccg cac gga gca cac ttt eta gcg aga acg eta gac gat 288 
Arg Glu Pro Pro His Gly Ala His Phe Leu Ala Arg Thr Leu Asp Asp 
85 90 95 

gcg ttt aat ttc tat aga caa tac aaa eta aaa gaa caa tta aac acg 336 
Ala Phe Asn Phe Tyr Arg Gin Tyr Lys Leu Lys Glu Gin Leu Asn Thr 
100 105 110 

gtc tgg gtt att gga gga aaa tct gtt tat gag agt gtt ctt aac tat 384 
Val Trp Val lie Gly Gly Lys Ser Val Tyr Glu Ser Val Leu Asn Tyr 
115 120 125 

aag tgt ccc tta aaa eta tat att acg cgc att atg gag agt ttt gat 432 
Lys Cys Pro Leu Lys Leu Tyr lie Thr Arg lie Met Glu Ser Phe Asp 
130 135 140 

tgt gat gta ttt ttt cca tea att aat ttt acc gaa tac acc atg tta 480 
Cys Asp Val Phe Phe Pro Ser lie Asn Phe Thr Glu Tyr Thr Met Leu 
145 150 155 160 

tea gag ata ccg ggc aaa gat aca aac ttt gag gaa aat gga att aaa 528 
Ser Glu lie Pro Gly Lys Asp Thr Asn Phe Glu Glu Asn Gly lie Lys 
165 170 175 

tat aag ttt cag gta tat gaa aaa aat ttt aat aaa taa 567 
Tyr Lys Phe Gin Val Tyr Glu Lys Asn Phe Asn Lys 
180 185 



<210> 5 
<211> 188 
<212> PRT 
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<213> Macaca mulatta rhadinovirus 17577 
<400> 5 

Met Asp He Thr Val Asn Cys He Val Ala Val Asp Glu Gin Leu Gly 

1 5 10 15 

He Gly Lys Asn Gly Thr Met Pro Trp Pro Tyr Leu Arg Asn Glu Met 

20 25 30 

Met Tyr Phe Gin Lys Met Thr Ser Thr Pro Ser Val Val Gly Glu Lys 

35 40 45 

Asn Val Val He Met Gly Lys Arg Thr Trp Phe Ser He Pro Glu Lys 

50 55 1 60 

Lys Arg. Pro Leu Val Asn Arg He Asn He He Leu Ser Arg Glu Leu 
65 70 75 80 

Arg Glu Pro Pro His Gly Ala His Phe Leu Ala Arg Thr Leu Asp Asp 

85 90 95 

Ala Phe Asn Phe Tyr Arg Gin Tyr Lys Leu Lys Glu Gin Leu Asn Thr 

100 105 110 

Val Trp Val He Gly Gly Lys Ser Val Tyr Glu Ser Val Leu Asn Tyr 

115 120 125 

Lys Cys Pro Leu Lys Leu Tyr He Thr Arg He Met Glu Ser Phe Asp 

130 135 140 

Cys Asp Val Phe Phe Pro Ser He Asn Phe Thr Glu Tyr Thr Met Leu 
145 150 155 160 

Ser Glu lie Pro Gly Lys Asp Thr Asn Phe Glu Glu Asn Gly He Lys 

165 170 175 

Tyr Lys Phe Gin Val Tyr Glu Lys Asn Phe Asn Lys 
180 185 



<210> 6 
<211> 1938 
<212> DMA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (1938) 

<400> 6 

atg acg ttt aaa ctt ttt cct ctg ttt tta tta cac gcc ata atg tac 48 

Met Thr Phe Lys Leu Phe Pro Leu Phe Leu Leu His Ala He Met Tyr 

15 10 15 

gtc cac tgc gat gaa aac tgt aaa cct cca cat ttc acg gaa tat cgc 96 
Val His Cys Asp Glu Asn Cys Lys Pro Pro His Phe Thr Glu Tyr Arg 
20 25 30 

gtc aag tct aac aca gaa aag gac tta tat agt gtt gga gaa aca get 144 
Val Lys Ser Asn Thr Glu Lys Asp Leu Tyr Ser Val Gly Glu Thr Ala 
35 40 45 

gaa tta att tgt cgt cct ggt tat gtt aca aat aca aaa ata ata aca 192 
Glu Leu He Cys Arg Pro Gly Tyr Val Thr Asn Thr Lys He He Thr 
50 55 60 

aca gaa tgt tta caa aat ggt acg tgg tea aca cca aat ttt cca tgc 240 
Thr Glu Cys Leu Gin Asn Gly Thr Trp Ser Thr Pro Asn Phe Pro Cys 
65 70 75 80 

gac aga aaa aga tgt ccc aca cct get gac ttg ctg aat gga gcc gtg 288 
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Asp Arg Lys Arg Cys Pro Thr Pro Ala Asp Leu Leu Asn Gly Ala Val 
85 90 95 



cac att cac ggg gga gat aat gcc tta aaa ttt gga tec aat att tec 
His lie His Gly Gly Asp Asn Ala Leu Lys Phe Gly Ser Asn lie Ser 
100 105 110 



336 



tat gag tgt aat gaa ggt tat gat tta att ggt agt aat gtt cgt ttt 
Tyr Glu Cys Asn Glu Gly Tyr Asp Leu lie Gly Ser Asn Val Arg Phe 
115 120 125 



384 



tgt att tta caa gac aca gaa aac gta aat tgg gat tea aat gaa cca 
Cys lie Leu Gin Asp Thr Glu Asn Val Asn Trp Asp Ser Asn Glu Pro 
130 135 140 



432 



gtc tgt gaa att cag aaa tgt att aaa cca ccg gca gtg gaa cat ggg 
Val Cys Glu lie Gin Lys Cys lie Lys Pro Pro Ala Val Glu His Gly 
145 150 155 160 



480 



gac tac eta cct aac caa gat gtt tat aac tat gga gat gca att aca 
Asp Tyr Leu Pro Asn Gin Asp Val Tyr Asn Tyr Gly Asp Ala He Thr 
165 170 175 



528 



ttt aaa tgt tea ttg teg tat aca etc gtt gga tea aca aca tta gta 
Phe Lys Cys Ser Leu Ser Tyr Thr Leu Val Gly Ser Thr Thr Leu Val 
180 185 190 



576 



tgc acg tea aac aaa aag tgg tea aac tct ttc cca acg tgt tta atg 
Cys Thr Ser Asn Lys Lys Trp Ser Asn Ser Phe Pro Thr Cys Leu Met 
195 200 205 



624 



etc gta tgt gaa agt cca caa ata gac aat ggg tac ata gac att ggc 
Leu Val Cys Glu Ser Pro Gin He Asp Asn Gly Tyr He Asp He Gly 
210 215 220 



672 



ttg tec aga aga tac aac cat gga caa tea att act gta aag tgt age 
Leu Ser Arg Arg Tyr Asn His Gly Gin Ser He Thr Val Lys Cys Ser 
225 230 235 240 



720 



gac ggg tac aac att gtt ggg cct gaa aca tta acg tgc aca aac aca 
Asp Gly Tyr Asn He Val Gly Pro Glu Thr Leu Thr Cys Thr Asn Thr 
245 250 255 



768 



act tgg gtt cca cca tta cct aag tgt gta tta gtt aca aat aac cca 
Thr Trp Val Pro Pro Leu Pro Lys Cys Val Leu Val Thr Asn Asn Pro 
260 265 270 



816 



age aca ccc atg cca gaa aca ccc atg cca gaa aca ccc acg cca gat 
Ser Thr Pro Met Pro Glu Thr Pro Met Pro Glu Thr Pro Thr Pro Asp 
275 280 285 



864 



tat caa aaa ata aat ttg tea acc get aaa act gca aca aca cca aat 
Tyr Gin Lys He Asn Leu Ser Thr Ala Lys Thr Ala Thr Thr Pro Asn 
290 295 300 



912 



gcg ttt gtt aca act gtt gtt tct cea gag aaa gac gac gtt act tgt 
Ala Phe Val Thr Thr Val Val Ser Pro Glu Lys Asp Asp. Val Thr Cys 
305 310 315 320 



960 



gta aag cct cat ttt gag cga ttc atg gta aag get gaa aat gac aag 
Val Lys Pro His Phe Glu Arg Phe Met Val Lys Ala Glu Asn Asp Lys 

43 



1008 



325 



330 



335 



gaa aaa tac agt gtt ggt gcg age gtt gag eta ata tgt cga cca gga 1056 

Glu Lys Tyr Ser Val Gly Ala Ser Val Glu Leu' lie Cys Arg Pro Gly 

340 345 350 

ttt act aaa atg cag tct aca gtt tct gtt gaa tgt ttg tec aac gga 1104 

Phe Thr Lys Met Gin Ser Thr Val Ser Val Glu Cys Leu Ser Asn Gly 

355 360 365 

aca tgg act get cca aat gee aag tgt cat aga aaa aaa tgt cca ace 1152 

Thr Trp Thr Ala Pro Asn Ala Lys Cys His Arg Lys Lys Cys Pro Thr 

370 375 380 

cct caa gaa ctt tta aac gga gag tat ata gtt aca age gga gaa gat 1200 

Pro Gin Glu Leu Leu Asn Gly Glu Tyr lie Val Thr Ser Gly Glu Asp 

385 . 390 395 400 

get ttt aag tac gga aca aat ata aca tat aaa tgt aat gaa ggt tat 1248 

Ala Phe Lys Tyr Gly Thr Asn lie Thr Tyr Lys Cys Asn Glu Gly Tyr 

405 410 415 

caa ctt tta gga agt atg gtg egg att tgt atg ctt aaa gac gat tta 1296 

Gin Leu Leu Gly Ser Met Val Arg lie Cys Met Leu Lys Asp Asp Leu 

420 425 430 



1344 



1392 



1440 



aaa aca gtt gac tgg gag cca aaa gcg cct ata tgt gat att gaa aaa 

Lys Thr Val Asp Trp Glu Pro Lys Ala Pro lie Cys Asp lie Glu Lys 
435 440 445 

tgt aag cca ccg cca caa att aca aac gga aaa tac cat ccg gtg aaa 

Cys Lys Pro Pro Pro Gin lie Thr Asn Gly Lys Tyr His Pro Val Lys 

450 455 460 

gac ttt tat cag tat ttg gac acc gta aca ttt teg tgc aat cgt gac 

Asp Phe Tyr Gin Tyr Leu Asp Thr Val Thr Phe Ser Cys Asn Arg Asp 

465 470 475 480 

ttt tct tta gtt gga gat gaa atg aca acg tgt ata agt aat acg tgg 1488 

Phe Ser Leu Val Gly Asp Glu Met Thr Thr Cys lie Ser Asn Thr Trp 
485 490 495 

aat aaa ccg ttt cca aga tgt gaa caa ate act tgc age get cct aat 1536 

Asn Lys Pro Phe Pro Arg Cys Glu Gin lie Thr Cys Ser Ala Pro Asn 
500 505 510 

att gca cac gga aag ctg eta aca ggt tct tea age gtt tac aaa tac 1584 

lie Ala His Gly Lys Leu Leu Thr Gly Ser Ser Ser Val Tyr Lys Tyr 
515 520 525 

ggt caa tct gtt acc att ggt tgt gaa act gga ttt act eta att ggc 1632 

Gly Gin Ser Val Thr lie Gly Cys Glu Thr Gly Phe Thr Leu lie Gly 

530 535 540 

agt gaa att tct aca tgc aag gat tea teg tgg gat cca cca ctt cct 1680 

Ser Glu lie Ser Thr Cys Lys Asp Ser Ser Trp Asp Pro Pro Leu Pro 

545 550 555 560 

acg tgc gtg cca get gtt tea atg cct tct gac aca cct aaa cca gaa 1728 

Thr Cys Val Pro Ala Val Ser Met Pro Ser Asp Thr Pro Lys Pro Glu 
565 570 575 
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acc aaa aaa cca aac acg cca acg cca gaa gca ccc aaa cca aac acc 
Thr Lys Lys Pro Asn Thr Pro Thr Pro Glu Ala Pro Lys Pro Asn Thr 
580 585 590 

cca aac gtt gga aca cat aca cca ttc aaa cca cca cca caa aat cca 
Pro Asn Val Gly Thr His Thr Pro Phe Lys Pro Pro Pro Gin Asn Pro 
595 600 605 

cca ata gca ccc cca atg agt aaa tgg aaa agg cat gtc gtg tta gtt 
Pro lie Ala Pro Pro Met Ser Lys Trp Lys Arg His Val Val Leu Val 
610 615 620 

ctt ttt gca agt gtc gcg tec ttg tta ttc gta ctt get gec ctt tat 
Leu Phe Ala Ser Val Ala Ser Leu Leu Phe Val Leu Ala Ala Leu Tyr 
625 630 635 640 

tgt tgt ttt eta aaa taa 
Cys Cys Phe Leu Lys 
645 



<210> 7 
<211> 645 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 7 

Met Thr Phe Lys Leu Phe Pro Leu Phe Leu Leu His Ala He Met Tyr 

1 5 10 15 

Val His Cys Asp Glu Asn Cys Lys Pro Pro His Phe Thr Glu Tyr Arg 

20 25 30 

Val Lys Ser Asn Thr Glu Lys Asp Leu Tyr Ser Val Gly Glu Thr Ala 

35 40 45 

Glu Leu He Cys Arg Pro Gly Tyr Val Thr Asn Thr Lys He He Thr 

50 55 60 

Thr Glu Cys Leu Gin Asn Gly Thr Trp Ser Thr Pro Asn Phe Pro Cys 
65 70 75 80 

Asp Arg Lys Arg Cys Pro Thr Pro Ala Asp Leu Leu Asn Gly Ala Val 

85 90 95 

His He His Gly Gly Asp Asn Ala Leu Lys Phe Gly Ser Asn He Ser 

100 105 HO 

Tyr Glu Cys Asn Glu Gly Tyr Asp Leu He Gly Ser Asn Val Arg Phe 

115 120 125 

Cys He Leu Gin Asp Thr Glu Asn Val Asn Trp Asp Ser Asn Glu Pro 

130 135 140 

Val Cys Glu He Gin Lys Cys He Lys Pro Pro Ala Val Glu His Gly 
145 150 155 160 

Asp Tyr Leu Pro Asn Gin Asp Val Tyr Asn Tyr Gly Asp Ala He Thr 

165 170 175 

Phe Lys Cys Ser Leu Ser Tyr Thr Leu Val Gly Ser Thr Thr Leu Val 

180 185 190 

Cys Thr Ser Asn Lys Lys Trp Ser Asn Ser Phe Pro Thr Cys Leu Met 

195 200 205 

Leu Val Cys Glu Ser Pro Gin He Asp Asn Gly Tyr He Asp He Gly 

210 215 220 

Leu Ser Arg Arg Tyr Asn His Gly Gin Ser He Thr Val Lys Cys Ser 
225 230 235 240 

Asp Gly Tyr Asn He Val Gly Pro Glu Thr Leu Thr Cys Thr Asn Thr 

245 250 255 

Thr Trp Val Pro Pro Leu Pro Lys Cys Val Leu Val Thr Asn Asn Pro 
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260 265 270 

Ser Thr Pro Met Pro Glu Thr Pro Met Pro Glu Thr Pro Thr Pro Asp 

275 280 285 

Tyr Gin Lys lie Asn Leu Ser Thr Ala Lys Thr Ala Thr Thr Pro Asn 

290 295 300 

Ala Phe Val Thr Thr Val Val Ser Pro Glu Lys Asp Asp Val Thr Cys 
305 310 315 320 

Val Lys Pro His Phe Glu Arg Phe Met Val Lys Ala Glu Asn Asp Lys 

325 330 335 

Glu Lys Tyr Ser Val Gly Ala Ser Val Glu Leu He Cys Arg Pro Gly 

340 345 350 

Phe Thr Lys Met Gin Ser Thr Val Ser Val Glu Cys Leu Ser Asn Gly 

355 360 365 

Thr Trp Thr Ala Pro Asn Ala Lys Cys His Arg Lys Lys Cys Pro Thr 

370 375 380 

Pro Gin Glu Leu Leu Asn Gly Glu Tyr He Val Thr Ser Gly Glu Asp 
385 390 395 400 

Ala Phe Lys Tyr Gly Thr Asn He Thr Tyr Lys Cys Asn Glu Gly Tyr 

405 410 415 

Gin Leu Leu Gly Ser Met Val Arg He Cys Met Leu Lys Asp Asp Leu 

420 425 430 

Lys Thr Val Asp Trp Glu Pro Lys Ala Pro He Cys Asp He Glu Lys 

435 440 445 

Cys Lys Pro Pro Pro Gin He Thr Asn Gly Lys Tyr His Pro Val Lys 

450 455 460 

Asp Phe Tyr Gin Tyr Leu Asp Thr Val Thr Phe Ser Cys Asn Arg Asp 
465 470 475 480 

Phe Ser Leu Val Gly Asp Glu Met Thr Thr Cys He Ser Asn Thr Trp 

485 490 495 

Asn Lys Pro Phe Pro Arg Cys Glu Gin He Thr Cys Ser Ala Pro Asn 

500 505 510 

He Ala His Gly Lys Leu Leu Thr Gly Ser Ser Ser Val Tyr Lys Tyr 

515 520 525 

Gly Gin Ser Val Thr He Gly Cys Glu Thr Gly Phe Thr Leu He Gly 

530 535 540 

Ser Glu He Ser Thr Cys Lys Asp Ser Ser Trp Asp Pro Pro Leu Pro 
545 550 555 560 

Thr Cys Val Pro Ala Val Ser Met Pro Ser Asp Thr Pro Lys Pro Glu 

565 570 575 

Thr Lys Lys Pro Asn Thr Pro Thr Pro Glu Ala Pro Lys Pro Asn Thr 

580 585 590 

Pro Asn Val Gly Thr His Thr Pro Phe Lys Pro Pro Pro Gin Asn Pro 

595 600 605 

Pro He Ala Pro Pro Met Ser Lys Trp Lys Arg His Val Val Leu Val 

610 615 620 

Leu Phe Ala Ser Val Ala Ser Leu Leu Phe Val Leu Ala Ala Leu Tyr 
625 630 635 640 

Cys Cys Phe Leu Lys 
645 



<210> 8 

<211> 3399 

<212> DNA 

<213> Macaca mulatta rhadinovi rus 17577 

<220> 

<221> CDS 

<222> (1) . . (3399) 
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<400> 8 

atg get tec aaa ggc aac gec gga caa ccc ctg gaa gat aat cag ggg 4 8 
Met Ala Ser Lys Gly Asn Ala Gly Gin Pro Leu Glu Asp Asn Gin Gly 
15 10 15 

tct cgt gec ccg ata ggt gcg tgc gga tac gtg tac gcg tat teg aaa 96 
Ser Arg Ala Pro lie Gly Ala Cys Gly Tyr Val Tyr Ala Tyr Ser Lys 
20 25 30 

caa gac ttt ccc ttt gec gag gcg tec ata etc ggc aac aga cca tct 144 
Gin Asp Phe Pro Phe Ala Glu Ala Ser lie Leu Gly Asn Arg Pro Ser 
35 40 45 

gga tct ggc gtt ttc teg eta cca ate ctt tac gga ctt aca gtt gaa 192 
Gly Ser Gly Val Phe Ser Leu Pro lie Leu Tyr Gly Leu Thr Val Glu 
50 55 60 

cac gaa ttc cct etc acc gta aaa gec gca tac aaa aaa gtt gac acc 240 
His Glu Phe Pro Leu Thr Val Lys Ala Ala Tyr Lys Lys Val Asp Thr 
65 70 75 80 

acg acg etc gec gtt aag gtg acg tgc ttt cac aga gag gtt att gtg 288 
Thr Thr Leu Ala Val Lys Val Thr Cys Phe His Arg Glu Val lie Val 
85 90 95 

ttt cac aat gca agt tta ttc agg ccg gtg ttt gac ggt acc ggt ctt 336 
Phe His Asn Ala Ser Leu Phe Arg Pro Val Phe Asp Gly Thr Gly Leu 
100 105 110 

aac gaa eta tgc gag gaa gec agg get etc ttt ggg tac acg cag ttt 384 
Asn Glu Leu Cys Glu Glu Ala Arg Ala Leu Phe Gly Tyr Thr Gin Phe 
115 120 125 

ata gaa ccg ggt cca cct cac age ata tgg aac cct ctg gaa tgt ccg 432 
lie Glu Pro Gly Pro Pro His Ser lie Trp Asn Pro Leu Glu Cys Pro 
130 135 140 

cag tta ccg gac aag gat gag atg ttt ctt ggc gtt gtt gtt acg gaa 480 
Gin Leu Pro Asp Lys Asp Glu Met Phe Leu Gly Val Val Val Thr Glu 
145 150 155 160 

ggg ttt aag gaa aga ctg tgg agg ggc tgt etc gtt ccc gcg gtg ttc 528 
Gly Phe Lys Glu Arg Leu Trp Arg Gly Cys Leu Val Pro Ala Val Phe 
165 170 175 

cag acc cag cag gtg cag att gec gga cgc cag gcg ttt aaa gtg ccg 576 
Gin Thr Gin Gin Val Gin lie Ala Gly Arg Gin Ala Phe Lys Val Pro 
180 185 190 

ttg tac gac gaa gac ctg ttt gca cct cac ggt cat aga atg cca agg 624 
Leu Tyr Asp Glu Asp Leu Phe Ala Pro His Gly His Arg Met Pro Arg 
195 200 205 

ttt tac cat aaa gac gtt age gcg tac etc tac gac tec etc ttt acc 672 
Phe Tyr His Lys Asp Val Ser Ala Tyr Leu Tyr Asp Ser Leu Phe Thr 
210 215 220 

age ate gee cag gee ctg aga etc aaa gac gtg acg gcg gtc ate cac 720 
Ser lie Ala Gin Ala Leu Arg Leu Lys Asp Val Thr Ala Val lie His 
225 230 235 240 
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gcc aca gaa 
Ala Thr Glu 



gtg cag gca 
Val Gin Ala 



teg tec eac 
Ser Ser His 
275 

tac ggc tgt 
Tyr Gly Cys 
290 

aac tac gat 
Asn Tyr Asp 
305 

agg gtt aac 
Arg Val Asn 



gcg ggt caa 
Ala Gly Gin 



cag aag caa 
Gin Lys Gin 
355 

tec ttt ttc 
Ser Phe Phe 
370 

aag gaa aac 
Lys Glu Asn 
385 

gat ggt tct 
Asp Gly Ser 



teg ccc cat 
Ser Pro His 



cac cac aaa 
His His Lys 
435 

ggc ace gcc 
Gly Thr Ala 
450 

ccg gca acg 
Pro Ala Thr 
465 

ccc gcc gta 



aag caa ttc 
Lys Gin Phe 
245 

aaa cag ttt 
Lys Gin Phe 
260 

atg att gtg 
Met lie Val 



atg ttt etc 
Met Phe Leu 



age tgg ccc 
Ser Trp Pro 
310 

gcg tta gag 
Ala Leu Glu 
325 

ctg ttc get 
Leu Phe Ala 
340 

gcg ccc agg 
Ala Pro Arg 



etc caa cac 
Leu Gin His 



ggc age gaa 
Gly Ser Glu 
390 

teg ttc acg 
Ser Phe Thr 
405 

ctg ctg gcg 
Leu Leu Ala 
420 

age tec acg 
Ser Ser Thr 



gcc aac tea 
Ala Asn Ser 



tgc etc aac 
Cys Leu Asn 
470 

ace ace cct 



atg cag gac 
Met Gin Asp 



tea acg acg 
Ser Thr Thr 
265 

gac age gtc 
Asp Ser Val 
280 

gag tgt ccc 
Glu Cys Pro 
2 95 

ata ttt gat 
lie Phe Asp 



cgc tgg teg 
Arg Trp Ser 



gcc aat teg 
Ala Asn Ser 
345 

gga caa aag 
Gly Gin Lys 
360 

gga ctg ggg 
Gly Leu Gly 
375 

gcc ttt aag 
Ala Phe Lys 



ccg tat cac 
Pro Tyr His 



aag tta tgt 
Lys Leu Cys 
425 

aac cag gcg 
Asn Gin Ala 
440 

gag atg tgc 
Glu Met Cys 
455 

acg ctg ttc 
Thr Leu Phe 



cag cgc agg 



cat tac aaa 
His Tyr Lys 
250 

ctg ccg aaa 
Leu Pro Lys 



gtc gcc gag 
Val Ala Glu 



cag gac gcg 
Gin Asp Ala 
300 

ggt tgt gac 
Gly Cys Asp 
315 

gcc gaa cag 
Ala Glu Gin 
330 

gtg ctg tac 
Val Leu Tyr 



gga gac gta 
Gly Asp Val 



ttt tta aat 
Phe Leu Asn 
380 

ggc gta ccc 
Gly Val Pro 
395 

ctg gcc tac 
Leu Ala Tyr 
410 

tat tac atg 
Tyr Tyr Met 



ttt aac atg 
Phe Asn Met 



acg eta tgt 
Thr Leu Cys 
460 

tat aga ctg 
Tyr Arg Leu 
475 

gac ccc tac 



att gcc aag 
lie Ala Lys 
255 

acg aca gac 
Thr Thr Asp 
270 

etc gcc ctt 
Leu Ala Leu 
285 

tgc gag ttg 
Cys Glu Leu 



tea cca gag 
Ser Pro Glu 



gcc gtt cac 
Ala Val His 
335 

eta act aaa 
Leu Thr Lys 
350 

aac gtg tac 
Asn Val Tyr 
365 

gag gcc acg 
Glu Ala Thr 



tea aac gcc 
Ser Asn Ala 



gcc gcg tct 
Ala Ala Ser 
415 

cag ttc ttg 
Gin Phe Leu 
430 

gtc cat tat 
Val His Tyr 
445 

cac ggc aac 
His Gly Asn 



aag, gat agg 
Lys Asp Arg 



gtg gtg ace 



ata 768 
He 



ggg 816 
Gly 



agt 864 
Ser 



ctg 912 
Leu 



get 960 

Ala 

320 

gtg 1008 
Val 



gtg 1056 
Val 



aac 1104 
Asn 



ate 1152 
He 



etc 1200 

Leu 

400 

ttc 1248 
Phe' 



caa 1296 
Gin 



gtc 1344 
Val 



acg 1392 
Thr 



ttt 1440 

Phe 

480 

gga 14 8 8 



48 



L kJ H . l! ii, 



Pro Ala Val Thr Thr Pro Gin Arg Arg Asp Pro Tyr Val Val Thr Gly 
485 490 495 

aca gcc ggg acc ttt aac gac ctg gag att ctg ggc aac ttc gcg age 1536 
Thr Ala Gly Thr Phe Asn Asp Leu Glu lie Leu Gly Asn Phe Ala Ser 
500 505 510 

ttt aga gac cgc gaa gag gac gga aac ccg gcc gac gag cac cca aag 1584 
Phe Arg Asp Arg Glu Glu Asp Gly Asn Pro Ala Asp Glu His Pro Lys 
515 520 525 

tac acg tac tgg cag eta tgt cag acc gtg aca gaa aag eta tec gcg 1632 
Tyr Thr Tyr Trp Gin Leu Cys Gin Thr Val Thr Glu Lys Leu Ser Ala 
530 535 540 

att gga ate acc gaa gac cac gat aat cac gtg aac etc ate acc aac 1680 
He Gly He Thr Glu Asp His Asp Asn His Val Asn Leu He Thr Asn 
545 550 555 560 

ate caa agt ttt etc agg gtg ttc aag ggt ate gac tea att gtg gac 172 8 
He Gin Ser Phe Leu Arg Val Phe Lys Gly lie Asp Ser He Val Asp 
565 570 575 

gga gag gtc atg aag ttc gtt aat teg atg att aaa aat aac ttt aat 1776 
Gly Glu Val Met Lys Phe Val Asn Ser Met He Lys Asn Asn Phe Asn 
580 585 590 

ttc cgc gag cac gtc aaa teg gtc cat cac ata etc cag ttc tgc tgc 1824 
Phe Arg Glu His Val Lys Ser Val His His He Leu Gin Phe Cys Cys 
595 600 605 

aac gtg tat tgg cag gcg ccg tgc gcg gtg ttc ctg aat ctg tac tac 1872 
Asn Val Tyr Trp Gin Ala Pro Cys Ala Val Phe Leu Asn Leu Tyr Tyr 
610 615 620 

aaa tec ctg ctg tgg ate att cag gat ate tgc ctg ccg tac tgc atg 1920 
Lys Ser Leu Leu Trp He He Gin Asp lie Cys Leu Pro Tyr Cys Met 
625 630 635 640 

ate tac gaa caa gat aat ccg gcg atg ggc ate etc ccc tec gag tgg 1968 
He Tyr Glu Gin Asp Asn Pro Ala Met Gly He Leu Pro Ser Glu Trp 
645 650 655 

eta aag atg cat ttt cag acg ttg tgg acg aac ttt aaa gcg gcg tgt 
Leu Lys Met His Phe Gin Thr Leu Trp Thr Asn Phe Lys Ala Ala Cys 
660 665 670 

etc gac cgc ggg gtc etc acg ggg tgc gaa ctg aaa ate gta cac egg 
Leu Asp Arg Gly Val Leu Thr Gly Cys Glu Leu Lys He Val His Arg 
675 680 685 

gac atg ttc tgc gac ttc ttc gac acc gac gcg ggg tec aac ggc tta 2112 
Asp Met Phe Cys Asp Phe Phe Asp Thr Asp Ala Gly Ser Asn Gly Leu 
690 695 700 

atg gcc ccc ttt aaa atg cag gtt aga ata gcc cga gcc atg atg gtc 2160 
Met Ala Pro Phe Lys Met Gin Val Arg He Ala Arg Ala Met Met Val 
705 710 715 720 



2016 



2064 



gtt ccg aaa tea att aaa ata aaa aat aga ate att ttt tec aac acc 
Val Pro Lys Ser He Lys lie Lys Asn Arg lie lie Phe Ser Asn Thr 



2208 
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725 730 735 

gcg gga tec gag gcg gtg cag teg ggg ttc gtc aaa ccg acg gga acc 2256 

Ala Gly Ser Glu Ala Val Gin Ser Gly Phe Val Lys Pro Thr Gly Thr 
740 745 750 

agg gac act tac gtg gtg gec gga ccg tac atg aag ttt etc aac teg 2304 

Arg Asp Thr Tyr Val Val Ala Gly Pro Tyr Met Lys Phe Leu Asn Ser 

755 760 765 

ctg cat cgc gcg ctg ttc ccc gac acc aag acc gec gcg ctg tac ctg 2352 

Leu His Arg Ala Leu Phe Pro Asp Thr Lys Thr Ala Ala Leu Tyr Leu 
770 775 780 



2400 



tgg cac aag ate tec cag acc aac aaa acc cca gtt ctg aaa gac gtc 

Trp His Lys He Ser Gin Thr Asn Lys Thr Pro Val Leu Lys Asp Val 

785 790 795 800 

ccg gac gac gag ctg gcg gag ctg gtg teg tac gta aag acc aac age 244 8 

Pro Asp Asp Glu Leu Ala Glu Leu Val Ser Tyr Val Lys Thr Asn Ser 

805 810 815 

etc gcg ttc gag gaa acg aac gtg ctg gac gtg gtt ccg gat tea etc 24 96 

Leu Ala Phe Glu Glu Thr Asn Val Leu Asp Val Val Pro Asp Ser Leu 

820 825 830 

atg teg tac gcg agg ate aaa ctg aac ggg gec att eta agg gca tgt 2544 

Met Ser Tyr Ala Arg He Lys Leu Asn Gly Ala He Leu Arg Ala Cys 

835 840 845 



2592 



ggc cag att cag ttc tac gee acg acg ctg cac tgc etc acg ccg gtg 
Gly Gin He Gin Phe Tyr Ala Thr Thr Leu His Cys Leu Thr Pro Val 
850 855 860 

eta cag acg ate gat gec gag gaa tac ccc cac gtg ctg ggc tec gcg 2640 
Leu Gin Thr He Asp Ala Glu Glu Tyr Pro His Val Leu Gly Ser Ala 
865 870 875 880 

gca ate gec aca ccg gtg get tac ctg gca gaa ata cgc ggc cgc acc 2688 
Ala He Ala Thr Pro Val Ala Tyr Leu Ala Glu He Arg Gly Arg Thr 
885 890 895 

gec etc acc gtc cag acg acg gcg cgt cag ccg gtc gee gec aca ggg 2736 
Ala Leu Thr Val Gin Thr Thr Ala Arg Gin Pro Val Ala Ala Thr Gly 
900 905 910 

cgc ctg cgt ccc gtg ata acc gtt ccc atg gta gtc aac aaa tac acg 2784 
Arg Leu Arg Pro Val He Thr Val Pro Met Val Val Asn Lys Tyr Thr 
915 920 925 

ggg gtc aac ggg aac aac aac gtt ttc cac tgc gga aac ctg ggg tac 2832 
Gly Val Asn Gly Asn Asn Asn Val Phe His Cys Gly Asn Leu Gly Tyr 
9,30 935 940 

ttc gcg ggg cgc ggc gtg gac cgc aac ctg tgg ccg gaa age tec ccc 
Phe Ala Gly Arg Gly Val Asp Arg Asn Leu Trp Pro Glu Ser Ser Pro 
945 950 955 960 

ttt aag aaa acg ggc gtc age gee atg eta aga aag aga cac gtc atg 
Phe Lys Lys Thr Gly Val Ser Ala Met Leu Arg Lys Arg His Val Met 
965 970 975 



2880 



2928 
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atg acc ccc att ate gac cgc eta ata aag cga gee gcg gga cag aca 2976 

Met Thr Pro lie He Asp Arg Leu He Lys Arg Ala Ala Gly Gin Thr 

980 985 990 

ate age acg ttc gag gcg gaa age gtt aaa agg age gtg cag gcg ctg 3024 

He Ser Thr Phe Glu Ala Glu Ser Val Lys Arg Ser Val Gin Ala Leu 

995 1000 1005 

tta gag gat aag gac aac cct aac eta ttg aag teg gta ate ttg gag 3072 

Leu Glu Asp Lys Asp Asn Pro Asn Leu Leu Lys Ser Val He Leu Glu 

1010 1015 1020 

ctt ata cga cac ctg ggg aag ggc tgc cag gac tta age tec gag gac 3120 

Leu He Arg His Leu Gly Lys Gly Cys Gin Asp Leu Ser Ser Glu Asp 

1025 1030 1035 1040 

gtg caa tat tac etc ggt gac tat tgt atg ttg acg gac gag gtt tta 3168 

Val Gin Tyr Tyr Leu Gly Asp Tyr Cys Met Leu Thr Asp Glu Val Leu 
1045 1050 1055 

ttt acg ttg gat aat ata gca cag tea ggc gtg ccg tgg act ate gag 3216 

Phe Thr Leu Asp Asn He Ala Gin Ser Gly Val Pro Trp Thr He Glu 

1060 1065 1070 

gac gcg ggt gee eta ata gag gat cgc cag gac gca gac gat ctt cag 3264 

Asp Ala Gly Ala Leu He Glu Asp Arg Gin Asp Ala Asp Asp Leu Gin 

1075 1080 1085 

ttc gta gac age gac gat ate gee acc get tec tgt cag ccc ccc gag 3312 

Phe Val Asp Ser Asp Asp He Ala Thr Ala Ser Cys Gin Pro Pro Glu 

1090 1095 1100 

gaa cag eta ccg acc cct age gee ggc gee eta ctg gee ggg aag aag 3360 

Glu Gin Leu Pro Thr Pro Ser Ala Gly Ala Leu Leu Ala Gly Lys Lys 

1105 1110 1115 1120 

cga aaa att aac gcg ctg ctg age gat eta gac ctt tag 3399 

Arg Lys He Asn Ala Leu Leu Ser Asp Leu Asp Leu 
1125 1130 



<210> 9 
<211> 1132 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 9 

Met Ala Ser Lys Gly Asn Ala Gly Gin Pro Leu Glu Asp Asn Gin Gly 

15 10 15 

Ser Arg Ala Pro He Gly Ala Cys Gly Tyr Val Tyr Ala Tyr Ser Lys 

20 25 30 

Gin Asp Phe Pro Phe Ala Glu Ala Ser He Leu Gly Asn Arg Pro Ser 

35 40 45 

Gly Ser Gly Val Phe Ser Leu Pro He Leu Tyr Gly Leu Thr Val Glu 

50 55 60 

His Glu Phe Pro Leu Thr Val Lys Ala Ala Tyr Lys Lys Val Asp Thr 
65 70 75 80 

Thr Thr Leu Ala Val Lys Val Thr Cys Phe His Arg Glu Val He Val 

85 90 95 

Phe His Asn Ala Ser Leu Phe Arg Pro Val Phe Asp Gly Thr Gly Leu 

51 



100 105 110 

Asn Glu Leu Cys Glu Glu Ala Arg Ala Leu Phe Gly Tyr Thr Gin Phe 

115 120 125 

lie Glu Pro Gly Pro Pro His Ser He Trp Asn Pro Leu Glu Cys Pro 

130 135 140 

Gin Leu Pro Asp Lys Asp Glu Met Phe Leu Gly Val Val Val Thr Glu 
145 150 155 160 

Gly Phe Lys Glu Arg Leu Trp Arg Gly Cys Leu Val Pro Ala Val Phe 

165 170 175 

Gin Thr Gin Gin Val Gin He Ala Gly Arg Gin Ala Phe Lys Val Pro 

180 185 190 

Leu Tyr Asp Glu Asp Leu Phe Ala Pro His Gly His Arg Met Pro Arg 

195 200 205 

Phe Tyr His Lys Asp Val Ser Ala Tyr Leu Tyr Asp Ser Leu Phe Thr 

210 215 220 

Ser He Ala Gin Ala Leu Arg Leu Lys Asp Val Thr Ala Val He His 
225 230 235 240 

Ala Thr Glu Lys Gin Phe Met Gin Asp His Tyr Lys He Ala Lys He 

245 250 255 

Val Gin Ala Lys Gin Phe Ser Thr Thr Leu Pro Lys Thr Thr Asp Gly 

260 265 270 

Ser Ser His Met He Val Asp Ser Val Val Ala Glu Leu Ala Leu Ser 

275 280 285 

Tyr Gly Cys Met Phe Leu Glu Cys Pro Gin Asp Ala Cys Glu Leu Leu 

290 295 300 

Asn Tyr Asp Ser Trp Pro He Phe Asp Gly Cys Asp Ser Pro Glu Ala 
305 310 315 320 

Arg Val Asn Ala Leu Glu Arg Trp Ser Ala Glu Gin Ala Val His Val 

325 330 335 

Ala Gly Gin Leu Phe Ala Ala Asn Ser Val Leu Tyr Leu Thr Lys Val 

340 345 350 

Gin Lys Gin Ala Pro Arg Gly Gin Lys Gly Asp Val Asn Val Tyr Asn 

355 360 365 

Ser Phe Phe Leu Gin His Gly Leu Gly Phe Leu Asn Glu Ala Thr He 

370 375 380 

Lys Glu Asn Gly Ser Glu Ala Phe Lys Gly Val Pro Ser Asn Ala Leu 
385 390 395 400 

Asp Gly Ser Ser Phe Thr Pro Tyr His Leu Ala Tyr Ala Ala Ser Phe 

405 410 415 

Ser Pro His Leu Leu Ala Lys Leu Cys Tyr Tyr Met Gin Phe Leu Gin 

420 425 430 

His His Lys Ser Ser Thr Asn Gin Ala Phe Asn Met Val His Tyr Val 

435 440 , 445 

Gly Thr Ala Ala Asn Ser Glu Met Cys Thr Leu Cys His Gly Asn Thr 

450 455 460 

Pro Ala Thr Cys Leu Asn Thr Leu Phe Tyr Arg Leu Lys Asp Arg Phe 
465 470 475 480 

Pro Ala Val Thr Thr Pro Gin Arg Arg Asp Pro Tyr Val Val Thr Gly 

485 490 495 

Thr Ala Gly Thr Phe Asn Asp Leu Glu He Leu Gly Asn Phe Ala Ser 

500 505 510 

Phe Arg Asp Arg Glu Glu Asp Gly Asn Pro Ala Asp Glu His Pro Lys 

515 520 525 

Tyr Thr Tyr Trp Gin Leu Cys Gin Thr Val Thr Glu Lys Leu Ser Ala 

530 535 540 

He Gly He Thr Glu Asp His Asp Asn His Val Asn Leu He Thr Asn 
545 550 555 560 

lie Gin Ser Phe Leu Arg Val Phe Lys Gly He Asp Ser He Val Asp 

565 570 575 

Gly Glu Val Met Lys Phe Val Asn Ser Met He Lys Asn Asn Phe Asn 
580 585 590 



Phe Arg Glu His Val Lys Ser Val His His lie Leu Gin Phe Cys Cys 

595 600 605 

Asn Val Tyr Trp Gin Ala Pro Cys Ala Val Phe Leu Asn Leu Tyr Tyr 

610 615 620 

Lys Ser Leu Leu Trp lie lie Gin Asp He Cys Leu Pro Tyr Cys Met 
625 630 635 640 

He Tyr Glu Gin Asp Asn Pro Ala Met Gly He Leu Pro Ser Glu Trp 

645 650 655 

Leu Lys Met His Phe Gin Thr Leu Trp Thr Asn Phe Lys Ala Ala Cys 

660 665 670 

Leu Asp Arg Gly Val Leu Thr Gly Cys Glu Leu Lys He Val His Arg 

675 680 685 

Asp Met Phe Cys Asp Phe Phe Asp Thr Asp Ala Gly Ser Asn Gly Leu 

690 695 700 

Met Ala Pro Phe Lys Met Gin Val Arg He Ala Arg Ala Met Met Val 
705 710 715 720 

Val Pro Lys Ser He Lys He Lys Asn Arg He He Phe Ser Asn Thr 

725 730 735 

Ala Gly Ser Glu Ala Val Gin Ser Gly Phe Val Lys Pro Thr Gly Thr 

740 745 750 

Arg Asp Thr Tyr Val Val Ala Gly Pro Tyr Met Lys Phe Leu Asn Ser 

755 760 765 

Leu His Arg Ala Leu Phe Pro Asp Thr Lys Thr Ala Ala Leu Tyr Leu 

770 775 780 

Trp His Lys He Ser Gin Thr Asn Lys Thr Pro Val Leu Lys Asp Val 
785 790 795 800 

Pro Asp Asp Glu Leu Ala Glu Leu Val Ser Tyr Val Lys Thr Asn Ser 

805 810 815 

Leu Ala Phe Glu Glu Thr Asn Val Leu Asp Val Val Pro Asp Ser Leu 

820 825 830 

Met Ser Tyr Ala Arg He Lys Leu Asn Gly Ala He Leu Arg Ala Cys 

835 840 845 

Gly Gin He Gin Phe Tyr Ala Thr Thr Leu His Cys Leu Thr Pro Val 

850 855 860 

Leu Gin Thr He Asp Ala Glu Glu Tyr Pro His Val Leu Gly Ser Ala 
865 870 875 880 

Ala He Ala Thr Pro Val Ala Tyr Leu Ala Glu He Arg Gly Arg Thr 

885 890 895 

Ala Leu Thr Val Gin Thr Thr Ala Arg Gin Pro Val Ala Ala Thr Gly 

900 905 910 

Arg Leu Arg Pro Val He Thr Val Pro Met Val Val Asn Lys Tyr Thr 

915 920 925 

Gly Val Asn Gly Asn Asn Asn Val Phe His Cys Gly Asn Leu Gly Tyr 

930 935 940 

Phe Ala Gly Arg Gly Val Asp' Arg Asn Leu Trp Pro Glu Ser Ser Pro 
945 950 955 960 

Phe Lys Lys Thr Gly Val Ser Ala Met Leu Arg Lys Arg His Val Met 

965 970 975 

Met Thr Pro He He Asp Arg Leu He Lys Arg Ala Ala Gly Gin Thr 

980 985 990 

He Ser Thr Phe Glu Ala Glu Ser Val Lys Arg Ser Val Gin Ala Leu 

995 1000 1005 

Leu Glu Asp Lys Asp Asn Pro Asn Leu Leu Lys Ser Val He Leu Glu 

1010 1015 1020 

Leu He Arg His Leu Gly Lys Gly Cys Gin Asp Leu Ser Ser Glu Asp 
1025 1030 1035 1040 

Val Gin Tyr Tyr Leu Gly Asp Tyr Cys Met Leu Thr Asp Glu Val Leu 

1045 1050 1055 

Phe Thr Leu Asp Asn He Ala Gin Ser Gly Val Pro Trp Thr He Glu 

1060 1065 1070 

Asp Ala Gly Ala Leu He Glu Asp Arg Gin Asp Ala Asp Asp Leu Gin 
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1075 1080 1085 

Phe Val Asp Ser Asp Asp lie Ala Thr Ala Ser Cys Gin Pro Pro Glu 

1090 1095 1100 

Glu Gin Leu Pro Thr Pro Ser Ala Gly Ala Leu Leu Ala Gly Lys Lys 
1105 1110 1115 1120 

Arg Lys lie Asn Ala Leu Leu Ser Asp Leu Asp Leu 
1125 1130 



<210> 10 
<211> 2061 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 



<220> 

<221> CDS 

<222> (1) - - (2061) 



<400> 10 

atg gcc agg gaa etc gca gca tta tac gcg cag ctg teg gec etc gee 48 

Met Ala Arg Glu Leu Ala Ala Leu Tyr Ala Gin Leu Ser Ala Leu Ala 

1 5 10 15 



gtc gac ttg agt ctg gtt ate ttt gcg gac ccg cga agt ate gac ggt 96 
Val Asp Leu Ser Leu Val lie Phe Ala Asp Pro Arg Ser lie Asp Gly 
20 25 30 



gcc cgc att eta aaa aca aaa aca cag ata gag aac ctg aac cgc gac 144 
Ala Arg lie Leu Lys Thr Lys Thr Gin lie Glu Asn Leu Asn Arg Asp 
35 40 45 



ctt ctg ccg ctg eta cgc gag caa aac teg gta gag acg tec age ctg 192 
Leu Leu Pro Leu Leu Arg Glu Gin Asn Ser Val Glu Thr Ser Ser Leu 
50 55 60 

teg etc gaa gtg gag cac ctg gcc aaa aac ate gag gac aaa etc ggc 240 
Ser Leu Glu Val Glu His Leu Ala Lys Asn He Glu Asp Lys Leu Gly 
65 70 75 80 



gag ctg gag cgc agt ctg egg cag aga tat teg age cga gag cat ttt 288 
Glu Leu Glu Arg Ser Leu Arg Gin Arg Tyr Ser Ser Arg Glu His Phe 
85 90 95 



gaa aca eta cac ctg aga ccc gaa tgt cac tat cac tct acg gtt act 336 

Glu Thr Leu His Leu Arg Pro Glu Cys His Tyr His Ser Thr Val Thr 
100 105 110 

ttt cag ttt tac ggg ggc ggg tta ata gat gta aac atg tgc eta ata 384 

Phe Gin Phe Tyr Gly Gly Gly Leu He Asp Val Asn Met Cys Leu He 
115 120 125 



aac gat gta gaa ctg ctg tgt aaa aga eta ggg agt gtg ttt tat tgc 432 
Asn Asp Val Glu Leu Leu Cys Lys Arg Leu Gly Ser Val Phe Tyr Cys 
130 135 140 



ate ggt gca aac gaa get ctg tec gga ttg aac egg gtt ctg acg ttt 480 
He Gly Ala Asn Glu Ala Leu Ser Gly Leu Asn Arg Val Leu Thr Phe 
145 150 155 160 



ctg tea aca ctg egg ggt ate tec ccg ate ccg cac cca gac eta tac 528 

54 




teg ctg gcg gec ata caa gat cac aat ata ttc aaa egg gtg tec gcg 
Ser Leu Ala Ala He Gin Asp His Asn He Phe Lys Arg Val Ser Ala 
260 265 270 

tea ate atg gaa eta tec aat eta att tat tgg aac gee ggg caa ace 
Ser He Met Glu Leu Ser Asn Leu He Tyr Trp Asn Ala Gly Gin Thr 
275 280 285 



aaa eta age gcg act cac ttc tac gac tgt ttc cga ccg gat ccc ata 

Lys Leu Ser Ala Thr His Phe Tyr Asp Cys Phe Arg Pro Asp Pro He 

325 330 335 

gaa tec ctg ttc tgc ggc ggt ctt ttt aac tct ata gac gac ace ata 

Glu Ser Leu Phe Cys Gly Gly Leu Phe Asn Ser He Asp Asp Thr He 

340 345 350 

aac gca ctg age egg gat tgc tec gtg acg ttc ttt caa cag gca aac 

Asn Ala Leu Ser Arg Asp Cys Ser Val Thr Phe Phe Gin Gin Ala Asn 

355 360 365 



576 



624 



Leu Ser Thr Leu Arg Gly He Ser Pro He Pro His Pro Asp Leu Tyr 
165 170 175 

gtc acg tea gtg cct tgc gta cag tgc ctg agg gaa ate gaa etc gta 
Val Thr Ser Val Pro Cys Val Gin Cys Leu Arg Glu He Glu Leu Val 
180 185 190 

cca aat cag ggg tec agt tta etc gcg gtg ttg gca gac cga cac tgc 
Pro Asn Gin Gly Ser Ser Leu Leu Ala Val Leu Ala Asp Arg His Cys 
195 200 205 

gat cac etc tgt aag aag gtt agg gcg gag cca ata cac ggc ctg ttt 672 
Asp His Leu Cys Lys Lys Val Arg Ala Glu Pro He His Gly Leu Phe 
210 215 220 

gag aca gaa ctg age cag ctg ggt eta aaa gta aca aaa cgt teg gac 720 
Glu Thr Glu Leu Ser Gin Leu Gly Leu Lys Val Thr Lys Arg Ser Asp 
225 230 235 240 

gec acg cag cac ggc gtc egg tec tct gca gat cag tta agg gag teg 768 
Ala Thr Gin His Gly Val Arg Ser Ser Ala Asp Gin Leu Arg Glu Ser 
245 250 255 



816 



864 



ggc etc cag ace ggg ace gaa aac gag tgc tea caa atg gee aga ctg 912 
Gly Leu Gin Thr Gly Thr Glu Asn Glu Cys Ser Gin Met Ala Arg Leu 
290 295 300 

eta aca cac gag gee gat atg cac gag cac cgt gcg eta ata aca ccc 960 
Leu Thr His Glu Ala Asp Met His Glu His Arg Ala Leu He Thr Pro 
305 310 315 320 



1008 



1056 



1104 



tat acc aac gtt atg cga aaa caa aac gag ctg ttc acc aga etc aat 1152 

Tyr Thr Asn Val Met Arg Lys Gin Asn Glu Leu Phe Thr Arg Leu Asn 

370 375 380 

age ate ctg cgt cag ggg age gcg gga teg caa aaa ccg gec acc ccc 1200 

Ser lie Leu Arg Gin Gly Ser Ala Gly Ser Gin Lys Pro Ala Thr Pro 

385 390 395 " 400 

teg gag cca egg acc acc acc gtg gcg gca acc gcg gca age gac gtc 1248 

Ser Glu Pro Arg Thr Thr Thr Val Ala Ala Thr Ala Ala Ser Asp Val 
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405 410 415 

ate aaa gac gca cag tat cgc aaa gaa cag tac atg aaa aag gtg gec 1296 

jle Lys Asp Ala Gin Tyr Arg Lys Glu Gin Tyr Met Lys Lys Val Ala 
420 425 430 

agg gac ggc ttt aaa aaa eta aca gag tgt ctg cag acg cag age gcg 1344 

Arg Asp Gly Phe Lys Lys Leu Thr Glu Cys Leu Gin Thr Gin Ser Ala 
435 440 445 

gtg ttg gca aac gca etc tgc atg cgc gta tgg ggg ggc gtc gee tac 13 92 

Val Leu Ala Asn Ala Leu Cys Met Arg Val Trp Gly Gly Val Ala Tyr 

450 455 460 

ggc gag gcg tec gag ctg gtg aac cat ttt etc etc agg egg cgc ttc 1440 

Gly Glu Ala Ser Glu Leu Val Asn His Phe Leu Leu Arg Arg Arg Phe 

465 470 475 480 



gtc gcg ctt ccc tgg gag gcg cgc tgc cgc teg gat cag att tta ttc 
Val Ala Leu Pro Trp Glu Ala Arg Cys Arg Ser Asp Gin lie Leu Phe 
485 490 495 



aac gcg gta caa aag tec gee tgg tgc ttt ata cga gag etc gtc etc 
Asn Ala Val Gin Lys Ser Ala Trp Cys Phe He Arg Glu Leu Val Leu 
595 600 605 



tec eta gcg cgc gag aaa etc tec ate tec aac eta gac gtt aaa ggc 

Ser Leu Ala Arg Glu Lys Leu Ser He Ser Asn Leu Asp Val Lys Gly 

625 630 635 640 

ctg acg tec ggc ctg tat eta acg tac gag caa gac gcg ccg etc gtt 

Leu Thr Ser Gly Leu Tyr Leu Thr Tyr Glu Gin Asp Ala Pro Leu Val 

645 650 655 



1488 



gaa aac tea aag tac att aaa aac tea eta tat tec cag cgc etc agt 1536 

Glu Asn Ser Lys Tyr He Lys Asn Ser Leu Tyr Ser Gin Arg Leu Ser 
500 505 510 

cgc gaa cac gta gag att ate acg ctg cag ttt tac ggc ctg ata acc 1584 

Arg Glu His Val Glu Xle He Thr Leu Gin Phe Tyr Gly Leu lie Thr 
515 520 525 

ggc ccc ctg acg cgc cag age gat etc ttt ccc ggc ccc gec aac gtc 1632 

Gly Pro Leu Thr Arg Gin Ser Asp Leu Phe Pro Gly Pro Ala Asn Val 
530 535 540 

gcg ctg gec cag tgt ttc gag gcg gee gga atg ctt ccg cat cac aag 1680 

Ala Leu Ala Gin Cys Phe Glu Ala Ala Gly Met Leu Pro His His Lys 

545 550 555 560 

atg ctg gtg tea gag atg ata tgg ccc cag att caa ccg aaa gac tgg 1728 

Met Leu Val Ser Glu Met lie Trp Pro Gin He Gin Pro Lys Asp Trp 

565 570 575 

ata gac cag aca ttt aat cgt ttt tac caa ctt ccc gaa ggt gat etc 1776 

He Asp Gin Thr Phe Asn Arg Phe Tyr Gin Leu Pro Glu Gly Asp Leu 
580 585 590 



1824 



teg gtg gcg ctt tat aat cgc acg tgg gaa aag acg ctg egg ata ttt 1872 
Ser Val Ala Leu Tyr Asn Arg Thr Trp Glu Lys Thr Leu Arg He Phe 
610 615 620 



1920 



1968 
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eta att tct caa aat acc ggc tgg ata ttt aaa gac ctg tac get ctt 
Leu He Ser Gin Asn Thr Gly Trp He Phe Lys Asp Leu Tyr Ala Leu 
660 665 670 



ctg tac cat cac ctg caa ctg tec gac ggc cat gat gat aac taa 
Leu Tyr His His Leu Gin Leu Ser Asp Gly His Asp Asp Asn 
675 680 685 



<210> 11 
<211> 686 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 11 

Met Ala Arg Glu Leu Ala Ala Leu Tyr Ala Gin Leu Ser Ala Leu Ala 

15 10 15 

Val Asp Leu Ser Leu Val He Phe Ala Asp Pro Arg Ser He Asp Gly 

20 25 30 

Ala Arg He Leu Lys Thr Lys Thr Gin He Glu Asn Leu Asn Arg Asp 

35 40 45 

Leu Leu Pro Leu Leu Arg Glu Gin Asn Ser Val Glu Thr Ser Ser Leu 

50 55 60 

Ser Leu Glu Val Glu His Leu Ala Lys Asn He Glu Asp Lys Leu Gly 
65 70 75 80 

Glu Leu Glu Arg Ser Leu Arg Gin Arg Tyr Ser Ser Arg Glu His Phe 

85 90 95 

Glu Thr Leu His Leu Arg Pro Glu Cys His Tyr His Ser Thr Val Thr 

100 105 HO 

Phe Gin Phe Tyr Gly Gly Gly Leu He Asp Val Asn Met Cys Leu He 

115 . 120 125 

Asn Asp Val Glu Leu Leu Cys Lys Arg Leu Gly Ser Val Phe Tyr Cys 

130 135 140 

He Gly Ala Asn Glu Ala Leu Ser Gly Leu Asn Arg Val Leu Thr Phe 
145 150 155 160 

Leu Ser Thr Leu Arg Gly He Ser Pro He Pro His Pro Asp Leu Tyr 

165 170 175 

Val Thr Ser Val Pro Cys Val Gin Cys Leu Arg Glu He Glu Leu Val 

180 185 190 

Pro Asn Gin Gly Ser Ser Leu Leu Ala Val Leu Ala Asp Arg His Cys 

195 200 205 

Asp His Leu Cys Lys Lys Val Arg Ala Glu Pro He His Gly Leu Phe 

210 215 220 

Glu Thr Glu Leu Ser Gin Leu Gly Leu Lys Val Thr Lys Arg Ser Asp 
225 230 235 240 

Ala Thr Gin His Gly Val Arg Ser Ser Ala Asp Gin Leu Arg Glu Ser 

245 250 255 

Ser Leu Ala Ala He Gin Asp His Asn He Phe Lys Arg Val Ser Ala 

260 265 270 

Ser lie Met Glu Leu Ser Asn Leu He Tyr Trp Asn Ala Gly Gin Thr 

275 280 285 

Gly Leu Gin Thr Gly Thr Glu Asn Glu Cys Ser Gin Met Ala Arg Leu 

290 295 300 

Leu Thr His Glu Ala Asp Met His Glu His Arg Ala Leu He Thr Pro 
305 310 315 320 

Lys Leu Ser Ala Thr His Phe Tyr Asp Cys Phe Arg Pro Asp Pro He 

325 330 335 

Glu Ser Leu Phe Cys Gly Gly Leu Phe Asn Ser He Asp Asp Thr He 

340 345 350 

Asn Ala Leu Ser Arg Asp Cys Ser Val Thr Phe Phe Gin Gin Ala Asn 



355 360 365 

Tyr Thr Asn Val Met Arg Lys Gin Asn Glu Leu Phe Thr Arg Leu Asn 

370 375 380 

Ser lie Leu Arg Gin Gly Ser Ala Gly Ser Gin Lys Pro Ala Thr Pro 
385 390 395 400 

Ser Glu Pro Arg Thr Thr Thr Val Ala Ala Thr Ala Ala Ser Asp Val 

405 410 415 

He Lys Asp Ala Gin Tyr Arg Lys Glu Gin Tyr Met Lys Lys Val Ala 

420 425 430 

Arg Asp Gly Phe Lys Lys Leu Thr Glu Cys Leu Gin Thr Gin Ser Ala 

435 440 445 

Val Leu Ala Asn Ala Leu Cys Met Arg Val Trp Gly Gly Val Ala Tyr 

450 455 460 

Gly Glu Ala Ser Glu Leu Val Asn His Phe Leu Leu Arg Arg Arg Phe 
465 470 475 480 

Val Ala Leu Pro Trp Glu Ala Arg Cys Arg Ser Asp Gin He Leu Phe 

485 490 495 

Glu Asn Ser Lys Tyr lie Lys Asn Ser Leu Tyr Ser Gin Arg Leu Ser 

500 505 510 

Arg Glu His Val Glu He He Thr Leu Gin Phe Tyr Gly Leu He Thr 

515 520 525 

Gly Pro Leu Thr Arg Gin Ser Asp Leu Phe Pro Gly Pro Ala Asn Val 

530 535 540 

Ala Leu Ala Gin Cys Phe Glu Ala Ala Gly Met Leu Pro His His Lys 
545 550 555 560 

Met Leu Val Ser Glu Met He Trp Pro Gin He Gin Pro Lys Asp Trp 

565 570 575 

He Asp Gin Thr Phe Asn Arg Phe Tyr Gin Leu Pro Glu Gly Asp Leu 

580 585 590 

Asn Ala Val Gin Lys Ser Ala Trp Cys Phe He Arg Glu Leu Val Leu 

595 600 605 

Ser Val Ala Leu Tyr Asn Arg Thr Trp Glu Lys Thr Leu Arg He Phe 

610 615 620 

Ser Leu Ala Arg Glu Lys Leu Ser He Ser Asn Leu Asp Val Lys Gly 
625 630 635 640 

Leu Thr Ser Gly Leu Tyr Leu Thr Tyr Glu Gin Asp Ala Pro Leu Val 

645 650 655 

Leu He Ser Gin Asn Thr Gly Trp He Phe Lys Asp Leu Tyr Ala Leu 

660 665 670 

Leu Tyr His His Leu Gin Leu Ser Asp Gly His Asp Asp Asn 
675 680 685 



<210> 12 
<211> 2490 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (2490) 

<400> 12 

atg atg ata act aac cga acg cgt cgt etc ctg egg gcg tgg gtc gtg 

Met Met He Thr Asn Arg Thr Arg Arg Leu Leu Arg Ala Trp Val Val 
15 10 15 

ata ate gcg ate ggc acg gcg gtt ggc gaa aac gtc acc acc ccc aag 
He He Ala He Gly Thr Ala Val Gly Glu Asn Val Thr Thr Pro Lys 
20 25 30 
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ggc gcg acc acc acc gcg aag cca acg ccg ggc ccg teg acg ccc aca 144 

Gly Ala Thr Thr Thr Ala Lys Pro Thr Pro Gly Pro Ser Thr Pro Thr 
35 40 45 

cct ccc gag aac cca cct agg gec gag gcg ttt aag ttt cgc gtg tgc 192 

Pro Pro Glu Asn Pro Pro Arg Ala Glu Ala Phe Lys Phe Arg Val Cys 
50 55 60 

age gec teg gec acc ggc gaa etc ttc agg ttt aac etg gaa aaa acg 240 

Ser Ala Ser Ala Thr Gly Glu Leu Phe Arg Phe Asn Leu Glu Lys Thr 
65 70 75 80 

tgt ccg ggc acc gag gac aag acg cac caa gaa ggc ate ctg atg gtg 2 88 

Cys Pro Gly Thr Glu Asp Lys Thr His Gin Glu Gly lie Leu Met Val 
85 90 95 

ttt aaa aaa aat att gtc ccg cac ate ttt aag gtc aga egg tac cgc 336 

Phe Lys Lys Asn lie Val Pro His lie Phe Lys Val Arg Arg Tyr Arg 

100 105 110 

aag gtg gec acc teg gtg acc gtc tat cga ggg tgg acc gag acc gec 384 

Lys Val Ala Thr Ser Val Thr Val Tyr Arg Gly Trp Thr Glu Thr Ala 
115 120 125 

gtg acc ggc aag caa gag gtc ate cga ccg gtg- ccg cag tac gag ate 432 

Val Thr Gly Lys Gin Glu Val lie Arg Pro Val Pro Gin Tyr Glu lie 
130 135 140 

aac cac atg gac acg acc tac cag tgt ttc age tec atg cgc gta aac 480 

Asn His Met Asp Thr Thr Tyr Gin Cys Phe Ser Ser Met Arg Val Asn 

145 150 155 160 

gtc aac ggc ata gta aac acc tac acg gac agg gac ttc act aac cag 528 

Val Asn Gly He Val Asn Thr Tyr Thr Asp Arg Asp Phe Thr Asn Gin 
165 170 175 

acc gtg ttt ctg caa ccg gtc gag ggg etc acg gat aac ate cag cga 576 

Thr Val Phe Leu Gin Pro Val Glu Gly Leu Thr Asp Asn He Gin Arg 

180 185 190 

tac ttc agt cag ccg gtg ctg tac acg aca ccg gga tgg ttt ccg gga 624 

Tyr Phe Ser Gin Pro Val Leu Tyr Thr Thr Pro Gly Trp Phe Pro Gly 
195 200 205 

att tac agg gtc aga acc acg gtc aac tgc gag ate gtg gac atg ate 672 

He Tyr Arg Val Arg Thr Thr Val Asn Cys Glu He Val Asp Met He 
210 215 220 

gcg cgt teg gcg gaa ccg tac teg tat ttt gtc acc gec ctg gga gac 720 

Ala Arg Ser Ala Glu Pro Tyr Ser Tyr Phe Val Thr Ala Leu Gly Asp 

225 230 235 240 

acg gta gag gtg tec ccg ttc tgc cac aac gac tea acg tgc teg gtc 768 

Thr Val Glu Val Ser Pro Phe Cys His Asn Asp Ser Thr Cys Ser Val 
245 250 255 

gcg gag aaa acc gaa aac ggc etc ggc gec cgc gtg etc aca aat tac 816 

Ala Glu Lys Thr Glu Asn Gly Leu Gly Ala Arg Val Leu Thr Asn Tyr 

260 265 270 



59 
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acc ate gtc gac ttc gcg acc cgc cag ccc acc acc gaa acg egg gtc 864 

Thr lie Val Asp Phe Ala Thr Arg Gin Pro Thr Thr Glu Thr Arg Val 
275 280 285 

ttc gec gac teg gga gaa tac acc gta teg tgg aag gcg gag gac ccc 912 

Phe Ala Asp Ser Gly Glu Tyr Thr Val Ser Trp Lys Ala Glu Asp Pro 

290 295 300 

aag teg gcg gtc tgc gcg ctg acg etc tgg aaa acc ttc ccc agg gcg 960 

Lys Ser Ala Val Cys Ala Leu Thr Leu Trp Lys Thr Phe Pro Arg Ala 
305 310 315 320 

ata cag acg acg cac gag gec age tac cac ttc gtg gec aac gac gtg 1008 

lie Gin Thr Thr His Glu Ala Ser Tyr His Phe Val Ala Asn Asp Val 
325 330 335 

acg gcg acc ttc acg tec ccg etc tec cag gta act aac ttc acg ggc 1056 

Thr Ala Thr Phe Thr Ser Pro Leu Ser Gin Val Thr Asn Phe Thr Gly 
340 345 350 

acg tac ccc tgc etc aat gat gtt att cag aaa acc etc aac gec acc 1104 

Thr Tyr Pro Cys Leu Asn Asp Val lie Gin Lys Thr Leu Asn Ala Thr 
355 360 365 

ate aag aag ctg tec gat acc cac gca aca aac gga teg gag cag tac 1152 

lie Lys Lys Leu Ser Asp Thr His Ala Thr Asn Gly Ser Glu Gin Tyr 

370 375 380 

tac gaa acc gag ggg ggt ctg ttt etc ctg tgg cag ccg tta acg ccg 1200 

Tyr Glu Thr Glu Gly Gly Leu Phe Leu Leu Trp Gin Pro Leu Thr Pro 
385 390 395 400 

eta age eta get gac gag atg cgc gaa tta aac ggc acc acg cca gca 124 8 

Leu Ser Leu Ala Asp Glu Met Arg Glu Leu Asn Gly Thr Thr Pro Ala 
405 410 415 ■ 

ccc ccc acc aca acc tea acc gec aac cgc gtt cga aga age gtc ggt 12 96 

Pro Pro Thr Thr Thr Ser Thr Ala Asn Arg Val Arg Arg Ser Val Gly 
420 425 430 

acg aac gag cag gca acg gac gac eta gcg gcg ccc cag ctg cag ttc 1344 

Thr Asn Glu Gin Ala Thr Asp Asp Leu Ala Ala Pro Gin Leu Gin Phe 
435 440 445 

gec tac gac aag etc cgc gcg age ate aac aag gtg ctg gag gag etc 1392 

Ala Tyr Asp Lys Leu Arg Ala Ser lie Asn Lys Val Leu Glu Glu Leu 

450 455 460 

tec agg gcg tgg tgc cga gaa cag gtg agg gac acc tac atg tgg tac 144 0 

Ser Arg Ala Trp Cys Arg Glu Gin Val Arg Asp Thr Tyr Met Trp Tyr 
465 470 475 480 

gaa ctg age aag att aac ccc acc age gta atg acg gcg ata tac ggg 14 88 

Glu Leu Ser Lys lie Asn Pro Thr Ser Val Met Thr Ala lie Tyr Gly 
485 490 495 

egg ccg gtg teg gee aag ttc gtg ggc gac gec ate tec gtg acg gac 1536 

Arg Pro Val Ser Ala Lys Phe Val Gly Asp Ala lie Ser Val Thr Asp 
500 505 510 

tgc gtg gcg gtg gac cag gcg tec gtc age ate cac aag age etc cgc 1584 



60 



Cys Val Ala Val Asp Gin Ala Ser Val Ser lie His Lys Ser Leu Arg 

515 520 525 

acg tec acc ccg ggg ate tgc tac teg cgc ccc ccg gtc acg ttc agg 1632 

Thr Ser Thr Pro Gly lie Cys Tyr Ser Arg Pro Pro Val Thr Phe Arg 
530 535 540 

ttc etc aac age acc acg ctg ttc aag ggc cag ctg gga ccc aga aac 1680 

Phe Leu Asn Ser Thr Thr Leu Phe Lys Gly Gin Leu Gly Pro Arg Asn 

545 550 555 560 

gag ate ata ctg acg gac aac cag gtg gag gcg tgc aaa gag acg tgc 172 8 

Glu He He Leu Thr Asp Asn Gin Val Glu Ala Cys Lys Glu Thr Cys 

565 570 575 

gaa cac tac ttc ata gcg age aac gta acc tac tac tac aaa gac tac 1776 

Glu His Tyr Phe He Ala Ser Asn Val Thr Tyr Tyr Tyr Lys Asp Tyr 
580 585 590 

gtc ttc gtg aaa aaa att aac acc tec gag ata tec acc etc ggt acg 1824 

Val Phe Val Lys Lys He Asn Thr Ser Glu He Ser Thr Leu Gly Thr 

595 600 605 

ttc ate gee ctg aac ctg teg ttt ata gag aac ata gat ttc agg gtc 1872 

Phe He Ala Leu Asn Leu Ser Phe He Glu Asn He Asp Phe Arg Val 
610 615 620 

ate gag ctg tac age cgc gcg gag aaa aag ctg tec ggg age gtt ttc 192 0 

He Glu Leu Tyr Ser Arg Ala Glu Lys Lys Leu Ser Gly Ser Val Phe 

625 630 635 640 



gat ata gaa acc atg ttc agg gaa tac aac tac tac acg caa cgc ctg 
Asp He Glu Thr Met Phe Arg Glu Tyr Asn Tyr Tyr Thr Gin Arg Leu 



645 650 655 



atg etc atg ate ctg gtg att gtg gcg gtc gtc ctg ate gtg ttt gcg 
Met Leu Met He Leu Val He Val Ala Val Val Leu He Val Phe Ala 
725 730 735 



tac ccc gac ata gac aaa atg cag ccc tct ggc ggt aaa gtc gac cag 
Tyr Pro Asp lie Asp Lys Met Gin Pro Ser Gly Gly Lys Val Asp Gin 



1968 



gcg gga etc egg gag gac ctg gac aac acg ate gac ctg aac cgc gac 2016 
Ala Gly Leu Arg Glu Asp Leu Asp Asn Thr He Asp Leu Asn Arg Asp 
660 665 670 

cgc ctg gee cgc gac ctg tec gag ata gtc gcg gac ctg ggc gat gtc 2064 
Arg Leu Ala Arg Asp Leu Ser Glu He Val Ala Asp Leu Gly Asp Val 
675 680 685 

ggc cgc acg gtc gtt aac gtg gec agt age gtg ata acc ctg ttc gga 2112 
Gly Arg Thr Val Val Asn Val Ala Ser Ser Val He Thr Leu Phe Gly 
690 695 700 

tea ate gtg age ggg ttc att aac ttt ata aag agt ccg ttc ggg ggc 2160 
Ser He Val Ser Gly Phe He Asn Phe He Lys Ser Pro Phe Gly Gly 
705 710 715 720 



2208 



eta aac egg cgc acc aac gee ate gec cag gec ccc ate agg atg ate 2256 
Leu Asn Arg Arg Thr Asn Ala He Ala Gin Ala Pro lie Arg Met He 
740 745 750 



2304 



61 



755 



760 



765 



gag cag att 
Glu Gin lie 
770 

gag cgt agg 
Glu Arg Arg 
785 

egg cgc gcg 
Arg Arg Ala 



ctg gaa aac 
Leu Glu Asn 



aaa aac att 
Lys Asn lie 



egg tta gac 
Arg Leu Asp 
790 

tea gac gga 
Ser Asp Gly 
805 

gaa gag get 
Glu Glu Ala 
820 



etc gec ggc 
Leu Ala Gly 
775 

gaa cag cag 
Glu Gin Gin 



eta aaa cgt 
Leu Lys Arg 



caa gag tat 
Gin Glu Tyr 
825 



atg cac cag 
Met His Gin 
780 

agg tea gcg 
Arg Ser Ala 
795 

cgc ttt agg 
Arg Phe Arg 
810 

gaa atg age 
Glu Met Ser 



eta cag cag 
Leu Gin Gin 



ccc teg ctt 
Pro Ser Leu 



gga tat aaa 
Gly Tyr Lys 
815 

aaa taa 
Lys 

830 



gaa 2352 
Glu 



ttc 2400 

Phe 

800 

ccg 2448 
Pro 



2490 



<210> 13 
<211> 829 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 



<400> 13 
Met Met lie Thr 
1 

lie lie Ala lie 
20 

Gly Ala Thr Thr 
35 

Pro Pro Glu Asn 
50 

Ser Ala Ser Ala 
65 

Cys Pro Gly Thr 

Phe Lys Lys Asn 
100 

Lys Val Ala Thr 
115 

Val Thr Gly Lys 
130 

Asn His Met Asp 
145 

Val Asn Gly He 

Thr Val Phe Leu 
180 

Tyr Phe Ser Gin 
195 

He Tyr Arg Val 
210 

Ala Arg Ser Ala 
225 

Thr Val Glu Val 

Ala Glu Lys Thr 
260 

Thr He Val Asp 
275 



Asn Arg Thr Arg 
5 

Gly Thr Ala Val 

Thr Ala Lys Pro 
40 

Pro Pro Arg Ala 
55 

Thr Gly Glu Leu 
70 

Glu Asp Lys Thr 
85 

He Val Pro His 

Ser Val Thr Val 
120 

Gin Glu Val He 
135 

Thr Thr Tyr Gin 
150 

Val Asn Thr Tyr 
165 

Gin Pro Val Glu 

Pro Val Leu Tyr 
200 

Arg Thr Thr Val 
215 

Glu Pro Tyr Ser 
230 

Ser Pro Phe Cys 
245 

Glu Asn Gly Leu 

Phe Ala Thr Arg 
280 



Arg Leu Leu Arg 
10 

Gly Glu Asn Val 
25 

Thr Pro Gly Pro 

Glu Ala Phe Lys 
60 

Phe Arg Phe Asn 
75 

His Gin Glu Gly 
90 

He Phe Lys Val 
105 

Tyr Arg Gly Trp 

Arg Pro Val Pro 
140 

Cys Phe Ser Ser 
155 

Thr Asp Arg Asp 
170 

Gly Leu Thr Asp 
185 

Thr Thr Pro Gly 

Asn Cys Glu He 
220 

Tyr Phe Val Thr 
235 

His Asn Asp Ser 
250 

Gly Ala Arg Val 
265 

Gin Pro Thr Thr 



62 



Ala Trp Val Val 
15 

Thr Thr Pro Lys 
30 

Ser Thr Pro Thr 
45 

Phe Arg Val Cys 

Leu Glu Lys Thr 
80 

He Leu Met Val 
95 

Arg Arg Tyr Arg 
110 

Thr Glu Thr Ala 
125 

Gin Tyr Glu He 

Met Arg Val Asn 
160 

Phe Thr Asn Gin 
175 

Asn He Gin Arg 
190 

Trp Phe Pro Gly 
205 

Val Asp Met He 

Ala Leu Gly Asp 
240 

Thr Cys Ser Val 
255 

Leu Thr Asn Tyr 
270 

Glu Thr Arg Val 
285 



Phe Ala Asp Ser 
290 

Lys Ser Ala Val 
305 

lie Gin Thr Thr 

Thr Ala Thr Phe 
340 

Thr Tyr Pro Cys 
355 

lie Lys Lys Leu 
370 

Tyr Glu Thr Glu 
385 

Leu Ser Leu Ala 

Pro Pro Thr Thr 
420 

Thr Asn Glu Gin 
435 

Ala Tyr Asp Lys 
450 

Ser Arg Ala Trp 
465 

Glu Leu Ser Lys 

Arg Pro Val Ser 
500 

Cys Val Ala Val 
515 

Thr Ser Thr Pro 
530 

Phe Leu Asn Ser 
545 

Glu lie lie Leu 

Glu His Tyr Phe 
580 

Val Phe Val Lys 
595 

Phe lie Ala Leu 
610 

lie Glu Leu Tyr 
625 

Asp lie Glu Thr 

Ala Gly Leu Arg 
660 

Arg Leu Ala Arg 
675 

Gly Arg Thr Val 
690 

Ser lie Val Ser 
705 

Met Leu Met lie 

Leu Asn Arg Arg 
740 

Tyr Pro Asp lie 
755 

Glu Gin He Lys 



Gly Glu Tyr Thr 
295 

Cys Ala Leu Thr 
310 

His Glu Ala Ser 
325 

Thr Ser Pro Leu 

Leu Asn Asp Val 
360 

Ser Asp Thr His 
375 

Gly Gly Leu Phe 
390 

Asp Glu Met Arg 
405 

Thr Ser Thr Ala 

Ala Thr Asp Asp 
440 

Leu Arg Ala Ser 
455 

Cys Arg Glu Gin 
470 

He Asn Pro Thr 
485 

Ala Lys Phe Val 

Asp Gin Ala Ser 
520 

Gly He Cys Tyr 
535 

Thr Thr Leu Phe 
550 

Thr Asp Asn Gin 
565 

He Ala Ser Asn 

Lys He Asn Thr 
600 

Asn Leu Ser Phe 
615 

Ser Arg Ala Glu 
630 

Met Phe Arg Glu 
645 

Glu Asp Leu Asp 

Asp Leu Ser Glu 
680 

Val Asn Val Ala 
695 

Gly Phe He Asn 
710 

Leu Val He Val 
725 

Thr Asn Ala He 

Asp Lys Met Gin 
760 

Asn He Leu Ala 



Val Ser Trp Lys 
300 

Leu Trp Lys Thr 
315 

Tyr His Phe Val 
330 

Ser Gin Val Thr 
345 

He Gin Lys Thr 

Ala Thr Asn Gly 
380 

Leu Leu Trp Gin 
395 

Glu Leu Asn Gly 
410 

Asn Arg Val Arg 
425 

Leu Ala Ala Pro 

He Asn Lys Val 
460 

Val Arg Asp Thr 
475 

Ser Val Met Thr 
490 

Gly Asp Ala He 
505 

Val Ser He His 

Ser Arg Pro Pro 
540 

Lys Gly Gin Leu 
555 

Val Glu Ala Cys 
570 

Val Thr Tyr Tyr 
585 

Ser Glu He Ser 

He Glu Asn He 
620 

Lys Lys Leu Ser 
635 

Tyr Asn Tyr Tyr 
650 

Asn Thr He Asp 
665 

He Val Ala Asp 

Ser Ser Val He 
700 

Phe He Lys Ser 
715 

Ala Val Val Leu 
730 

Ala Gin Ala Pro 
745 

Pro Ser Gly Gly 
Gly Met His Gin 



Ala Glu Asp Pro 

Phe Pro Arg Ala 
320 

Ala Asn Asp Val 
335 

Asn Phe Thr Gly 
350 

Leu Asn Ala Thr 
365 

Ser Glu Gin Tyr 

Pro Leu Thr Pro 
400 

Thr Thr Pro Ala 
415 

Arg Ser Val Gly 
430 

Gin Leu Gin Phe 
445 

Leu Glu Glu Leu 

Tyr Met Trp Tyr 
480 

Ala He Tyr Gly 
495 

Ser Val Thr Asp 
510 

Lys Ser Leu Arg 
525 

Val Thr Phe Arg 

Gly Pro Arg Asn 
560 

Lys Glu Thr Cys 

575 

Tyr Lys Asp Tyr 
590 

Thr Leu Gly Thr 
605 

Asp Phe Arg Val 

Gly Ser Val Phe 
640 

Thr Gin Arg Leu 
655 

Leu Asn Arg Asp 
670 

Leu Gly Asp Val 
685 

Thr Leu Phe Gly 

Pro Phe Gly Gly 
720 

He Val Phe Ala 

735 

He Arg Met He 
750 

Lys Val Asp Gin 
765 

Leu Gin Gin Glu 



u : 



770 775 780 

Glu Arg Arg Arg Leu Asp Glu Gin Gin Arg Ser Ala Pro Ser Leu Phe 
785 790 795 800 

Arg Arg Ala Ser Asp Gly Leu Lys Arg Arg Phe Arg Gly Tyr Lys Pro 

805 810 815 

Leu Glu Asn Glu Glu Ala Gin Glu Tyr Glu Met Ser Lys 
820 825 



<210> 14 
<211> 3045 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 



<220> 

<221> CDS 

<222> (1) . . (3045) 



<400> 14 

atg gat ttc ttt aac ccg tac ctg ggc cct cgc gga cca cgc ccc cac 48 

Met Asp Phe Phe Asn Pro Tyr Leu Gly Pro Arg Gly Pro Arg Pro His 

15 10 15 

tea cac aga ggc acc gat get ccc gec cct gec ggc gec gga gec gtt 96 

Ser His Arg Gly Thr Asp Ala Pro Ala Pro Ala Gly Ala Gly Ala Val 

20 25 30 



cag ccg cca cca gac gtt tgc agg etc ate ccc gec tgc etc cga acg 144 
Gin Pro Pro Pro Asp Val Cys Arg Leu lie Pro Ala Cys Leu Arg Thr 
35 40 . 45 



cca ggg gca ggc ggg atg ate ccg gtc acg ate ccg ttc ccg cca acg 192 
Pro Gly Ala Gly Gly Met lie Pro Val Thr lie Pro Phe Pro Pro Thr 
50 55 60 



tac ttc gag aac ggt get cgc gga gac gtg ctg etc g"cc aac gaa egg 240 
Tyr Phe Glu Asn Gly Ala Arg Gly Asp Val Leu Leu Ala Asn Glu Arg 
65 70 75 80 



tec atg tgg acg gcg cgc gac cgc aag ccc gtc gec ccg gac ccc caa 288 
Ser Met Trp Thr Ala Arg Asp Arg Lys Pro Val Ala Pro Asp Pro Gin 
85 90 95 



gac caa tec ate acg ttt cac gcg tac gac gtc gtt gaa aca acg tac 336 
Asp Gin Ser lie Thr Phe His Ala Tyr Asp Val Val Glu Thr Thr Tyr 
100 105 110 



gcg gcg gac agg tgt gec gag gta cct age cgc ttc caa acg gac att 384 
Ala Ala Asp Arg Cys Ala Glu Val Pro Ser Arg Phe Gin Thr Asp lie 
115 120 125 



ate cca age gga acc gtg etc aag etc ctg ggg cga acc gag gac ggc 432 
lie Pro Ser Gly Thr Val Leu Lys Leu Leu Gly Arg Thr Glu Asp Gly 
130 135 140 



acc age gtg tgc gtg aac gtg ttc cgt caa cag gta tat ttc tac gcg 480 
Thr Ser Val Cys Val Asn Val Phe Arg Gin Gin Val Tyr Phe Tyr Ala 
145 150 155 160 



aag gtt cca gec ggc att aac gtc acc cac ate etc cag cag gee etc 528 

64 



Lys Val Pro Ala Gly lie Asn Val Thr His lie Leu Gin Gin Ala Leu 
165 170 175 



aag aac aca gcc ggc cgt gcc gcg tgc ggc ttc teg acc aga aga gtg 
Lys Asn Thr Ala Gly Arg Ala Ala Cys Gly Phe Ser Thr Arg Arg Val 
180 185 190 



576 



aac aaa aga att etc aaa acg tac gac gtc gcg gag cat ccc gtc acg 
Asn Lys Arg He Leu Lys Thr Tyr Asp Val Ala Glu His Pro Val Thr 
195 200 205 



624 



gaa ate acg eta teg tec ggt tec atg etc teg acc etc age gac cgc 
Glu He Thr Leu Ser Ser Gly Ser Met Leu Ser Thr Leu Ser Asp Arg 
210 215 220 



672 



etc gtc gcg tgc ggg tgc gag gtg ttc gag tea aac gtg gac gcc gtt 
Leu Val Ala Cys Gly Cys Glu Val Phe Glu Ser Asn Val Asp Ala Val 
225 230 235 240 



720 



cgc egg ttc gtt ctg gat cac ggg ttt acc acg ttc ggg tgg tac teg 
Arg Arg Phe Val Leu Asp His Gly Phe Thr Thr Phe Gly Trp Tyr Ser 
245 250 255 



768 



tgc gcg cgc gcc acg ccc cgc ctg gcg gcc aga gat gcc agg acg gcc 
Cys Ala Arg Ala Thr Pro Arg Leu Ala Ala Arg Asp Ala Arg Thr Ala 
260 265 270 



816 



ctg gag ttt gac tgc age tgg gag gac etc age gtt caa gcg gac cgc 
Leu Glu Phe Asp Cys Ser Trp Glu Asp Leu Ser Val Gin Ala Asp Arg 
275 280 285 



864 



age gac tgg ccc ccg tac cgc ate gtg gcc ttt gat ate gag tgc act 
Ser Asp Trp Pro Pro Tyr Arg He Val Ala Phe Asp He Glu Cys Thr 
290 295 300 



912 



gga gag gcg gga ttt ccg tgc gcc acg cgc gac ggc gac gcg gtg ate 
Gly Glu Ala Gly Phe Pro Cys Ala Thr Arg Asp Gly Asp Ala Val He 
305 310 315 320 



960 



cag ate tec tgc gtc ttc tac acg acc agg gaa ggc gcg ccc aat ccg 
Gin He Ser Cys Val Phe Tyr Thr Thr Arg Glu Gly Ala Pro Asn Pro 
325 330 335 



1008 



cca aac ata ctg ttc age gtc ggg acg tgc gac ccc ate ccg gac acc 
Pro Asn He Leu Phe Ser Val Gly Thr Cys Asp Pro He Pro Asp Thr 
340 345 350 



1056 



gac gtt ttg gag ttt ccg teg gaa tat gac atg ctg gtg teg ttc ttc 
Asp Val Leu Glu Phe Pro Ser Glu Tyr Asp Met Leu Val Ser Phe Phe 
355 360 365 



1104 



gcc atg ate cgc gac ttc gag gtg gac ttt tta acc ggc tat aac ate 
Ala Met He Arg Asp Phe Glu Val Asp Phe Leu Thr Gly Tyr Asn He 
370 375 380 



1152 



tea aac ttc gat etc ccg tac eta ate acg cga gcg tec cag gtg tac 
Ser Asn Phe Asp Leu Pro Tyr Leu He Thr Arg Ala Ser Gin Val Tyr 
385 390 395 400 



1200 



aac ctt cga tta aac gaa tac aca aaa ata aaa acc ggc tec ate ttt 
Asn Leu Arg Leu Asn Glu Tyr Thr Lys He Lys Thr Gly Ser He Phe 



1248 



65 



405 410 415 

gaa gtt cac gag ccc cgt ggc ggg gga ggg ggg ttc atg agg teg gtc 1296 

Glu Val His Glu Pro Arg Gly Gly Gly Gly Gly Phe Met Arg Ser Val 

420 425 430 

tea aaa att aaa ata gcg ggc ate gtc ccc ata gac atg tac cag gtg 1344 

Ser Lys lie Lys lie Ala Gly He Val Pro He Asp Met Tyr Gin Val 

435 440 445 

tgt cgc gaa aag etc age etc tec gac tac aaa ctg gac acg gtg gec 1392 

Cys Arg Glu Lys Leu Ser Leu Ser Asp Tyr Lys Leu Asp Thr Val Ala 

450 455 460 

agg cag tgt ctg ggt ggg aaa aaa gag gac gta teg tac aag gac att 144 0 

Arg Gin Cys Leu Gly Gly Lys Lys Glu Asp Val Ser Tyr Lys Asp He 

465 470 475 480 

ccc cct ctg ttt cgc tea ggt ccg ggc ggc agg get aag gtg ggc age 1488 

Pro Pro Leu Phe Arg Ser Gly Pro Gly Gly Arg Ala Lys Val Gly Ser 

485 490 495 

tat tgc gtg atg gac teg gtc ctg gtg atg gac etc tta aaa atg ttt 1536 

Tyr Cys Val Met Asp Ser Val Leu Val Met Asp Leu Leu Lys Met Phe 

500 505 510 

atg ata cac gtg gag att teg gag ata gee aag ctg gec aag att cag 1584 

Met He His Val Glu He Ser Glu He Ala Lys Leu Ala Lys He Gin 

515 520 525 

gee agg cgc gtc ctg acg gac ggc caa cag etc cgc gtg ttc tec tgc 1632 

Ala Arg Arg Val Leu Thr Asp Gly Gin Gin Leu Arg Val Phe Ser Cys 

530 535 540 

ctg etg gag gec gcg gec agg gag aac ttt ate etc ccg gtt cca acg 1680 

Leu Leu Glu Ala Ala Ala Arg Glu Asn Phe He Leu Pro Val Pro Thr 

545 550 555 560 

ccc gag gga cag ggg ggc tat cag ggc gcg acg gtg ate aac ccc att 1728 

Pro Glu Gly Gin Gly Gly Tyr Gin Gly Ala Thr Val He Asn Pro He 

565 570 575 

ccg ggg ttt tac gac gag ccg gtc ctg gtg gtc gat ttt gec age ctg 1776 

Pro Gly Phe Tyr Asp Glu Pro Val Leu Val Val Asp Phe Ala Ser Leu 

580 585 590 



tac ccg age ate ate cag gcg cac aac ctg tgc tac tec ace atg ata 

Tyr Pro Ser He He Gin Ala His Asn Leu Cys Tyr Ser Thr Met lie 

595 600 605 

cac gga cga gac ctg cac ctg cac ccc aac ctg acg ccg gac gac tac 

His Gly Arg Asp Leu His Leu His Pro Asn Leu Thr Pro Asp Asp Tyr 

610 615 620 

gag acg ttc gtg ctg age ggc gga ccg gta cat ttt gta aaa aaa cac 

Glu Thr Phe Val Leu Ser Gly Gly Pro Val His Phe Val Lys Lys His 

625 630 635 640 

aag egg gag tct ctg ctg gga aga ctg eta acc gtg tgg tta gaa aag 

Lys Arg Glu Ser Leu Leu Gly Arg Leu Leu Thr Val Trp Leu Glu Lys 
645 650 655 

66 



1824 



1872 



1920 



1968 



iL HJ IL..I1 .m%S 



cga agg gcg ate egg cgc acc ctg gcg gcg tgc gat gac ccg teg eta 2016 
Arg Arg Ala He Arg Arg Thr Leu Ala Ala Cys Asp Asp Pro Ser Leu 
660 665 670 

aaa acc ate tta gat aaa caa cag ctg gee ate aag gtg aca tgt aac 2064 
Lys Thr He Leu Asp Lys Gin Gin Leu Ala He Lys Val Thr Cys Asn 
675 680 685 

gcg gtt tac ggg ttc acc ggg gtg gee age ggc etc etc cca tgc att 2112 
Ala Val Tyr Gly Phe Thr Gly Val Ala Ser Gly Leu Leu Pro Cys He 
690 695 700 

aac ata gcg gaa acc gtg acg etc egg ggg cgc acg atg ctg gag atg 2160 
Asn He Ala Glu Thr Val Thr Leu Arg Gly Arg Thr Met Leu Glu Met 
705 710 715 720 

tea aag tct tac gtg gag gee ctg acg acg gaa gac ctg cga acg cgt 2208 
Ser Lys Ser Tyr Val Glu Ala Leu Thr Thr Glu Asp Leu Arg Thr Arg 
725 730 735 

etc ggt cgc gag gtg acc gee cgt cac ggc gcg egg ttt cgc gtc gtc 2256 
Leu Gly Arg Glu Val Thr Ala Arg His Gly Ala Arg Phe Arg Val Val 
740 745 750 



tac ggt gac acc gac tec etc ttt ate gcg tgc gac ggt tat tec gcg 
Tyr Gly Asp Thr Asp Ser Leu Phe He Ala Cys Asp Gly Tyr Ser Ala 
755 760 765 



gac ctg ttc ccc cca ccc att aag eta gag gcg gaa aag acg ttc aag 
Asp Leu Phe Pro Pro Pro He Lys Leu Glu Ala Glu Lys Thr Phe Lys 
785 790 795 800 



gag gtc etc aac gcg age tat ctg gac etc cga aac age gtc gtg ccc 
Glu Val Leu Asn Ala Ser Tyr Leu Asp Leu Arg Asn Ser Val Val Pro 
885 890 895 



2304 



gaa gee gtt tec get ttc tgt gac gat ctg gee gee agg ate act gcg 2352 
Glu Ala Val Ser Ala Phe Cys Asp Asp Leu Ala Ala Arg He Thr Ala 
770 775 780 



2400 



tgt ctg ctg ctg ctg acg aaa aag cgc tac ate ggg gtc eta ttg aac 2448 

Cys Leu Leu Leu Leu Thr Lys Lys Arg Tyr He Gly Val Leu Leu Asn 

805 . 810 815 

gac aaa atg gtc atg aaa ggg gtc gac etc att cgc aaa acg gee tgc 2496 

Asp Lys Met Val Met Lys Gly Val Asp Leu He Arg Lys Thr Ala Cys 
820 825 830 

aag ttt gtc cag gag cga tgc cgc gee ate ctg gac ctg gtg etc cac 2544 

Lys Phe Val Gin Glu Arg Cys Arg Ala He Leu Asp Leu Val Leu His 
835 840 845 

gat ccg gag gtc aag get gcg gcg egg ctg ttg tgc aag egg ccg ccg 2 5 92 

Asp Pro Glu Val Lys Ala Ala Ala Arg Leu Leu Cys Lys Arg Pro Pro 
850 855 860 

cac gcg gta tac gag gag ggg ctg ccg get ggc ttt ata aaa ate gta 2640 

His Ala Val Tyr Glu Glu Gly Leu Pro Ala Gly Phe He Lys He Val 

865 870 875 880 



2688 



67 



ate gag cag tta acg ttc tec acc gag etc age cgc ccc gtc tgc gat 2736 

lie Glu Gin Leu Thr Phe Ser Thr Glu Leu Ser Arg Pro Val Cys Asp 

900 905 910 

tac aag acc acc aac ctg ccc cac ctg gcg gtg tac caa aag ctg gcg 2784 

Tyr Lys Thr Thr Asn Leu Pro His Leu Ala Val Tyr Gin Lys Leu Ala 

915 920 925 



age agg tgc gag gag ctg ccc cag gtg cac gat aga ate ccc tac gtg 

Ser Arg Cys Glu Glu Leu Pro Gin Val His Asp Arg He Pro Tyr Val 

930 935 940 

ttc gtt gac gcg ccc ggg tec eta aag teg gac ctg gee gaa cac ccg 

Phe Val Asp Ala Pro Gly Ser Leu Lys Ser Asp Leu Ala Glu His Pro 

945 950 955 960 

gat tac gtc aga cag cac cag att ccc gtc gcg gtc gac eta tat ttc 

Asp Tyr Val Arg Gin His Gin He Pro Val Ala Val Asp Leu Tyr Phe 



965 



970 975 



gac aaa ctg gtg cac ggc gcg gee aac ate etc cag tgt ctg ttc ggc 

Asp Lys Leu Val His Gly Ala Ala Asn He Leu Gin Cys Leu Phe Gly 
980 985 990 

aac aac gcg gac acc acg gtg gee ate etc tac aat ttt etc aac gtc 

Asn Asn Ala Asp Thr Thr Val Ala He Leu Tyr Asn Phe Leu Asn Val 
995 1000 1005 

ccg tat aag ctg ttc teg tga 

Pro Tyr Lys Leu Phe Ser 

1010 1015 



2832 



2880 



2928 



2976 



3024 



3045 



<210> 15 
<211> 1014 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 



<400> 15 
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Pro 


Arg 


Pro 


1 
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10 










15 




Ser 


His 


Arg 


Gly 


Thr 


Asp 


Ala 
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Ala 


Pro 


Ala 


Gly 
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Gly 
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Val 






20 










25 










30 






Gin 


Pro 


Pro 


Pro 


Asp 


Val 


Cys 


Arg 


Leu 


He 


Pro 


Ala 


Cys 


Leu 


Arg 


Thr 






35 










40 










45 








Pro 


Gly 


Ala 


Gly 


Gly 


Met 


He 


Pro 


val 


Thr 


He 


Pro 


Phe 


Pro 


Pro 


Thr 




50 










55 










60 










Tyr 


Phe 


Glu 


Asn 


Gly 


Ala 


Arg 


Gly 


Asp 


Val 


Leu 


Leu 


Ala 


Asn 


Glu 


Arg 


65 










70 










75 










80 


Ser 
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Trp 


Thr 


Ala 


Arg 


Asp 


Arg 


Lys 


Pro 


Val 


Ala 


Pro 


Asp 


Pro 


Gin 








85 










90 










95 




Asp 


Gin 


Ser 


He 


Thr 


Phe 


His 


Ala 


Tyr 


Asp 


Val 


Val 


Glu 


Thr 


Thr 


Tyr 






100 










105 










110 






Ala 


Ala 


Asp 


Arg 


Cys 


Ala 


Glu 


Val 


Pro 


Ser 


Arg 


Phe 


Gin 


Thr 


Asp 


He 






115 










120 










125 








He 


Pro 


Ser 


Gly 


Thr 


Val 


Leu 


Lys 


Leu 


Leu 


Gly 


Arg 


Thr 


Glu 


Asp 


Gly 




130 










135 










140 










Thr 


Ser 


Val 


Cys 


Val 


Asn 


Val 


Phe 


Arg 


Gin 


Gin 


Val 


Tyr 


Phe 


Tyr 


Ala 


145 








150 










155 










160 


Lys 


Val 


Pro 


Ala 


Gly 


He 


Asn 


Val 


Thr 


His 


He 


Leu 


Gin 


Gin 


Ala 


Leu 








165 










170 










175 





68 



Lys Asn Thr Ala Gly Arg Ala Ala Cys Gly Phe Ser Thr Arg Arg Val 

180 185 190 

Asn Lys Arg lie Leu Lys Thr Tyr Asp Val Ala Glu His Pro Val Thr 

195 200 205 

Glu lie Thr Leu Ser Ser Gly Ser Met Leu Ser Thr Leu Ser Asp Arg 

210 215 220 

Leu Val Ala Cys Gly Cys Glu Val Phe Glu Ser Asn Val Asp Ala Val 
225 230 235 240 

Arg Arg Phe Val Leu Asp His Gly Phe Thr Thr Phe Gly Trp Tyr Ser 

245 250 255 

Cys Ala Arg Ala Thr Pro Arg Leu Ala Ala Arg Asp Ala Arg Thr Ala 

260 265 270 

Leu Glu Phe Asp Cys Ser Trp Glu Asp Leu Ser Val Gin Ala Asp Arg 

275 280 285 

Ser Asp Trp Pro Pro Tyr Arg lie Val Ala Phe Asp He Glu Cys Thr 

290 295 300 

Gly Glu Ala Gly Phe Pro Cys Ala Thr Arg Asp Gly Asp Ala Val He 
305 310 315 320 

Gin He Ser Cys Val Phe Tyr Thr Thr Arg Glu Gly Ala Pro Asn Pro 

325 330 335 

Pro Asn He Leu Phe Ser Val Gly Thr Cys Asp Pro He Pro Asp Thr 

340 345 350 

Asp Val Leu Glu Phe Pro Ser Glu Tyr Asp Met Leu Val Ser Phe Phe 

355 360 365 

Ala Met He Arg Asp Phe Glu Val Asp Phe Leu Thr Gly Tyr Asn He 

370 375 380 

Ser Asn Phe Asp Leu Pro Tyr Leu He Thr Arg Ala Ser Gin Val Tyr 
385 390 395 400 

Asn Leu Arg Leu Asn Glu Tyr Thr Lys He Lys Thr Gly Ser He Phe 

405 410 415 

Glu Val His Glu Pro Arg Gly Gly Gly Gly Gly Phe Met Arg Ser Val 

420 425 430 

Ser Lys He Lys He Ala Gly He Val Pro He Asp Met Tyr Gin Val 

435 440 445 

Cys Arg Glu Lys Leu Ser Leu Ser Asp Tyr Lys Leu Asp Thr Val Ala 

450 455 460 

Arg Gin Cys Leu Gly Gly Lys Lys Glu Asp Val Ser Tyr Lys Asp He 
465 470 475 480 

Pro Pro Leu Phe Arg Ser Gly Pro Gly Gly Arg Ala Lys Val Gly Ser 

485 490 495 

Tyr Cys Val Met Asp Ser Val Leu Val Met Asp Leu Leu Lys Met Phe 

500 505 510 

Met He His Val Glu He Ser Glu He Ala Lys Leu Ala Lys He Gin 

515 520 525 

Ala Arg Arg Val Leu Thr Asp Gly Gin Gin Leu Arg Val Phe Ser Cys 

530 535 540 

Leu Leu Glu Ala Ala Ala Arg Glu Asn Phe He Leu Pro Val Pro Thr 
545 550 555 560 

Pro Glu Gly Gin Gly Gly Tyr Gin Gly Ala Thr Val He Asn Pro He 

565 570 575 

Pro Gly Phe Tyr Asp Glu Pro Val Leu Val Val Asp Phe Ala Ser Leu 

580 585 590 

Tyr Pro Ser He He Gin Ala His Asn Leu Cys Tyr Ser Thr Met He 

595 600 605 

His Gly Arg Asp Leu His Leu His Pro Asn Leu Thr Pro Asp Asp Tyr 

610 615 620 

Glu Thr Phe Val Leu Ser Gly Gly Pro Val His Phe Val Lys Lys His 
6 25 630 635 640 

Lys Arg Glu Ser Leu Leu Gly Arg Leu Leu Thr Val Trp Leu Glu Lys 

645 650 655 

Arg Arg Ala He Arg Arg Thr Leu Ala Ala Cys Asp Asp Pro Ser Leu 

69 



660 665 670 

Lys Thr lie Leu Asp Lys Gin Gin Leu Ala lie Lys Val Thr Cys Asn 

675 680 685 

Ala Val Tyr Gly Phe Thr Gly Val Ala Ser Gly Leu Leu Pro Cys lie 

690 695 700 

Asn lie Ala Glu Thr Val Thr Leu Arg Gly Arg Thr Met Leu Glu Met 
705 710 715 720 

Ser Lys Ser Tyr Val Glu Ala Leu Thr Thr Glu Asp Leu Arg Thr Arg 

725 730 735 

Leu Gly Arg Glu Val Thr Ala Arg His Gly Ala Arg Phe Arg Val Val 

740 745 750 

Tyr Gly Asp Thr Asp Ser Leu Phe lie Ala Cys Asp Gly Tyr Ser Ala 

755 760 765 

Glu Ala Val Ser Ala Phe Cys Asp Asp Leu Ala Ala Arg He Thr Ala 

770 775 780 

Asp Leu Phe Pro Pro Pro He Lys Leu Glu Ala Glu Lys Thr Phe Lys 
785 790 795 800 

Cys Leu Leu Leu Leu Thr Lys Lys Arg Tyr He Gly Val Leu Leu Asn 

805 810 815 

Asp Lys Met Val Met Lys Gly Val Asp Leu He Arg Lys Thr Ala Cys 

820 825 830 

Lys Phe Val Gin Glu Arg Cys Arg Ala He Leu Asp Leu Val Leu His 

835 840 845 

Asp Pro Glu Val Lys Ala Ala Ala Arg Leu Leu Cys Lys Arg Pro Pro 

850 855 860 

His Ala Val Tyr Glu Glu Gly Leu Pro Ala Gly Phe lie Lys lie Val 
865 870 875 880 

Glu Val Leu Asn Ala Ser Tyr Leu Asp Leu Arg Asn Ser Val Val Pro 

885 890 895 

He Glu Gin Leu Thr Phe Ser Thr Glu Leu Ser Arg Pro Val Cys Asp 

900 905 910 

Tyr Lys Thr Thr Asn Leu Pro His Leu Ala Val Tyr Gin Lys Leu Ala 

915 920 925 

Ser Arg Cys Glu Glu Leu Pro Gin Val His Asp Arg He Pro Tyr Val 

930 935 940 

Phe Val Asp Ala Pro Gly Ser Leu Lys Ser Asp Leu Ala Glu His Pro 
945 950 955 960 

Asp Tyr Val Arg Gin His Gin He Pro Val Ala Val Asp Leu Tyr Phe 

965 970 975 

Asp Lys Leu Val His Gly Ala Ala Asn He Leu Gin Cys Leu Phe Gly 

980 985 990 

Asn Asn Ala Asp Thr Thr Val Ala He Leu Tyr Asn Phe Leu Asn Val 

995 1000 1005 

Pro Tyr Lys Leu Phe Ser 
1010 



<210> 16 
<211> 1251 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (1251) 

<400> 16 

atg ctg gtt aac gaa ctg teg gtg gtc etc ggc gac tgg gag gtg act 

Met Leu Val Asn Glu Leu Ser Val Val Leu Gly Asp Trp Glu Val Thr 

1 5 10 15 



70 



ttt cac egg ggt aga ttc age ttc gtc aac etc acc cgc ctg caa acg 96 

Phe His Arg Gly Arg Phe Ser Phe Val Asn Leu Thr Arg Leu Gin Thr 

20 25 30 

ttc aag ggc cac ggg ggc tac gec agg gtc cga etc ccc ttc teg etc 144 

Phe Lys Gly His Gly Gly Tyr Ala Arg Val Arg Leu Pro Phe Ser Leu 

35 40 45 

gac cag tta etc cac caa cat ttc gcg ttc gga etc gtg acg cgt etc 192 

Asp Gin Leu Leu His Gin His Phe Ala Phe Gly Leu Val Thr Arg Leu 

50 55 60 

aag gaa ctg ccc ccc ttc tec gac tgc gtg gee ctt ate gee ccg ttg 240 

Lys Glu Leu Pro Pro Phe Ser Asp Cys Val Ala Leu He Ala Pro Leu 

65 70 75 80 

gat tec ggc ggc gac gcg gac gcg gcg cgc gtg gec ccc ggg ttc gtg 288 

Asp Ser Gly Gly Asp Ala Asp Ala Ala Arg Val Ala Pro Gly Phe Val 

85 90 95 

ctg gac tec tct cgc ccg ctg acc gtg tgg gta aac gcg age ggg egg 336 

Leu Asp Ser Ser Arg Pro Leu Thr Val Trp Val Asn Ala Ser Gly Arg 

100 105 HO 



cac acg ate egg ttc tgc etc etc ttt eta aag ccg ate gac ctg gag 
His Thr He Arg Phe Cys Leu Leu Phe Leu Lys Pro He Asp Leu Glu 
115 120 125 



ggc acc cca aag ccc acc tgc gcg acc gaa age ctg ccc ggt gga ccc 

Gly Thr Pro Lys Pro Thr Cys Ala Thr Glu Ser Leu Pro Gly Gly Pro 

145 150 155 160 

ctg cgc gtc tec ggc gag gcg tct cag acg teg ccc cat tct ttc gtt 

Leu Arg Val Ser Gly Glu Ala Ser Gin Thr Ser Pro His Ser Phe Val 

165 170 175 



tta cag gtg agg ccg ttt teg gat gac gcg gcg cac agg gac gcg egg 

Leu Gin Val Arg Pro Phe Ser Asp Asp Ala Ala His Arg Asp Ala Arg 

195 200 205 

ate tec ccg aaa tac gtc acg ttt agt aac tec ggg ggt aac gtc tgc 

He Ser Pro Lys Tyr Val Thr Phe Ser Asn Ser Gly Gly Asn Val Cys 

210 215 220 

aag gcg tec gtt cac acg ctg tec ccg teg egg tgt aaa acg gcg caa 

Lys Ala Ser Val His Thr Leu Ser Pro Ser Arg Cys Lys Thr Ala Gin 

225 230 235 240 

atg gaa ate ate tac get ccc ggg gac ccc aac gec gag ata gtc ctg 

Met Glu He He Tyr Ala Pro Gly Asp Pro Asn Ala Glu He Val Leu 
245 250 255 



384. 



cgc gcg gtc acg tac gtc ttc ggc gag aac ggc ggc gcg cgc teg gag 432 
Arg Ala Val Thr Tyr Val Phe Gly Glu Asn Gly Gly Ala Arg Ser Glu 
130 135 140 



480 



528 



gcg tat ttt ccc acg gee aac teg gtg gec tgc eta age ctg ttg egg 576 
Ala Tyr Phe Pro Thr Ala Asn Ser Val Ala Cys Leu Ser Leu Leu Arg 
180 185 190 



624 



672 



720 



768 



71 



ggc cag tec gga ccc gtc ctg ccc acc cac acc ggc ggc cgc gta ttg 816 

Gly Gin Ser Gly Pro Val Leu Pro Thr His Thr Gly Gly Arg Val Leu 
260 265 270 

ggg 9 tc tac 9 CC 9 ac 9 CC 9 aa aaa acc atc caa cct gga agc tcc gcg 864 

Gly Val Tyr Ala Asp Ala Glu Lys Thr lie Gin Pro Gly Ser Ser Ala 

275 280 285 

gaa gtc egg gtt cag tta atc ttc caa cag gga gcg gec get egg ggc 912 

Glu Val Arg Val Gin Leu lie Phe Gin Gin Gly Ala Ala Ala Arg Gly 

290 295 300 



gat ctg gcg ttt ctg gtc acg ggc gtg gca ccg gag ccc eta ttc gtc 

Asp Leu Ala Phe Leu Val Thr Gly Val Ala Pro Glu Pro Leu Phe Val 

305 310 315 320 

gtc acc ccg gca etc ttg ctt tcc ggt tgc aca acc cac ctg cgc eta 

Val Thr Pro Ala Leu Leu Leu Ser Gly Cys Thr Thr His Leu Arg Leu 

325 330 335 



aac gcg ttc aca atc ccg gtc ggt ttc cca ggg gtg gtc teg gcg gag 
Asn Ala Phe Thr lie Pro Val Gly Phe Pro Gly Val Val Ser Ala Glu 
385 390 395 400 



960 



1008 



ttc aac ccc aac ggt acc ccc acg act ata aaa aga gac acc ctt gtg 1056 

Phe Asn Pro Asn Gly Thr Pro Thr Thr He Lys Arg Asp Thr Leu Val 
340 345 350 

gee gee gec gcg ccc tgc ccc gtg gtg cga tta age tcc gec gac gac 1104 

Ala Ala Ala Ala Pro Cys Pro Val Val Arg Leu Ser Ser Ala Asp Asp 

355 360 365 

gcg ccg cga gac etc gtc gcg tea cca gac acc ggg gcg etc tcc att 1152 

Ala Pro Arg Asp Leu Val Ala Ser Pro Asp Thr Gly Ala Leu Ser He 

370 375 380 



1200 



tgt cac gtg tcc eta cgc gac aac ggg gtc cac gaa cgc atg aac cat 124 8 
Cys His Val Ser Leu Arg Asp Asn Gly Val His Glu Arg Met Asn His 
405 410 415 



tga 



1251 



<210> 17 
<211> 416 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 17 

Met Leu Val Asn Glu Leu Ser Val Val Leu Gly Asp Trp Glu Val Thr 

15 10 15 

Phe His Arg Gly Arg Phe Ser Phe Val Asn Leu Thr Arg Leu Gin Thr 

20 25 30 

Phe Lys Gly His Gly Gly Tyr Ala Arg Val Arg Leu Pro Phe Ser Leu 

35 40 45 

Asp Gin Leu Leu His Gin His Phe Ala Phe Gly Leu Val Thr Arg Leu 

50 55 60 

Lys Glu Leu Pro Pro Phe Ser Asp Cys Val Ala Leu He Ala Pro Leu 
65 70 75 80 

Asp Ser Gly Gly Asp Ala Asp Ala Ala Arg Val Ala Pro Gly Phe Val 

72 
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Pro 
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Ser 
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Ser 
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Asp 
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Arg 
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Arg 
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205 








He 


Ser 


Pro 


Lys 


Tyr 


Val 


Thr 


Phe 


Ser 
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215 
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Ala 


Ser 


Val 


His 


Thr 


Leu 


Ser 


Pro 


Ser 
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Lys 


Thr 


Ala 


Gin 


225 










230 
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240 


Met 


Glu 


He 


lie 
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Ala 


Pro 


Gly 


Asp 


Pro 


Asn 


Ala 


Glu 


He 


Val 


Leu 










245 










250 










255 




Gly 


Gin 


Ser 


Gly 


Pro 


Val 


Leu 


Pro 


Thr 


His 


Thr 


Gly 


Gly 


Arg 


Val 


Leu 






260 










265 










270 






Gly 


Val 


Tyr 


Ala 


Asp 


Ala 


Glu 


Lys 


Thr 


He 


Gin 


Pro 


Gly 


Ser 


Ser 


Ala 






275 










280 










285 








Glu 


Val 


Arg 


Val 


Gin 


Leu 


He 


Phe 


Gin 


Gin 


Gly 


Ala 


Ala 


Ala 


Arg 


Gly 




290 










295 










300 










Asp 


Leu 
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Phe 


Leu 


Val 


Thr 


Gly 


Val 


Ala 


Pro 


Glu 


Pro 


Leu 


Phe 


Val 


305 
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Thr 


Pro 
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Leu 


Leu 


Leu 


Ser 
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Thr 


Thr 


His 


Leu 
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Leu 
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Asn 


Gly 


Thr 


Pro 


Thr 


Thr 


He 


Lys 
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Thr 


Leu 


Val 
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Asp 






355 
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Ser 


He 
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375 
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Gly 


Val 
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Glu 


385 
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<210> 18 

<211> 1230 

<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . - (1230) 

<400> 18 

atg gga aca ccg gtg cgt ttc ttt cgc ggc gag tgg cag acc teg agt 

Met Gly Thr Pro Val Arg Phe Phe Arg Gly Glu Trp Gin Thr Ser Ser 
1 5 10 15 

eta gtg gac aac ggc acg cca egg tac age tec ctg gtg tgg gec gec 
Leu Val Asp Asn Gly Thr Pro Arg Tyr Ser Ser Leu Val Trp Ala Ala 
20 25 30 



73 



act att cac gac ggc tac ctg aca ctg gtg aac agg tea gag ctg tgc 144 
Thr He His Asp Gly Tyr Leu Thr Leu Val Asn Arg Ser Glu Leu Cys 
35 40 45 



gtc acg gag agg tct ccg tgt ctg ccg gca tgc ccc age ate ggg aga 
Val Thr Glu Arg Ser Pro Cys Leu Pro Ala Cys Pro Ser lie Gly Arg 



50 



55 60 



ctg gtc ggg aag agg ttt ccc ggc ttc gee ttt gec age gec act ctg 
Leu Val Gly Lys Arg Phe Pro Gly Phe Ala Phe Ala Ser Ala Thr Leu 



65 



70 75 80 



ggc gat egg gga aca cgc acc gtg ttc tac gcg ttc ggt cac cgc gac 
Gly Asp Arg Gly Thr Arg Thr Val Phe Tyr Ala Phe Gly His Arg Asp 



85 90 95 



aac cca ctg gac ata gta ccc gec gtg gtc gag cgc gcg gat cgc gag 

Asn Pro Leu Asp He Val Pro Ala Val Val Glu Arg Ala Asp Arg Glu 

100 105 HO 

ctg gtg ctg egg gtt cac get ccg caa aca acg egg gtg teg cga tac 

Leu Val Leu Arg Val His Ala Pro Gin Thr Thr Arg Val Ser Arg Tyr 



145 



gtc acc gac ggc ccc gtg acg gcg acc ctg tec etc acc ggt aat gcg 

Val Thr Asp Gly Pro Val Thr Ala Thr Leu Ser Leu Thr Gly Asn Ala 
245 250 255 

cca ate gec ttt cgc cac aac cca tac ttt gaa etc ccg tgg teg tec 

Pro He Ala Phe Arg His Asn Pro Tyr Phe Glu Leu Pro Trp Ser Ser 

260 265 270 



192 



240 



288 



336 



384 



115 120 125 

ctt aag gta ttc gtg gcg ate gtt acg gtg gtg cgc ccg ccc ggg 432 



480 



Gly Leu Lys Val Phe Val Ala He Val Thr Val Val Arg Pro Pro Gly 
130 135 140 

gtg ttc eta cac ttt cca caa gac cgc gtt ccg ate gcg ctg aca gac 

Val Phe Leu His Phe Pro Gin Asp Arg Val Pro He Ala Leu Thr Asp 

150 155 160 

gcg tgc age cag gag ggc tec agg eta acc tct gaa gag ccg tgg ata 528 

Ala Cys Ser Gin Glu Gly Ser Arg Leu Thr Ser Glu Glu Pro Trp He 
165 170 175 

aaa att caa ggc ttt ccc gtc eta tct gac gag acc gcg cac cca ttt 576 

Lys He Gin Gly Phe Pro Val Leu Ser Asp Glu Thr Ala His Pro Phe 

180 185 190 

etc tta acc cag aag acc aag ccc ttt acc gag cga aag ttt tgc cgc 624 

Leu Leu Thr Gin Lys Thr Lys Pro Phe Thr Glu Arg Lys Phe Cys Arg 
195 200 205 

ctg ate atg gac aac gac cag cgc age gec gtc aac acc gtc tac ctg 672 

Leu He Met Asp Asn Asp Gin Arg Ser Ala Val Asn Thr Val Tyr Leu 
210 215 220 

gga aag cag cac gtg agg gtg acc gtg acc cgc ccc ccg gaa aca ate 720 

Gly Lys Gin His Val Arg Val Thr Val Thr Arg Pro Pro Glu Thr He 

225 230 235 240 



768 



816 



74 



aca acg gcg ata ttc acg ccc gtg gtg tac gtg ggc ctg acc gtg tgc 864 
Thr Thr Ala He Phe Thr Pro Val Val Tyr Val Gly Leu Thr Val Cys 
275 280 285 

ate cca ccc aac tgt age aaa ttc gta agg tac ggt aac acc tac gtc 912 
He Pro Pro Asn Cys Ser Lys Phe Val Arg Tyr Gly Asn Thr Tyr Val 
290 295 300 

teg gca ttt aac cgc aag ctg acg gcg att att age aat cac gec cac 960 
Ser Ala Phe Asn Arg Lys Leu Thr Ala lie He Ser Asn His Ala His 
305 310 315 320 

aac ggc ggg ttc egg att cag gac tgc gag tgg cca ccg aac egg gag 1008 
Asn Gly Gly Phe Arg He Gin Asp Cys Glu Trp Pro Pro Asn Arg Glu 
325 330 335 

ata gag att ttg gta acc aac gtg tec cag gec ccg gtg tac ate age 1056 
He Glu He Leu Val Thr Asn Val Ser Gin Ala Pro Val Tyr He Ser 
340 345 350 

acc ggg acg cag ctg ggg caa gec ate ttc gtg ttc gcg ccg egg ttc 1104 
Thr Gly Thr Gin Leu Gly Gin Ala He Phe Val Phe Ala Pro Arg Phe 
355 360 365 

ggt ggc ccg gcg aaa ctg egg cag etc etc ggc cac cga teg cgc gec 1152 
Gly Gly Pro Ala Lys Leu Arg Gin Leu Leu Gly His Arg Ser Arg Ala 
370 375 ' 380 



ctg gag ctg ccg ggc ggg gtg aca gtg gac age caa aaa ctg tgt agg 
Leu Glu Leu Pro Gly Gly Val Thr Val Asp Ser Gin Lys Leu Cys Arg 
385 390 395 400 

ttt gag acc atg tac ctg ttt tec acg taa 
Phe Glu Thr Met Tyr Leu Phe Ser Thr 

405 410 



1200 



1230 



<210> 19 
<211> 409 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 



<400> 19 



Met 


Gly 


Thr 


Pro 


Val 


Arg 


Phe 


Phe 


Arg 


Gly 


Glu 


Trp 


Gin 


Thr 


Ser 


Ser 


1 






5 










10 










15 




Leu 


Val 


Asp 


Asn 


Gly 


Thr 


Pro 


Arg 


Tyr 


Ser 


Ser 


Leu 


Val 


Trp 


Ala 


Ala 








20 










25 










30 






Thr 


He 


His 


Asp 


Gly 


Tyr 


Leu 


Thr 


Leu 


Val 


Asn 


Arg 


Ser 


Glu 


Leu 


Cys 






35 










40 










45 








Val 


Thr 


Glu 


Arg 


Ser 


Pro 


Cys 


Leu 


Pro 


Ala 


Cys 


Pro 


Ser 


He 


Gly 


Arg 




50 










55 










60 










Leu 


Val 


Gly 


Lys 


Arg 


Phe 


Pro 


Gly 


Phe 


Ala 


Phe 


Ala 


Ser 


Ala 


Thr 


Leu 


65 










70 










75 










80 


Gly 


Asp 


Arg 


Gly 


Thr 


Arg 


Thr 


Val 


Phe 


Tyr 


Ala 


Phe 


Gly 


His 


Arg 


Asp 






85 










90 










95 




Asn 


Pro 


Leu 


Asp 


He 


Val 


Pro 


Ala 


Val 


Val 


Glu 


Arg 


Ala 


Asp 


Arg 


Glu 








100 










105. 










110 






Leu 


Val 


Leu 


Arg 


Val 


His 


Ala 


Pro 


Gin 


Thr 


Thr 


Arg 


Val- 


Ser 


Arg 


Tyr 






115 










120 










125 








Gly 


Leu 


Lys 


Val 


Phe 


Val 


Ala 


He 


Val 


Thr 


Val 


Val 


Arg 


Pro 


Pro 


Gly 


130 










135 










140 











75 



Val 


Phe 


Leu 


His 


Phe 


Pro 


Gin 


Asp 


Arg 


Val 


Pro 


He 


Ala 


Leu 


Thr 


Asp 


145 










150 










155 










160 


Ala 


Cys 


Ser 


Gin 


Glu 


Gly 


Ser 


Arg 


Leu 


Thr 


Ser 


Glu 


Glu 


Pro 


Trp 


Tin 

lie 








165 










170 










17 5 




Lys 


He 


Gin 


Gly 


Phe 


Pro 


Val 


Leu 


Ser 


Asp 


Glu 


Thr 


Ala 


Hi s 


Pro 


Phe 






180 










185 










190 






Leu 


Leu 


Thr 


Gin 


Lys 


Thr 


Lys 


Pro 


Phe 


Thr 


Glu 


Arg 


Lys 


Phe 


Cys 


Arg 






195 










200 










205 








Leu 


He 


Met 


Asp 


Asn 


Asp 


Gin 


Arg 


Ser 


Ala 


Val 


Asn 


Thr 


Val 


Tyr 


Leu 




210 










215 










220 








He 


Gly 


Lys 


Gin 


His 


Val 


Arg 


Val 


Thr 


Val 


Thr 


Arg 


Pro 


Pro 


Glu 


Thr 


225 








230 










235 










24 0 


Val 


Thr 


Asp 


Gly 


Pro 


Val 


Thr 


Ala 


Thr 


Leu 


Ser 


Leu 


Thr 


Gly 


Asn 


Ala 








245 










250 










255 




Pro 


He 


Ala 


Phe 


Arg 


His 


Asn 


Pro 


Tyr 


Phe 


Glu 


Leu 


Pro 


Trp 


Ser 


Ser 








260 










265 










270 






Thr 


Thr 


Ala 


He 


Phe 


Thr 


Pro 


Val 


Val 


Tyr 


Val 


Gly 


Leu 


Thr 


Val 


Cys 






275 










280 










285 








He 


Pro 


Pro 


Asn 


Cys 


Ser 


Lys 


Phe 


Val 


Arg 


Tyr 


Gly 


Asn 


Thr 


Tyr 


Val 




290 










295 










300 










Ser 


Ala 


Phe 


Asn 


Arg 


Lys 


Leu 


Thr 


Ala 


He 


He 


Ser 


Asn 


His 


Ala 


His 


305 










310 










315 










320 


Asn 


Gly 


Gly 


Phe 


Arg 


He 


Gin 


Asp 


Cys 


Glu 


Trp 


Pro 


Pro 


Asn 


Arg 


Glu 








325 










330 










335 




He 


Glu 


He 


Leu 


Val 


Thr 


Asn 


Val 


Ser 


Gin 


Ala 


Pro 


Val 


Tyr 


He 


Ser 








340 










345 










350 






Thr 


Gly 


Thr 


Gin 


Leu 


Gly 


Gin 


Ala 


He 


Phe 


Val 


Phe 


Ala 


Pro 


Arg 


Phe 




355 










360 










365 








Gly 


Gly 


Pro 


Ala 


Lys 


Leu 


Arg 


Gin 


Leu 


Leu 


Gly 


His 


Arg 


Ser 


Arg 


Ala 


370 










375 










380 










Leu 


Glu 


Leu 


Pro 


Gly 


Gly 


Val 


Thr 


Val 


Asp 


Ser 


Gin 


Lys 


Leu 


Cys 


Arg 


385 










390 










395 










400 


Phe 


Glu 


Thr 


Met 


Tyr 


Leu 


Phe 


Ser 


Thr 

















405 



<210> 20 

<211> 624 

<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . - (624) 

<400> 20 

atg ttc cct gtc tgg ttc gtc ttg ttt tac ctg teg tgt tgg gcg gec 

Met Phe Pro Val Trp Phe Val Leu Phe Tyr Leu Ser Cys Trp Ala Ala 
! 5 10 15 

age cct acg ctg gcg cct ccc ccg act gec get gga att aac gtt etc 
Ser Pro Thr Leu Ala Pro Pro Pro Thr Ala Ala Gly He Asn Val Leu 
20 25 30 

ccc cag tgg gec ggc aac cgc gec tct ctt gac agg acc agg ggg cgc 
Pro Gin Trp Ala Gly Asn Arg Ala Ser Leu Asp Arg Thr Arg Gly Arg 
35 40 45 

ctg tct gaa gtg ggg tta aac ata cag cgc tgg ttc gtt tac ctg tgc 
Leu Ser Glu Val Gly Leu Asn lie Gin Arg Trp Phe Val Tyr Leu Cys 



76 
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50 55 60 

cac cac tec act etc tgt egg gtg cgt gag tac ccg cgc ate atg teg 240 

His His Ser Thr Leu Cys Arg Val Arg Glu Tyr Pro Arg lie Met Ser 

65 70 75 80 

ttt gtt cac ttc cct ata ttg atg tct aac gtt gag tgc cag cgc cgc 288 

Phe Val His Phe Pro lie Leu Met Ser Asn Val Glu Cys Gin Arg Arg 

85 90 95 

gag ttt cgc ggg gec gag tgt atg aac gec atg gtt cgc ggg etc egg 336 

Glu Phe Arg Gly Ala Glu Cys Met Asn Ala Met Val Arg Gly Leu Arg 

100 105 110 

gec tac gag agt tac ctg acg cga ctg agg atg ctg ctg gac gac gcg 384 

Ala Tyr Glu Ser Tyr Leu Thr Arg Leu Arg Met Leu Leu Asp Asp Ala 

115 120 125 

ccc ggg gac gcg gac gec gcg gec att ggc tec gcg gtg acc gtg gtg 432 

Pro Gly Asp Ala Asp Ala Ala Ala lie Gly Ser Ala Val Thr Val Val 

130 135 140 ' 

ctg tec gec etc gac tct eta att gag gag ctt ccc gta aat aac aag 480 

Leu Ser Ala Leu Asp Ser Leu lie Glu Glu Leu Pro Val Asn Asn Lys 

145 150 155 160 

ata ggt ggc gcg gag tct aat gaa aaa acc gtg cgt gcg ttg gga ggg 528 

lie Gly Gly Ala Glu Ser Asn Glu Lys Thr Val Arg Ala Leu Gly Gly 

165 170 175 

cag age ccc egg gac gtt gtt etc age gcg ttt cgc ata ctg gaa tat 576 

Gin Ser Pro Arg Asp Val Val Leu Ser Ala Phe Arg He Leu Glu Tyr 

180 185 190 

eta cag atg ttt ttg egg gac ggg cgc cgc gca ata get atg atg taa 624 

Leu Gin Met Phe Leu Arg Asp Gly Arg Arg Ala He Ala Met Met 

195 200 205 



<210> 21 
<211> 207 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 21 

Met Phe Pro Val Trp Phe Val Leu Phe Tyr Leu Ser Cys Trp Ala Ala 

15 10 15 

Ser Pro Thr Leu Ala Pro Pro Pro Thr Ala Ala Gly He Asn Val Leu 

2 0 25 3 0 

Pro Gin Trp Ala Gly Asn Arg Ala Ser Leu Asp Arg Thr Arg Gly Arg 

35 40 45 

Leu Ser Glu Val Gly Leu Asn He Gin Arg Trp Phe Val Tyr Leu Cys 

50 55 60 

His His Ser Thr Leu Cys Arg Val Arg Glu Tyr Pro Arg He Met Ser 
65 70 75 80 

Phe Val His Phe Pro He Leu Met Ser Asn Val Glu Cys Gin Arg Arg 

85 90 95 

Glu Phe Arg Gly Ala Glu Cys Met Asn Ala Met Val Arg Gly Leu Arg 

100 105 110 

Ala Tyr Glu Ser Tyr Leu Thr Arg Leu Arg Met Leu Leu Asp Asp Ala 
115 120 125 
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Pro Gly Asp Ala 
130 

Leu Ser Ala Leu 
145 

lie Gly Gly Ala 

Gin Ser Pro Arg 
180 

Leu Gin Met Phe 
195 



Asp Ala Ala Ala 
135 

Asp Ser Leu lie 
150 

Glu Ser Asn Glu 
165 

Asp Val Val Leu 

Leu Arg Asp Gly 
200 



lie Gly Ser Ala 
140 

Glu Glu Leu Pro 
155 

Lys Thr Val Arg 
170 

Ser Ala Phe Arg 
185 

Arg Arg Ala lie 



Val Thr Val Val 

Val Asn Asn Lys 
160 

Ala Leu Gly Gly 
175 

lie Leu Glu Tyr 

190 
Ala Met Met 
205 



<210> 22 
<211> 1002 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) - . (1002) 

<400> 22 

atg att gtc ctg gtg cat etc ggt ate tgc tac gtt aaa aaa att ata 48 

Met lie Val Leu Val His Leu Gly lie Cys Tyr Val Lys Lys lie lie 

1 5 10 15 

ccc gtt tgc gtg get gga ate gcg gec gca cga etc egg gtt ttt tec 96 
Pro Val Cys Val Ala Gly He Ala Ala Ala Arg Leu Arg Val Phe Ser 
20 25 30 

gca ccc gaa ggt gec gcg gec gtc cgc tgc gcg tgc cgc ggc gac cac 144 
Ala Pro Glu Gly Ala Ala Ala Val Arg Cys Ala Cys Arg Gly Asp His 
35 40 45 

ggc gag ctg cag tac ttg gcg cac ctg gat tta att att aaa cac ggc 192 
Gly Glu Leu Gin Tyr Leu Ala His Leu Asp Leu He lie Lys His Gly 
50 55 60 

gtg cag agg gag gac agg acg ggc gtg ggc acc agg tec gtg ttc ggg 240 
Val Gin Arg Glu Asp Arg Thr Gly Val Gly Thr Arg Ser Val Phe Gly 
65 70 75 80 

ctt cag gec cga tat aac etc agg gac gag ttt cct ctg tta acc acc 288 
Leu Gin Ala Arg Tyr Asn Leu Arg Asp Glu Phe Pro Leu Leu Thr Thr 
85 90 95 

aaa agg gtg ttt tgg agg ggc gtc gtg gag gag ttg ctg tgg ttt ate 336 
Lys Arg Val Phe Trp Arg Gly Val Val Glu Glu Leu Leu Trp Phe He 
100 105 110 

a 99 99 c tec acc gac tec acc gaa ctg teg cgc cgc ggc gta aaa att 3 84 
Arg Gly Ser Thr Asp Ser Thr Glu Leu Ser Arg Arg Gly Val Lys He 
115 120 125 

tgg gac gcg cac ggg tec cgc gec ttt ttg gcg gcg cag ggc ttc ggg 432 
Trp Asp Ala His Gly Ser Arg Ala Phe Leu Ala Ala Gin Gly Phe Gly 
130 135 140 

gac cgc cgc gag ggc gat ctg ggg ccg gtg tac ggg ttc cag tgg aga 4 80 
Asp Arg Arg Glu Gly Asp Leu Gly Pro Val Tyr Gly Phe Gin Trp Arg 
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145 150 155 160 

cat ttt ggg gcg gag tac agg ggg gcc gac gcc aac tac gag ggt cag 52 8 

His Phe Gly Ala Glu Tyr Arg Gly Ala Asp Ala Asn Tyr Glu Gly Gin 

165 170 175 

ggg gtg gac cag ctg cgt tac gtg gtg gat eta att aac agg egg ccc 576 

Gly Val Asp Gin Leu Arg Tyr Val Val Asp Leu He Asn Arg Arg Pro 

180 185 190 

cac gat egg cgc ate gtt atg tgc gcg tgg aac ccc gcg gac etc gcg 624 

His Asp Arg Arg He Val Met Cys Ala Trp Asn Pro Ala Asp Leu Ala 

195 200 205 

egg atg get etc cct cct tgt cac gtt ttg tgt cag ttt tac gtg get 672 

Arg Met Ala Leu Pro Pro Cys His Val Leu Cys Gin Phe Tyr Val Ala 

210 215 220 

egg ggg gag ctg tec tgc cag ctg tac cag agg tec gcc gac atg ggc 72 0 

Arg Gly Glu Leu Ser Cys Gin Leu Tyr Gin Arg Ser Ala Asp Met Gly 

225 230 235 240 

etc ggg gtc ccg ttt aac ate gcc age tac gcc etc ctg acg tat ctg 768 

Leu Gly Val Pro Phe Asn lie Ala Ser Tyr Ala Leu Leu Thr Tyr Leu 

245. 250 255 

ate get cac gtc acg ggc ctg acc ccg gga gac ttt gtg cac ace ttg 816 

He Ala His Val Thr Gly Leu Thr Pro Gly Asp Phe Val His Thr Leu 

260 265 270 

ggt gac gcc cac gtt tac aac aac cac gtt gat ccc ctg ctg ctt cag 864 

Gly Asp Ala His Val Tyr Asn Asn His Val Asp Pro Leu Leu Leu Gin 

275 280 285 

ctg egg agg acc ccg cgt ccg ttt ccg egg ctg aag att ttg aga aag 912 

Leu Arg Arg Thr Pro Arg Pro Phe Pro Arg Leu Lys He Leu Arg Lys 

290 295 300 

gtg gcg cgt ctg gag gac ttt acg cgc gcg gat ctg agt etc gag ggc 960 

Val Ala Arg Leu Glu Asp Phe Thr Arg Ala Asp Leu Ser Leu Glu Gly 

305 310 315 320 

tac gac ccc cat ccc cac ata gag atg gag atg gcc gtt tga 1002 

Tyr Asp Pro His Pro His lie Glu Met Glu Met Ala Val 

325 330 



<210> 23 
<211> 333 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 23 

Met He Val Leu Val His Leu Gly He Cys Tyr Val Lys Lys He He 

15 10 15 

Pro Val Cys Val Ala Gly He Ala Ala Ala Arg Leu Arg Val Phe Ser 

20 25 30 

Ala Pro Glu Gly Ala Ala Ala Val Arg Cys Ala Cys Arg Gly Asp His 

35 40 45 

Gly Glu Leu Gin Tyr Leu Ala His Leu Asp Leu He He Lys His Gly 
50 55 60 
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Val Gin Arg Glu 
65 

Leu Gin Ala Arg 

Lys Arg Val Phe 
100 

Arg Gly Ser Thr 
115 

Trp Asp Ala His 
130 

Asp Arg Arg Glu 
145 

His Phe Gly Ala 

Gly Val Asp Gin 
180 

His Asp Arg Arg 
195 

Arg Met Ala Leu 
210 

Arg Gly Glu Leu 
225 

Leu Gly Val Pro 

lie Ala His Val 
260 

Gly Asp Ala His 
275 

Leu Arg Arg Thr 
290 

Val Ala Arg Leu 
305 

Tyr Asp Pro His 



Asp Arg Thr Gly 
70 

Tyr Asn Leu Arg 
85 

Trp Arg Gly Val 

Asp Ser Thr Glu 
120 

Gly Ser Arg Ala 
135 

Gly Asp Leu Gly 
150 

Glu Tyr Arg Gly 
165 

Leu Arg Tyr Val 

lie Val Met Cys 
200 

Pro Pro Cys His 
215 

Ser Cys Gin Leu 
230 

Phe Asn lie Ala 
245 

Thr Gly Leu Thr 

Val Tyr Asn Asn 
280 

Pro Arg Pro Phe 
295 

Glu Asp Phe Thr 
310 

Pro His lie Glu 
325 



Val Gly Thr Arg 
75 

Asp Glu Phe Pro 
90 

Val Glu Glu Leu 
105 

Leu Ser Arg Arg 

Phe Leu Ala Ala 
140 

Pro Val Tyr Gly 
155 

Ala Asp Ala Asn 
170 

Val Asp Leu lie 
185 

Ala Trp Asn Pro 

Val Leu Cys Gin 
220 

Tyr Gin Arg Ser 
235 

Ser Tyr Ala Leu 
250 

Pro Gly Asp Phe 
265 

His Val Asp Pro 

Pro Arg Leu Lys 
300 

Arg Ala Asp Leu 
315 

Met Glu Met Ala 
330 



Ser Val Phe Gly 
80 

Leu Leu Thr Thr 
95 

Leu Trp Phe lie 
110 

Gly Val Lys lie 
125 

Gin Gly Phe Gly 

Phe Gin Trp Arg 
160 

Tyr Glu Gly Gin 
175 

Asn Arg Arg Pro 
190 

Ala Asp Leu Ala 
205 

Phe Tyr Val Ala 

Ala Asp Met Gly 
240 

Leu Thr Tyr Leu 
255 

Val His Thr Leu 
270 

Leu Leu Leu Gin 
285 

lie Leu Arg Lys 

Ser Leu Glu Gly 
320 

Val 



<210> 24 

<211> 348 

<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (!) . . (348) 

<400> 24 

atg agg ggc ctt ttc gtg tgc gtt ttt ttt gcg gtg ttc gcg tgt gta 

Met Arg Gly Leu Phe Val Cys Val Phe Phe Ala Val Phe Ala Cys Val 

15 10 15 

gtt gat tat gcc ttt cct atg ggc tec atg age gga ccc gcg ccc gaa 
Val Asp Tyr Ala Phe Pro Met Gly Ser Met Ser Gly Pro Ala Pro Glu 
20 25 30 

etc tgc tgt ttg ggg tat gta act cat ctg ccg cca ccc ggt tta gtg 
Leu Cys Cys Leu Gly Tyr Val Thr His Leu Pro Pro Pro Gly Leu Val 
35 40 45' 

gtc tct tac tec cac acc teg teg cag tgc teg gtg gac gcc gtg ata 
Val Ser Tyr Ser His Thr Ser Ser Gin Cys Ser Val Asp Ala Val lie 
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50 55 60 

tta aac act cgc cgc ggt aaa aag ctg tgt gcc aat ccc ggg gac gac 240 

Leu Asn Thr Arg Arg Gly Lys Lys Leu Cys Ala Asn Pro Gly Asp Asp 

65 70 75 80 



gca gtg aag aaa ctg ctt cag gcg gtg gac aag cgt ccc aaa aag ggc 
Ala Val Lys Lys Leu Leu Gin Ala Val Asp Lys Arg Pro Lys Lys Gly 
85 90 95 



age ggg att tag 
Ser Gly lie 
115 



288 



aga aga acc egg cgc age ctg att gac gat tec gaa gag ggc ctt ggc 336 
Arg Arg Thr Arg Arg Ser Leu lie Asp Asp Ser Glu Glu Gly Leu Gly 
100 105 110 



348 



<210> 25 
<211> 115 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 25 

Met Arg Gly Leu Phe Val Cys Val Phe Phe Ala Val Phe Ala Cys Val 

1 5 10 15 

Val Asp Tyr Ala Phe Pro Met Gly Ser Met Ser Gly Pro Ala Pro Glu 

20 25 30 

Leu Cys Cys Leu Gly Tyr Val Thr His Leu Pro Pro Pro Gly Leu Val 

35 40 45 

Val Ser Tyr Ser His Thr Ser Ser Gin Cys Ser Val Asp Ala Val lie 

50 55 60 

Leu Asn Thr Arg Arg Gly Lys Lys Leu Cys Ala Asn Pro Gly Asp Asp 
65 70 75 80 

Ala Val Lys Lys Leu Leu Gin Ala Val Asp Lys Arg Pro Lys Lys Gly 

85 90 95 

Arg Arg Thr Arg Arg Ser Leu lie Asp Asp Ser Glu Glu Gly Leu Gly 
100 105 110 

Ser Gly lie 
115 



<210> 26 
<211> 564 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) - - (564) 

<400> 26 

atg gcg get gtt cag ggc cct ccg ccg ccc cct gag gaa gaa aat gaa 48 
Met Ala Ala Val Gin Gly Pro Pro Pro Pro Pro Glu Glu Glu Asn Glu 
15 10 15 

aat tct ttg cca gtt gac gtt tat get ata gag ggc ate ttt ctt tae 96 
Asn Ser Leu Pro Val Asp Val Tyr Ala lie Glu Gly lie Phe Leu Tyr 
20 25 30 



81 



li.J! O ■ ILJIi 1LJ si H,J ; !Ui !! 



ggc gaa tgg gag gga etc agg aat tac tgc gag cga ata ctg agg cat 
Gly Glu Trp Glu Gly Leu Arg Asn Tyr Cys Glu Arg lie Leu Arg His 
145 150 155 160 



288 



tgt ggg etc ggg cag gcg gag tac ttg cac cat ccc gtg ttt age cct 144 
Cys Gly Leu Gly Gin Ala Glu Tyr Leu His His Pro Val Phe Ser Pro 
35 40 45 

att aag gaa ttt ate age gee ttt etc aag gac age get cgc ctg tac 192 
lie Lys Glu Phe lie Ser Ala Phe Leu Lys Asp Ser Ala Arg Leu Tyr 
50 55 60 

gag agg etc ttg cgc cac acc gat tac cgc tct ctg egg gga eta aac 240 
Glu Arg Leu Leu Arg His Thr Asp Tyr Arg Ser Leu Arg Gly Leu Asn 
65 70 75 80 

gee ata ggc caa ggg atg ctg caa ata aac acg gac gga cgc cac aac 
Ala He Gly Gin Gly Met Leu Gin He Asn Thr Asp Gly Arg His Asn 
85 90 95 

tgg ggt cgc get ttg gee gtg ttg ggt ctt ggt gcg tat gtg gtg gat 336 
Trp Gly Arg Ala Leu Ala Val Leu Gly Leu Gly Ala Tyr Val Val Asp 
100 105 HO 

aag gtt aaa gac gac gag cgt ctt tta acg ttc gee ata gee gtt eta 384 
Lys Val Lys Asp Asp Glu Arg Leu Leu Thr Phe Ala He Ala Val Leu 
115 120 125 

ccc gtg tac gcg tac gag gcg ctg gag tct cag tgg ttt cgt tea cac 432 
Pro Val Tyr Ala Tyr Glu Ala Leu Glu Ser Gin Trp Phe Arg Ser His 
130 135 140 



480 



cgc cgc aac gcg agg aga cac atg tgc tac gga gtt gcg get ggt ctt 52 8 

Arg Arg Asn Ala Arg Arg His Met Cys Tyr Gly Val Ala Ala Gly Leu 
165 170 175 

ctg gcg eta gtg gcg ctg ttt gee ate agg cga tag 564 

Leu Ala Leu Val Ala Leu Phe Ala He Arg Arg 

180 185 



<210> 27 
<211> 187 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 27 

Met Ala Ala Val Gin Gly Pro Pro Pro Pro Pro Glu Glu Glu Asn Glu 

15 10 15 

Asn Ser Leu Pro Val Asp Val Tyr Ala He Glu Gly He Phe Leu Tyr 

20 25 30 

Cys Gly Leu Gly Gin Ala Glu Tyr Leu His His Pro Val Phe Ser Pro 

35 40 45 

He Lys Glu Phe He Ser Ala Phe Leu Lys Asp Ser Ala Arg Leu Tyr 

50 55 60 

Glu Arg Leu Leu Arg His Thr Asp Tyr Arg Ser Leu Arg Gly Leu Asn 
65 70 75 80 

Ala lie Gly Gin Gly Met Leu Gin lie Asn Thr Asp Gly Arg His Asn 

85 90 95 

Trp Gly Arg Ala Leu Ala Val Leu Gly Leu Gly Ala Tyr Val Val Asp 
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100 105 HO 



Lys 


Val 


Lys 


Asp 


Asp 


Glu 


Arg 


Leu 


Leu 


Thr 


Phe 


Ala 


lie 


Ala 


Val 


Leu 






115 










120 










125 








Pro 


Val 


Tyr 


Ala 


Tyr 


Glu 


Ala 


Leu 


Glu 


Ser 


Gin 


Trp 


Phe 


Arg 


Ser 


His 




130 










135 










140 










Gly 


Glu 


Trp 


Glu 


Gly 


Leu 


Arg 


Asn 


Tyr 


Cys 


Glu 


Arg 


He 


Leu 


Arg 


His 


145 










150 










155 










160 


Arg 


Arg 


Asn 


Ala 


Arg 


Arg 


His 


Met 


Cys 


Tyr 


Gly 


Val 


Ala 


Ala 


Gly 


Leu 










165 










170 










175 




Leu 


Ala 


Leu 


Val 


Ala 


Leu 


Phe 


Ala 


lie 


Arg 


Arg 













180 185 



<210> 28 

<211> 1611 

<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (1611) 

<400> 28 

atg act ccc gtg tac gtt ggg gga tac gtg gac gtg gtc age eta cca 
Met Thr Pro Val Tyr Val Gly Gly Tyr Val Asp Val Val Ser Leu Pro 
1 5 10 15 

aag ata gaa aag gag ctg tat tta gag ccc tea ate gtg gcg acc ctg 
Lys He Glu Lys Glu Leu Tyr Leu Glu Pro Ser He Val Ala Thr Leu 
20 25 30 

etc ccg tat acg gac cct eta ccg ata aac ata gag cac gtc ccc gaa 
Leu Pro Tyr Thr Asp Pro Leu Pro He Asn He Glu His Val Pro Glu 
35 40 45 



ttt tgc ctg ggc aag eta acg age cac gat ttc ttg gec ctg gcg tea 

Phe Cys Leu Gly Lys Leu Thr Ser His Asp Phe Leu Ala Leu Ala Ser 

65 70 75 80 

egg etc gcg ggg gac teg cga gcg gee cag ate cag eta aac ccc atg 

Arg Leu Ala Gly Asp Ser Arg Ala Ala Gin He Gin Leu Asn Pro Met 

85 90 95 

cct egg gac ccc ctg ttg gag atg tta cac aca tgg etc ccg gag etc 

Pro Arg Asp Pro Leu Leu Glu Met Leu His Thr Trp Leu Pro Glu Leu 

100 105 HO 



48 



96 



144 



gee cac gta ggt cac aca ate ggt etc ttc caa gta aca cac ggg ata 192 
Ala His Val Gly His Thr He Gly Leu Phe Gin Val Thr His Gly He 
50 55 60 



240 



288 



336 



tec etc teg tec etc cac cca gaa gag ctg caa gac ccc aat cac cca 384 

Ser Leu Ser Ser Leu His Pro Glu Glu Leu Gin Asp Pro Asn His Pro 
115 120 125 

ccc gcg ttt cag cac gtc tct ctg tgc gca ctg ggg agg cgt agg ggg 432 

Pro Ala Phe Gin His Val Ser Leu Cys Ala Leu Gly Arg Arg Arg Gly 

130 135 140 



tea ate gee gtt tac gga cca gac cct aca tgg gtc gtc tec aag ttt 

83 



480 



Ser lie Ala Val Tyr Gly Pro Asp Pro Thr Trp Val Val Ser Lys Phe 
145 150 155 160 



gat tct ctt acg cga gag gaa gcc ggt aaa ata acg gtc aat tgt tta 528 

Asp Ser Leu Thr Arg Glu Glu Ala Gly Lys lie Thr Val Asn Cys Leu 
165 170 175 

gat ctt tgc gaa cgt cag gtg aca ccg cca gaa ttc gcg gcc cct ctt 576 

Asp Leu Cys Glu Arg Gin Val Thr Pro Pro Glu Phe Ala Ala Pro Leu 
180 185 190 

gag aca ctg atg gcc aag gcg ate gac gcc gga ttt ate cga gat cgc 624 

Glu Thr Leu Met Ala Lys Ala lie Asp Ala Gly Phe lie Arg Asp Arg 

195 200 205 

aca gac ctg etc aaa aca gat aaa ggc gtg gcc aga gtc gcg aga agt 672 

Thr Asp Leu Leu Lys Thr Asp Lys Gly Val Ala Arg Val Ala Arg Ser 

210 215 220 

acg tat tta aag gca age cag ttt ccc tgc get cag cac tgc ggt aac 720 

Thr Tyr Leu Lys Ala Ser Gin Phe Pro Cys Ala Gin His Cys Gly Asn 
225 230 235 240 

cgc gac acc cga acc atg age gcc etc ccg gag gac aac ate acc att 768 

Arg Asp Thr Arg Thr Met Ser Ala Leu Pro Glu Asp Asn lie Thr lie 
245 250 255 

ccc aag age acc ttt ctg acc atg gtg cag age agt etc gat aac atg 816 

Pro Lys Ser Thr Phe Leu Thr Met Val Gin Ser Ser Leu Asp Asn Met 
260 265 270 

cgc aac cag ggc cac cgc acg tac gtt tct gcg cca ccc teg atg ccg 864 

Arg Asn Gin Gly His Arg Thr Tyr Val Ser Ala Pro Pro Ser Met Pro 

275 280 285 

gca acg gcg gcg tat cca teg tgg ata ccg ccg cct gag ctg acc gtc 912 

Ala Thr Ala Ala Tyr Pro Ser Trp lie Pro Pro Pro Glu Leu Thr Val 

290 295 300 

cca teg tac gca ccg ccc gtc gcg cca ccg ttc cct ttc cag teg gcg 960 

Pro Ser Tyr Ala Pro Pro Val Ala Pro Pro Phe Pro Phe Gin Ser Ala 
305 310 315 320 

ttt gcc ccc caa ccc tct ccg tat gcg gcc acg tac tat tct ccg acg 1008 

Phe Ala Pro Gin Pro Ser Pro Tyr Ala Ala Thr Tyr Tyr Ser Pro Thr 
325 330 335 

tac ggc tac get ccg gca ccg tct cgc cac cag aag cgc aaa egg gac 1056 

Tyr Gly Tyr Ala Pro Ala Pro Ser Arg His Gin Lys Arg Lys Arg Asp 
340 345 350 

gtg gag ctt tec gac gag ccc gtg ttt cca ggc gag gag gtc ggc ata 1104 

Val Glu Leu Ser Asp Glu Pro Val Phe Pro Gly Glu Glu Val Gly He 

355 360 365 

cac aag gac gtc atg gcg ctg tec aaa aac ata ttg gac att cag gcg 1152 

His Lys Asp Val Met Ala Leu Ser Lys Asn He Leu Asp He Gin Ala 

370 375 380 

gat ctg agg gac eta aaa cga gcc gcg teg cag acc tec ggc gca cag 1200 

Asp Leu Arg Asp Leu Lys Arg Ala Ala Ser Gin Thr Ser Gly Ala Gin 

84 
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385 390 395 400 



gac gcc gac caa cga ccg caa cca ccg ccc gtt cag ttc teg tgg cca 

Asp Ala Asp Gin Arg Pro Gin Pro Pro Pro Val Gin Phe Ser Trp Pro 

405 410 415 

cag acc tac gcg tct gcg ccg tac eta gcg tac cag ccg cag tgg tat 

Gin Thr Tyr Ala Ser Ala Pro Tyr Leu Ala Tyr Gin Pro Gin Trp Tyr 

420 425 430 

tec gga acg gac acc cat etc cac gcc cca cag ccg tac cag age gcg 

Ser Gly Thr Asp Thr His Leu His Ala Pro Gin Pro Tyr Gin Ser Ala 

435 440 445 

cag ggc ate cag caa acg caa cca ccg ccc ccg cag ccg gcg age cac 

Gin Gly He Gin Gin Thr Gin Pro Pro Pro Pro Gin Pro Ala Ser His 

450 455 460 

cac gcc ggt etc gcc acg cag ccg gca act cca gcc ccc gcc gcc caa 

His Ala Gly Leu Ala Thr Gin Pro Ala Thr Pro Ala Pro Ala Ala Gin 

465 470 475 480 

gaa tct gta atg tea aac gcc att cct tea gcc tct gcc ccc cgc get 

Glu Ser Val Met Ser Asn Ala He Pro Ser Ala Ser Ala Pro Arg Ala 

485 490 495 

ggc gcg tgc ccg ccc etc gat cca gaa tgc gga cag tec gcg egg gcc 

Gly Ala Cys Pro Pro Leu Asp Pro Glu Cys Gly Gin Ser Ala Arg Ala 

500 505 510 



12 4 8 



1296 



1344 



1392 



1440 



1488 



1536 



ccg gtg gag gcc age gca cag ccg gcc ccc gtg tea cag ata caa aaa 1584 
Pro Val Glu Ala Ser Ala Gin Pro Ala Pro Val Ser Gin He Gin Lys 
515 520 525 

atg ttc tgc gag gaa ctg etc aaa taa 1611 
Met Phe Cys Glu Glu Leu Leu Lys 
530 535 



<210> 29 
<211> 536 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 29 

Met Thr Pro Val Tyr Val Gly Gly Tyr Val Asp Val Val Ser Leu Pro 

1 5 10 15 

Lys He Glu Lys Glu Leu Tyr Leu Glu Pro Ser He Val Ala Thr Leu 

20 25 30 

Leu Pro Tyr Thr Asp Pro Leu Pro He Asn He Glu His Val Pro Glu 

35 40 45 

Ala His Val Gly His Thr He Gly Leu Phe Gin Val Thr His Gly He 

50 55 60 

Phe Cys Leu Gly Lys Leu Thr Ser His Asp Phe Leu Ala Leu Ala Ser 
65 70 75 80 

Arg Leu Ala Gly Asp Ser Arg Ala Ala Gin He Gin Leu Asn Pro Met 

85 90 95 

Pro Arg Asp Pro Leu Leu Glu Met Leu His Thr Trp Leu Pro Glu Leu 

100 105 HO 

Ser Leu Ser Ser Leu His Pro Glu Glu Leu Gin Asp Pro Asn His Pro 
115 120 125 
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Pro 


Ala 


Phe 


Gin 


His 


Val 


Ser 


Leu 


Cys 


Ala 


Leu 


Gly 


Arg 


Arg 


Arg 


Gly 




130 










135 










140 










Ser 


lie 


Ala 


Val- 


Tvr 


Gly 


Pro 


Asp 
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Thr 


Trp 


Val 


Val 


Ser 


Lys 


Phe 


14 5 










150 










155 










160 


Asp 


Ser 


Leu 


Thr 


Arg 


Glu 


Glu 


Ala 


Gly 


Lys 


He 


Thr 


Val 


Asn 


Cys 


Leu 








165 










170 










175 




Asp 




Cy s 


Glu 


Arg 


Gin 


Val 


Thr 


Pro 


Pro 


Glu 


Phe 


Ala 


Ala 


Pro 


Leu 






180 










185 










190 






Glu 


Thr 


Leu 


Met 


Ala 


Lys 


Ala 


lie 


Asp 


Ala 


Gly 


Phe 


He 


Arg 


Asp 


Arg 






195 










200 










205 
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Asp 
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Thr 


Asp 


Lys 


Gly 


Val 


Ala 


Arg 


Val 
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Arg 


Ser 
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215 










220 
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Lys 
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Gin 


His 


Cys 
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Asn 
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235 










240 


Arg 


sp 


Thr 


Arg 


Thr 


Met 


Ser 


Ala 


Leu 


Pro 


Glu 


Asp 


Asn 


He 


Thr 


He 






245 










250 










255 




Pro 


Ly s 


Ser 


Thr 


Phe 


Leu 


Thr 


Met 


Val 


Gin 


Ser 


Ser 


Leu 


Asp 


Asn 


Met 






260 










265 










270 






Arg 


Asn 


Lj 111 


Gly 


Hi s 


rg 


Thr 




Val 


Ser 


Ala 


Pro 


Pro 


Ser 


Met 


Pro 






27 5 










280 










285 
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Aia 
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Ala 


±y l 


Pro 
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Trn 


lie 


Pro 


Pro 


Pro 


Glu 


Leu 
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2 95 










300 










Piro 


Ser 


Tyr 


Al a 


Pro 


Pro 


Val 


Ala 


Pro 


Pro 


Phe 


Pro 


Phe 


Gin 


Ser 


Ala 


O U 3 








310 










315 










320 


rile 


J-\-L ci 


Pro 


Gin 


Pro 


Ser 


Pro 


Tvr 


Ala 


Ala 


Thr 


Tyr 


Tyr 


Ser 


Pro 


Thr 










3 2 5 










330 










335 




Ty r 


Gly 


Tyr 






Al a 


Pro 


Ser 


Arg 


Hi s 


Gin 


Lys 


Arq 


Lys 


Arg 


Asp 




34 0 










34 5 










350 






Val 


Glu 


Leu 


Ser 


sp 


Glu 


Pro 


Va 1 


Phe 


Pro 


Glv 


Glu 


Glu 


Val 


Gly 


He 






3 5 5 










3 60 










365 








Hi s 


Lys 


Asp 


Val 


Met 


Ala 


Leu 


Ser 


Lys 


Asn 


He 


Leu 


Asp 


He 


Gin 


Ala 
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375 










380 










Asp 


Leu 


Arg 


A 

sp 


Leu 


Lys 


Arg 


Ala 


Ala 


Ser 


Gin 


Thr 


Ser 


Gly 


Ala 


Gin 
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3 90 










3 95 










400 


Asp 


Ala 


Asp 


Gin 


Arg 


Pro 


Gin 


Pro 


Pro 


Pro 


Val 


Gin 


Phe 


Ser 


Trp 


Pro 








4 05 










4 10 










415 






1 ILL 


Tyr 
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Ser 
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Pro 


Tvr 
j 


Leu 


Ala 


Tvr 


Gin 


Pro 


Gin 


Trp 


Tyr 






42 0 










425 










430 






S6 r 
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Thr 


Asp 


Thr 


His 


Leu 


His 


Ala 


Pro 


Gin 


Pro 


Tyr 


Gin 


Ser 


Ala 






43 5 










440 










445 








Gin 


Gly 


lie 


Gin 


Gin 


Thr 


Gin 


Pro 


Pro 


Pro 


Pro 


Gin 


Pro 


Ala 


Ser 


His 




450 










455 
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His 
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Gly 


Leu 
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Thr 


Gin 


Pro 


Ala 


Thr 


Pro 


Ala 


Pro 


Ala 


Ala 


Gin 


465 








470 










475 










480 


Glu 


Ser 


Val 


Met 


Ser 


Asn 


Ala 


lie 


Pro 


Ser 


Ala 


Ser 


Ala 


Pro 


Arg 


Ala 










485 










490 
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530 535 



<210> 30 
<211> 900 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 
<220> 



86 
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<221> CDS 

<222> (1) - - (900) 



<400> 30 

atg ttt ate ggt aga ggg tec gta tac ggg age agg gtc gee acg att 

Met Phe lie Gly Arg Gly Ser Val Tyr Gly Ser Arg Val Ala Thr lie 

1 5 10 15 

gag ggc tct aaa tac age tec ttt tct ate ttt ggt agg ctg ace acg 

Glu Gly Ser Lys Tyr Ser Ser Phe Ser lie Phe Gly Arg Leu Thr Thr 

20 25 30 



48 



96 



tec acg tat ccc cca acg tac acg gga gtc atg ttg ggt egg tgt ctg 
Ser Thr Tyr Pro Pro Thr Tyr Thr Gly Val Met Leu Gly Arg Cys Leu 
35 40 45 



144 



cgt gag ccg aag gag atg age gec ggt ctt egg ggg ctg atg tgg cgc 
Arg Glu Pro Lys Glu Met Ser Ala Gly Leu Arg Gly Leu Met Trp Arg 
50 55 60 



192 



gtt ate cgt tgt gag aat tta aat acg ttt etc cct ggc gag etc agg 
Val lie Arg Cys Glu Asn Leu Asn Thr Phe Leu Pro Gly Glu Leu Arg 
65 70 75 80 



240 



ttt ctt cac ctg gtg ctg tgc gaa atg tac aac tac ggt ctt aac gtg 
Phe Leu His Leu Val Leu Cys Glu Met Tyr Asn Tyr Gly Leu Asn Val 
85 90 95 



288 



tac ctt ctg aag gag gca att gee aat ace gga acg agg gac gat att 
Tyr Leu Leu Lys Glu Ala He Ala Asn Thr Gly Thr Arg Asp Asp He 
100 105 HO 



336 



gtc etc ggg cgt aag gtt ccg gtg gag ttc tgg aaa ate gtt tac gat 
Val Leu Gly Arg Lys Val Pro Val Glu Phe Trp Lys He Val Tyr Asp 
115 120 125 



384 



ggg tta aag gag atg ggg gtt tct gac gca acg ttg etc tea gaa ace 
Gly Leu Lys Glu Met Gly Val Ser Asp Ala Thr Leu Leu Ser Glu Thr 
130 135 140 , 



432 



aaa agg ggg get ctg tgg ctt tac ttc aac ggg cgt ccg tgc ctg tta 
Lys Arg Gly Ala Leu Trp Leu Tyr Phe Asn Gly Arg Pro Cys Leu Leu 
145 150 155 160 



480 



aaa ggt eta ggc gac tac gtt ttt tgc caa ctg ggg tta age cac age 
Lys Gly Leu Gly Asp Tyr Val Phe Cys Gin Leu Gly Leu Ser His Ser 
165 170 175 



528 



gta cgc gtg gtt ccg gaa aac eta acg gat ggg aat tat etc tat aac 
Val Arg Val Val Pro Glu Asn Leu Thr Asp Gly Asn Tyr Leu Tyr Asn 
180 185 190 



576 



eta ggc age gtc ata cca tgt cgc ctg ctt gtg gca ctg age tac tgt 
Leu Gly Ser Val He Pro Cys Arg Leu Leu Val Ala Leu Ser Tyr Cys 
195 200 205 



624 



ttg gcg ttt tgg ggg cac get gac cac gag ccc tgg gtc agg ttg ttc 
Leu Ala Phe Trp Gly His Ala Asp His Glu Pro Trp Val Arg Leu Phe 
210 215 220 



672 



gec ggc aag att ttt ata ctg tac tta ata att tec ggc cac att atg 

87 



720 



Ala Gly Lys lie Phe lie Leu Tyr Leu lie lie Ser Gly His lie Met 
225 230 235 240 



cca egg aag tea ate tta gaa cag gtg gga acg tec ggg tac ggt ggc 
Pro Arg Lys Ser lie Leu Glu Gin Val Gly Thr Ser Gly Tyr Gly Gly 
245 250 255 

ttc gtc gag gca gtg tgt cgc gat gtt cgc gcg gta cac ggg att ccg 
Phe Val Glu Ala Val Cys Arg Asp Val Arg Ala Val His Gly lie Pro 
260 265 270 

gcg tgg gat ttt gcg tec gcg get ccg gcg tta acg tct caa cag acg 
Ala Trp Asp Phe Ala Ser Ala Ala Pro Ala Leu Thr Ser Gin Gin Thr 
275 280 285 

gac tac ttg ttt gcg ttc aat aac age gta gtg tag 
Asp Tyr Leu Phe Ala Phe Asn Asn Ser Val Val 

290 295 300 



<210> 31 
<211> 299 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 31 

Met Phe lie Gly Arg Gly Ser Val Tyr Gly Ser Arg Val Ala Thr lie 

1 5 10 ' 15 

Glu Gly Ser Lys Tyr Ser Ser Phe Ser lie Phe Gly Arg Leu Thr Thr 

20 25 30 

Ser Thr Tyr Pro Pro Thr Tyr Thr Gly Val Met Leu Gly Arg Cys Leu 

35 40 45 

Arg Glu Pro Lys Glu Met Ser Ala Gly Leu Arg Gly Leu Met Trp Arg 

50 55 60 

Val lie Arg Cys Glu Asn Leu Asn Thr Phe Leu Pro Gly Glu Leu Arg 
65 70 75 80 

Phe Leu His Leu Val Leu Cys Glu Met Tyr Asn Tyr Gly Leu Asn Val 

85 90 95 

Tyr Leu Leu Lys Glu Ala He Ala Asn Thr Gly Thr Arg Asp Asp He 

100 105 110 

Val Leu Gly Arg Lys Val Pro Val Glu Phe Trp Lys He Val Tyr Asp 

115 120 125 

Gly Leu Lys Glu Met Gly Val Ser Asp Ala Thr Leu Leu Ser Glu Thr 

130 135 140 

Lys Arg Gly Ala Leu Trp Leu Tyr Phe Asn Gly Arg Pro Cys Leu Leu 
145 150 155 160 

Lys Gly Leu Gly Asp Tyr Val Phe Cys Gin Leu Gly Leu Ser His Ser 

165 170 175 

Val Arg Val Val Pro Glu Asn Leu Thr Asp Gly Asn Tyr Leu Tyr Asn 

180 185 190 

Leu Gly Ser Val He Pro Cys Arg Leu Leu Val Ala Leu Ser Tyr Cys 

195 200 205 

Leu Ala Phe Trp Gly His Ala Asp His Glu Pro Trp Val Arg Leu Phe 

210 215 220 

Ala Gly Lys He Phe He Leu Tyr Leu He He Ser Gly His He Met 
225 230 235 240 

Pro Arg Lys Ser He Leu Glu Gin Val Gly Thr Ser Gly Tyr Gly Gly 

245 250 255 

Phe Val Glu Ala Val Cys Arg Asp Val Arg Ala Val His Gly He Pro 

260 265 270 

Ala Trp Asp Phe Ala Ser Ala Ala Pro Ala Leu Thr Ser Gin Gin Thr 
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275 280 285 

Asp Tyr Leu Phe Ala Phe Asn Asn Ser Val Val 
290 295 



<210> 32 
<211> 1644 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) - - (1644) 

<400> 32 

atg cgc acg tea gaa aaa tgc tgt atg aga tac cca cga aaa cca gcg 

Met Arg Thr Ser Glu Lys Cys Cys Met Arg Tyr Pro Arg Lys Pro Ala 

X 5 10 15 

egg caa att acc gca act ttt tgg gec ccc cat cca aac aac gtt ctg 
Arg Gin lie Thr Ala Thr Phe Trp Ala Pro His Pro Asn Asn Val Leu 
20 25 30 



ctg etc egg eta gaa eta gac aac gtg ctg caa acg cac caa egg gaa 
Leu Leu Arg Leu Glu Leu Asp Asn Val Leu Gin Thr His Gin Arg Glu 
65 70 75 80 



cgc agg gtg ttc ccc aga gat ttg cga ggc aca aca aac ttt caa aac 

89 



48 



96 



ttt att cac aaa cca tct ctg ata gaa gaa aga aga aac gcg ttt gtg 144 

Phe lie His Lys Pro Ser Leu lie Glu Glu Arg Arg Asn Ala Phe Val 
35 40 45 

atg cga aat caa cag ctg gcg eta agg gtt cac acg ctg cga aaa aac 192 

Met Arg Asn Gin Gin Leu Ala Leu Arg Val His Thr Leu Arg Lys Asn 
50 55 60 



240 



aca gaa gtg gtc atg cgc gac eta gaa aca att caa aac atg gtc ggg 288 

Thr Glu Val Val Met Arg Asp Leu Glu Thr lie Gin Asn Met Val Gly 
85 90 95 

gat ctg egg tec ccg ggc agg gaa acc gec aat gee caa aca tea eta 336 

Asp Leu Arg Ser Pro Gly Arg Glu Thr Ala Asn Ala Gin Thr Ser Leu 
100 105 110 

aac ccg caa ccc aaa ate gec ccg cag aca cat gga gac gcg ttc gtt 384 

Asn Pro Gin Pro Lys lie Ala Pro Gin Thr His Gly Asp Ala Phe Val 
115 120 125 

gtc aca ata gec ccc gga gac ccg gga ttc acg gta aac cag gat etc 432 

Val Thr lie Ala Pro Gly Asp Pro Gly Phe Thr Val Asn Gin Asp Leu 

130 135 140 

cgt ctg gag etc ctg cca age ctg tac atg aac caa aac caa tgg ctg 480 

Arg Leu Glu Leu Leu Pro Ser Leu Tyr Met Asn Gin Asn Gin Trp Leu 
145 150 155 160 

cca caa tac ggg ccg tgg tac agt age etc aca gat aac gee atg caa 528 

Pro Gin Tyr Gly Pro Trp Tyr Ser Ser Leu Thr Asp Asn Ala Met Gin 
165 170 175 



576 
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Arg Arg Val Phe Pro Arg Asp Leu Arg Gly Thr Thr Asn Phe Gin Asn 
180 185 190 

tec acg tct eta aag eta atg tct gee gta ate tea aca gee gee teg 624 
Ser Thr Ser Leu Lys Leu Met Ser Ala Val lie Ser Thr Ala Ala Ser 
195 200 205 

ate aca caa gac ttt tac gcg gat gtc cgc aac gta teg gac acg cag 672 
lie Thr Gin Asp Phe Tyr Ala Asp Val Arg Asn Val Ser Asp Thr Gin 
210 215 220 

gee gcg ctg tgc etc etc aac ggg tac tac tgt cac egg aca ggt ace 720 
Ala Ala Leu Cys Leu Leu Asn Gly Tyr Tyr Cys His Arg Thr Gly Thr 
225 230 235 240 

ccg ttg ccg cca act cga aac ggc etc tgg gat aac etc ggc aca aag 768 
Pro Leu Pro Pro Thr Arg Asn Gly Leu Trp Asp Asn Leu Gly Thr Lys 
245 250 255 

ctg gcg acg ctt gtc tct cac eta aaa caa aac acg aag ggc ctg ggt 816 
Leu Ala Thr Leu Val Ser His Leu Lys Gin Asn Thr Lys Gly Leu Gly 
260 265 270 

ttc gag ttt aca tat teg aat ccg agg caa cgc gec tct eta gcg ccc 864 
Phe Glu Phe Thr Tyr Ser Asn Pro Arg Gin Arg Ala Ser Leu Ala Pro 
275 280 285 

eta aac aaa gaa aca aag tac aac gcg gac ttc ttt aca aac cac gtg 912 
Leu Asn Lys Glu Thr Lys Tyr Asn Ala Asp Phe Phe Thr Asn His Val 
290 295 300 

ata tac gcg acg ttg gca cag age gga ctg eta ccc ggt tec aag aac 960 
lie Tyr Ala Thr Leu Ala Gin Ser Gly Leu Leu Pro Gly Ser Lys Asn 
305 310 315 320 

ccc ggt ace ggt cag cca cet gga cca gat etc gta tac ate ctt gcg 1008 
Pro Gly Thr Gly Gin Pro Pro Gly Pro Asp Leu Val Tyr lie Leu Ala 
325 330 335 

ace act ttg ttc age gaa gac gtg cet ccc ttt caa gcg tac caa tgg 1056 
Thr Thr Leu Phe Ser Glu Asp Val Pro Pro Phe Gin Ala Tyr Gin Trp 
340 345 350 

aac etc cga gee ggc etc teg gca eta ggc tgt tta gtt ttg gtt tac 1104 
Asn Leu Arg Ala Gly Leu Ser Ala Leu Gly Cys Leu Val Leu Val Tyr 
355 360 365 

gtg eta ttg gag ctg gee caa ate aca ccc aga teg ccc cac egg cgc 1152 
Val Leu Leu Glu Leu Ala Gin lie Thr Pro Arg Ser Pro His Arg Arg 
370 375 380 

etc aac ctg gee teg etc ctg ggt ggg aga ttt tec aag gta gaa gac 1200 
Leu Asn Leu Ala Ser Leu Leu Gly Gly Arg Phe Ser Lys Val Glu Asp 
385 390 395 400 

ccg tec ggg tec aaa caa tac ctt aaa aag gga caa ctt ttt gat ttc 1248 
Pro Ser Gly Ser Lys Gin Tyr Leu Lys Lys Gly Gin Leu Phe Asp Phe 
405 410 415 

ctg aca gaa aat tac ate tec cca att ctt teg cgc gcg cca gac gee 1296 
Leu Thr Glu Asn Tyr lie Ser Pro He Leu Ser Arg Ala Pro Asp Ala 

90 




1440 



1488 



420 425 430 

ccc acg teg ttt ttg ttc ccg ggt gcg tat eta gcg gcg ctg gag gca 1344 
Pro Thr Ser Phe Leu Phe Pro Gly Ala Tyr Leu Ala Ala Leu Glu Ala 
435 440 445 

aag gec ata tec cat eta aaa cac act cgt ccg ttc gtc aac ctg acc 1392 
Lys Ala He Ser His Leu Lys His Thr Arg Pro Phe Val Asn Leu Thr 
450 455 460 

ggt tec aga ttt aac gaa ate ttt gac ate ctg aac caa aag etc acg 
Gly Ser Arg Phe Asn Glu He Phe Asp He Leu Asn Gin Lys Leu Thr 
465 470 475 480 

ttc agg gac get ggc age ctg att cag get caa acg teg eta cgt tta 
Phe Arg Asp Ala Gly Ser Leu He Gin Ala Gin Thr Ser Leu Arg Leu 
485 490 495 

acg gee gag gaa ggc ctg gee gee ate ctt teg cac cca teg ccc ccc 1536 
Thr Ala Glu Glu Gly Leu Ala Ala He Leu Ser His Pro Ser Pro Pro 
500 505 510 

ggt ctg gca cac gag ata atg aaa age caa ttc gga gtt tac gac gac 1584 
Gly Leu Ala His Glu He Met Lys Ser Gin Phe Gly Val Tyr Asp Asp 
515 520 525 

tac gac aga gtt tat ttt ttg gtc ctg gga tac ctg ccg gtc gec acg 1632 
Tyr Asp Arg Val Tyr Phe Leu Val Leu Gly Tyr Leu Pro Val Ala Thr 
530 535 540 

teg gta gtg tga 1644 

Ser Val Val 

545 



<210> 33 
<211> 547 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 33 

Met Arg Thr Ser Glu Lys Cys Cys Met Arg Tyr Pro Arg Lys Pro Ala 

15 10 15 

Arg Gin He Thr Ala Thr Phe Trp Ala Pro His Pro Asn Asn Val Leu 

20 25 30 

Phe He His Lys Pro Ser Leu He Glu Glu Arg Arg Asn Ala Phe Val 

35 40 45 

Met Arg Asn Gin Gin Leu Ala Leu Arg Val His Thr Leu Arg Lys Asn 

50 55 60 

Leu Leu Arg Leu Glu Leu Asp Asn Val Leu Gin Thr His Gin Arg Glu 
65 70 75 80 

Thr Glu Val Val Met Arg Asp Leu Glu Thr He Gin Asn Met Val Gly 

85 90 95 

Asp Leu Arg Ser Pro Gly Arg Glu Thr Ala Asn Ala Gin Thr Ser Leu 

100 105 HO 

Asn Pro Gin Pro Lys He Ala Pro Gin Thr His Gly Asp Ala Phe Val 

115 120 125 

Val Thr He Ala Pro Gly Asp Pro Gly Phe Thr Val Asri Gin Asp Leu 

130 135 140 

Arg Leu Glu Leu Leu Pro Ser Leu Tyr Met Asn Gin Asn Gin Trp Leu 
145 150 155 160 



91 
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Pro Gin Tyr Gly Pro Trp Tyr Ser Ser Leu Thr Asp Asn Ala Met Gin 

165 170 175 

Arg Arg Val Phe Pro Arg Asp Leu Arg Gly Thr Thr Asn Phe Gin Asn 

180 185 190 

Ser Thr Ser Leu Lys Leu Met Ser Ala Val lie Ser Thr Ala Ala Ser 

195 200 205 

lie Thr Gin Asp Phe Tyr Ala Asp Val Arg Asn Val Ser Asp Thr Gin 

210 215 220 

Ala Ala Leu Cys Leu Leu Asn Gly Tyr Tyr Cys His Arg Thr Gly Thr 
225 230 235 240 

Pro Leu Pro Pro Thr Arg Asn Gly Leu Trp Asp Asn Leu Gly Thr Lys 

245 250 255 

Leu Ala Thr Leu Val Ser His Leu Lys Gin Asn Thr Lys Gly Leu Gly 

260 265 270 

Phe Glu Phe Thr Tyr Ser Asn Pro Arg Gin Arg Ala Ser Leu Ala Pro 

275 280 285 

Leu Asn Lys Glu Thr Lys Tyr Asn Ala Asp Phe Phe Thr Asn His Val 

290 295 300 

lie Tyr Ala Thr Leu Ala Gin Ser Gly Leu Leu Pro Gly Ser Lys Asn 
305 310 315 320 

Pro Gly Thr Gly Gin Pro Pro Gly Pro Asp Leu Val Tyr lie Leu Ala 

325 330 335 

Thr Thr Leu Phe Ser Glu Asp Val Pro Pro Phe Gin Ala Tyr Gin Trp 

340 345 350 

Asn Leu Arg Ala Gly Leu Ser Ala Leu Gly Cys Leu Val Leu Val Tyr 

355 360 365 

Val Leu Leu Glu Leu Ala Gin He Thr Pro Arg Ser Pro His Arg Arg 

370 375 380 

Leu Asn Leu Ala Ser Leu Leu Gly Gly Arg Phe Ser Lys Val Glu Asp 
385 390 395 400 

Pro Ser Gly Ser Lys Gin Tyr Leu Lys Lys Gly Gin Leu Phe Asp Phe 

405 410 415 

Leu Thr Glu Asn Tyr He Ser Pro He Leu Ser Arg Ala Pro Asp Ala 

420 425 430 

Pro Thr Ser Phe Leu Phe Pro Gly Ala Tyr Leu Ala Ala Leu Glu Ala 

435 440 445 

Lys Ala He Ser His Leu Lys His Thr Arg Pro Phe Val Asn Leu Thr 

450 455 460 

Gly Ser Arg Phe Asn Glu He Phe Asp He Leu Asn Gin Lys Leu Thr 
465 470 475 480 

Phe Arg Asp Ala Gly Ser Leu He Gin Ala Gin Thr Ser Leu Arg Leu 

485 490 495 

Thr Ala Glu Glu Gly Leu Ala Ala He Leu Ser His Pro Ser Pro Pro 

500 505 510 

Gly Leu Ala His Glu He Met Lys Ser Gin Phe Gly Val Tyr Asp Asp 

515 520 525 

Tyr Asp Arg Val Tyr Phe Leu Val Leu Gly Tyr Leu Pro Val Ala Thr 

530 535 540 

Ser Val Val 
545 



<210> 34 
<211> 1053 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (1053) 
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<400> 34 

atg gcg ttt gca aat caa tgt aag cac gtc gcg acc etc gaa gec ctt 48 

Met Ala Phe Ala Asn Gin Cys Lys His Val Ala Thr Leu Glu Ala Leu 
15 10 15 

ccg gec tea cgc aaa egg get gga act cgc gee cac tta gca gta tac 96 
Pro Ala Ser Arg Lys Arg Ala Gly Thr Arg Ala His Leu Ala Val Tyr 
20 25 30 

cga egg tta ate aaa cac egg teg ctt gat gat att tta aag ttt ctt 144 
Arg Arg Leu lie Lys His Arg Ser Leu Asp Asp lie Leu Lys Phe Leu 
35 40 45 

tea att cgc cca aca etc agg gca acc aaa aac gtt aaa ttt agg att 192 
Ser lie Arg Pro Thr Leu Arg Ala Thr Lys Asn Val Lys Phe Arg lie 
50 55 60 

ttt ttc gag gtt tec eta ggt agg cga ata gca gat tgt gtg ctt acc 240 
Phe Phe Glu Val Ser Leu Gly Arg Arg lie Ala Asp Cys Val Leu Thr 
65 70 75 80 

gtt aat age gaa cat caa aaa acg tgc tac gta att gaa etc aag acg 288 
Val Asn Ser Glu His Gin Lys Thr Cys Tyr Val He Glu Leu Lys Thr 
85 90 95 

tgc eta age gcg get gta ttt ccc ggc aac gca ata aaa ata tea cag 336 
Cys Leu Ser Ala Ala Val Phe Pro Gly Asn Ala He Lys He Ser Gin 
100 105 110 

cga tgg cag ggc tta cac caa ctg aca gat tec gtg gca tat ata ggc 384 
Arg Trp Gin Gly Leu His Gin Leu Thr Asp Ser Val Ala Tyr He Gly 
115 120 125 

aga gee gcg cca aga gga cac gaa aac tgg teg gta agg ccc tgg ctg 432 
Arg Ala Ala Pro Arg Gly His Glu Asn Trp Ser Val Arg Pro Trp Leu 
130 135 140 

eta ttt aag aac cag aaa aca ctt aaa aca att cat act gag tea tea 480 
Leu Phe Lys Asn Gin Lys Thr Leu Lys Thr He His Thr Glu Ser Ser 
145 150 155 160 

gcg ttt ccg cct aca ttt ate aac acc acg teg gcg gcg ctt aac ggg 528 
Ala Phe Pro Pro Thr Phe He Asn Thr Thr Ser Ala Ala Leu Asn Gly 
165 170 175 

ttt ttt age caa tgg gaa gat gcg cac gtc aga aaa atg ctg tat gag 576 
Phe Phe Ser Gin Trp Glu Asp Ala His Val Arg Lys Met Leu Tyr Glu 
180 185 190 

ata ccc acg aaa acc age gcg gca aat tac cgc aac ttt ttg ggc ccc 624 
He Pro Thr Lys Thr Ser Ala Ala Asn Tyr Arg Asn Phe Leu Gly Pro 
195 200 205 

cca tec aaa caa cgt tct gtt tat tea caa acc ate tct gat aga aga 672 
Pro Ser Lys Gin Arg Ser Val Tyr Ser Gin Thr He Ser Asp Arg Arg 
210 215 220 

aag aag aaa cgc gtt tgt gat gcg aaa tea aca get ggc get aag ggt 720 
Lys Lys Lys Arg Val Cys Asp Ala Lys Ser Thr Ala Gly Ala Lys Gly 
225 230 235 240 
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tea cac get gcg aaa aaa cct get ccg get aga act aga caa cgt get 768 
Ser His Ala Ala Lys Lys Pro Ala Pro Ala Arg Thr Arg Gin Arg Ala 
245 250 255 

gca aac gca cca acg gga aac aga agt ggt cat gcg cga cct aga aac 816 
Ala Asn Ala Pro Thr Gly Asn Arg Ser Gly His Ala Arg Pro Arg Asn 
260 265 270 

aat tea aaa cat ggt egg gga tct gcg gtc ccc ggg cag gga aac cgc 864 
Asn Ser Lys His Gly Arg Gly Ser Ala Val Pro Gly Gin Gly Asn Arg 
275 280 285 

caa tgc cca aac ate act aaa ccc gca acc caa aat cgc ccc gca gac 912 
Gin Cys Pro Asn lie Thr Lys Pro Ala Thr Gin Asn Arg Pro Ala Asp 
290 295 300 

aca tgg aga cgc gtt cgt tgt cac aat age ccc egg aga ccc ggg att 960 
Thr Trp Arg Arg Val Arg Cys His Asn Ser Pro Arg Arg Pro Gly He 
305 310 315 320 

cac ggt aaa cca gga tct ccg tct gga get cct gec aag cct gta cat 1008 
His Gly Lys Pro Gly Ser Pro Ser Gly Ala Pro Ala Lys Pro Val His 
325 330 335 

gaa cca aaa cca atg get gec aca ata egg gec gtg gta cag tag 1053 
Glu Pro Lys Pro Met Ala Ala Thr He Arg Ala Val Val Gin 
340 345 350 



<210> 35 
<211> 350 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 35 

Met Ala Phe Ala Asn Gin Cys Lys His Val Ala Thr Leu Glu Ala Leu 

15 10 15 

Pro Ala Ser Arg Lys Arg Ala Gly Thr Arg Ala His Leu Ala Val Tyr 

20 25 30 

Arg Arg Leu lie Lys His Arg Ser Leu Asp Asp He Leu Lys Phe Leu 

35 40 45 

Ser He Arg Pro Thr Leu Arg Ala Thr Lys Asn Val Lys Phe Arg He 

50 55 60 

Phe Phe Glu Val Ser Leu Gly Arg Arg He Ala Asp Cys Val Leu Thr 
65 70 75 80 

Val Asn Ser Glu His Gin Lys Thr Cys Tyr Val He Glu Leu Lys Thr 

85 90 95 

Cys Leu Ser Ala Ala Val Phe Pro Gly Asn Ala He Lys He Ser Gin 

100 105 110 

Arg Trp Gin Gly Leu His Gin Leu Thr Asp Ser Val Ala Tyr He Gly 

115 120 125 

Arg Ala Ala Pro Arg Gly His Glu Asn Trp Ser Val Arg Pro Trp Leu 

130 135 140 

Leu Phe Lys Asn Gin Lys Thr Leu Lys Thr He His Thr Glu Ser Ser 
145 150 155 160 

Ala Phe Pro Pro Thr Phe lie Asn Thr Thr Ser Ala Ala Leu Asn Gly 

165 170 175 

Phe Phe Ser Gin Trp Glu Asp Ala His Val Arg Lys Met Leu Tyr Glu 

180 185 190 

He Pro Thr Lys Thr Ser Ala Ala Asn Tyr Arg Asn Phe Leu Gly Pro 
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195 200 205 

Pro Ser Lys Gin Arg Ser Val Tyr Ser Gin Thr lie Ser Asp Arg Arg 

210 215 220 

Lys Lys Lys Arg Val Cys Asp Ala Lys Ser Thr Ala Gly Ala Lys Gly 
225 230 235 240 

Ser His Ala Ala Lys Lys Pro Ala Pro Ala Arg Thr Arg Gin Arg Ala 

245 250 255 

Ala Asn Ala Pro Thr Gly Asn Arg Ser Gly His Ala Arg Pro Arg Asn 

260 265 270 

Asn Ser Lys His Gly Arg Gly Ser Ala Val Pro Gly Gin Gly Asn Arg 

275 280 285 

Gin Cys Pro Asn lie Thr Lys Pro Ala Thr Gin Asn Arg Pro Ala Asp 

290 295 300 

Thr Trp Arg Arg Val Arg Cys His Asn Ser Pro Arg Arg Pro Gly lie 
305 310 315 320 

His Gly Lys Pro Gly Ser Pro Ser Gly Ala Pro Ala Lys Pro Val His 

325 330 335 

Glu Pro Lys Pro Met Ala Ala Thr lie Arg Ala Val Val Gin 
340 345 350 



<210> 36 
<211> 1674 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (1674) 

<400> 36 

atg gcc gaa gga ggg tct ggc ttt ggc gac gag ctt gtc cgt cag atg 4 8 

Met Ala Glu Gly Gly Ser Gly Phe Gly Asp Glu Leu Val Arg Gin Met 

15 10 15 

egg gat egg aag cct aga tgg gac gag tct tec gac gat act gat gac 96 
Arg Asp Arg Lys Pro Arg Trp Asp Glu Ser Ser Asp Asp Thr Asp Asp 
20 25 30 

gtt gac acc gaa age acc gat ctt gag tat gat gat gtg ttc ccc gtg 144 
Val Asp Thr Glu Ser Thr Asp Leu Glu Tyr Asp Asp Val Phe Pro Val 
35 40 45 

gta gat acg cac ggc tta atg age cct gga agt caa aac tat gac gta 192 
Val Asp Thr His Gly Leu Met Ser Pro Gly Ser Gin Asn Tyr Asp Val 
50 55 60 

ccc acg tct ccg tec gga acg ccg tgg gaa tta ctg cac ccg gac gcc 240 
Pro Thr Ser Pro Ser Gly Thr Pro Trp Glu Leu Leu His Pro Asp Ala 
65 70 75 80 

ctg tat gca cat ccg agg tgc ccg cct aaa agg gcg gtg gtg ccg ggc 288 
Leu Tyr Ala His Pro Arg Cys Pro Pro Lys Arg Ala Val Val Pro Gly 
85 90 95 

ggt gg° 9 c 9 c g t ccc aa 9 gtg tcc g°g ttc tc 9 9 ct a 9 a ctt caa tat 336 

Gly Gly Ala Arg Pro Lys Val Ser Ala Phe Ser Ala Arg Leu Gin Tyr 
100 105 110 



gtt gga cga cag agt ttt ggg gat aga gag acg egg cag ctt acc ggg 384 
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Val Gly Arg Gin Ser Phe Gly Asp Arg Glu Thr Arg Gin Leu Thr Gly 

115 120 125 

get cag ttt tct tec gag age gag cac gaa tat gcg gaa ata ccg gag 432 

Ala Gin Phe Ser Ser Glu Ser Glu His Glu Tyr Ala Glu lie Pro Glu 

130 135 140 

cgt act act aca cgc ccg gtt gaa age ggt gac aag aga aat ttc ace 480 

Arg Thr Thr Thr Arg Pro Val Glu Ser Gly Asp Lys Arg Asn Phe Thr 

145 150 155 160 

tec ggg cgt agg ggt gca ate teg gga ccc teg teg aca aaa cct agt 52 8 

Ser Gly Arg Arg Gly Ala He Ser Gly Pro Ser Ser Thr Lys Pro Ser 
165 170 175 

cac ggt gcg ggg tta aca cgt aag act aaa acg teg ctg agt gtt agt 576 

His Gly Ala Gly Leu Thr Arg Lys Thr Lys Thr Ser Leu Ser Val Ser 
180 185 190 

ctt aaa aac ttg ctg egg ata aag gac gat gat get aag gtt gat gta 624 

Leu Lys Asn Leu Leu Arg He Lys Asp Asp Asp Ala Lys Val Asp Val 

195 200 205 

ccc agg ccg gtt ace gtg ccg gtt cat etc atg cag ccg cat cct atg 672 

Pro Arg Pro Val Thr Val Pro Val His Leu Met Gin Pro His Pro Met 

210 215 220 

acg gaa tat aga aat gec ttt tta ate tat eta gag ggg gtc atg ggt 720 

Thr Glu Tyr Arg Asn Ala Phe Leu He Tyr Leu Glu Gly Val Met Gly 

225 230 235 240 

gtt gga aaa aca acg eta ttg aat tea atg act ggc atg gtg ccg cag 

Val Gly Lys Thr Thr Leu Leu Asn Ser Met Thr Gly Met Val Pro Gin 
245 250 255 

gaa aac gtc tta age tgt ccc gag ccc atg aaa ttt tgg acg tgt gtt 

Glu Asn Val Leu Ser Cys Pro Glu Pro Met Lys Phe Trp Thr Cys Val 
260 265 270 

tat tea aat tgc ctt aaa gaa cag cgc age ata gtt aag caa ggc acc 864 

Tyr Ser Asn Cys Leu Lys Glu Gin Arg Ser He Val Lys Gin Gly Thr 

275 280 285 

cac ggg aaa ttg ate act tct get cgc gta tac gcg tgt cag age aag 912 

His Gly Lys Leu He Thr Ser Ala Arg Val Tyr Ala Cys Gin Ser Lys 

290 295 300 

ttt gcg eta ccg ttt cgt gcg aca gee gee ggc ate ggt cgc aac ctg 

Phe Ala Leu Pro Phe Arg Ala Thr Ala Ala Gly He Gly Arg Asn Leu 

305 310 315 . 320 

caa ccg tgg ctg gtg gga aac ggt age aca aag cca gcg aat tgg att 

Gin Pro Trp Leu Val Gly Asn Gly Ser Thr Lys Pro Ala Asn Trp He 
325 330 335 

gtt ttt gac aga cac eta ctg tec gca acg gtg gtt ttt ccg ctg gtt 

Val Phe Asp Arg His Leu Leu Ser Ala Thr Val Val Phe Pro Leu Val 
340 345 350 



768 



816 



960 



1008 



1056 



cac gta aag tac aac agg eta acg ccg gat cat ctg ttt caa ate eta 1104 
His Val Lys Tyr Asn Arg Leu Thr Pro Asp His Leu Phe Gin He Leu 
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355 360 365 

tct ctt ttc teg gca cac gac ggc gat gtg gtc gtt ttg eta acg etc 1152 

Ser Leu Phe Ser Ala His Asp Gly Asp Val Val Val Leu Leu Thr Leu 

370 375 380 

aac agt teg gag gcg cac agg cgc att caa agt cga ggc cgt aag gaa 1200 

Asn Ser Ser Glu Ala His Arg Arg lie Gin Ser Arg Gly Arg Lys Glu 

385 390 395 400 

gaa aaa gga ate acg caa aac tac ttg cga cag gta gcg tgg gcg tac 1248 

Glu Lys Gly lie Thr Gin Asn Tyr Leu Arg Gin Val Ala Trp Ala Tyr 

405 410 415 

cat gec gtg ttc tgt acg tgg gtg atg atg caa tat etc aca ccg gag 12 96 

His Ala Val Phe Cys Thr Trp Val Met Met Gin Tyr Leu Thr Pro Glu 

420 425 430 

caa atg gtt cag ctg tgt gta caa act gtg tec ata gag gac ata tgc 1344 

Gin Met Val Gin Leu Cys Val Gin Thr Val Ser lie Glu Asp lie Cys 

435 440 445 

aat atg aat tec aga ttg act cat egg ttt eta acc eta aca aaa eta 1392 

Asn Met Asn Ser Arg Leu Thr His Arg Phe Leu Thr Leu Thr Lys Leu 

450 455 460 

cat gaa cag agt atg ata ccc atg gtg gca gaa atg tta gta teg gtt 1440 

His Glu Gin Ser Met He Pro Met Val Ala Glu Met Leu Val Ser Val 

465 470 475 480 

aaa gaa cac gtg acc tta atg gag gtc tgt ttg gga etc ttt aaa gag 1488 

Lys Glu His Val Thr Leu Met Glu Val Cys Leu Gly Leu Phe Lys Glu 

485 490 495 

eta cga aag ctt caa att tta att gtt gac gca gga gaa cat eta gat 1536 

Leu Arg Lys Leu Gin He Leu He Val Asp Ala Gly Glu His Leu Asp 

500 505 510 

gat gcg tgt ggc etc tgg gga aat att tat ggg cag gtg atg tea aat 1584 

Asp Ala Cys Gly Leu Trp Gly Asn He Tyr Gly Gin Val Met Ser Asn 

515 520 525 

gag get att aaa cca cga gca gtg aac tgg cca get ctt gaa age tac 1632 

Glu Ala He Lys Pro Arg Ala Val Asn Trp Pro Ala Leu Glu Ser Tyr 

530 535 540 

att caa acg eta acc aaa ttg gaa ggc aat ggc gcg tat tag 1674 

He Gin Thr Leu Thr Lys Leu Glu Gly Asn Gly Ala Tyr 

545 550 555 



<210> 37 
<211> 557 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 37 

Met Ala Glu Gly Gly Ser Gly Phe Gly Asp Glu Leu Val Arg Gin Met 

15 10 15 

Arg Asp Arg Lys Pro Arg Trp Asp Glu Ser Ser Asp Asp Thr Asp Asp 
20 25 30 
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Val Asp Thr Glu Ser Thr Asp Leu Glu Tyr Asp Asp Val Phe Pro Val 

35 40 45 

Val Asp Thr His Gly Leu Met Ser Pro Gly Ser Gin Asn Tyr Asp Val 

50 55 60 

Pro Thr Ser Pro Ser Gly Thr Pro Trp Glu Leu Leu His Pro Asp Ala 
65 70 75 80 

Leu Tyr Ala His Pro Arg Cys Pro Pro Lys Arg Ala Val Val Pro Gly 

85 90 95 

Gly Gly Ala Arg Pro Lys Val Ser Ala Phe Ser Ala Arg Leu Gin Tyr 

100 105 HO 

Val Gly Arg Gin Ser Phe Gly Asp Arg Glu Thr Arg Gin Leu Thr Gly 

115 120 125 

Ala Gin Phe Ser Ser Glu Ser Glu His Glu Tyr Ala Glu lie Pro Glu 

130 135 140 

Arg Thr Thr Thr Arg Pro Val Glu Ser Gly Asp Lys Arg Asn Phe Thr 
145 150 155 160 

Ser Gly Arg Arg Gly Ala He Ser Gly Pro Ser Ser Thr Lys Pro Ser 

165 170 175 

His Gly Ala Gly Leu Thr Arg Lys Thr Lys Thr Ser Leu Ser Val Ser 

180 185 190 

Leu Lys Asn Leu Leu Arg He Lys Asp Asp Asp Ala Lys Val Asp Val 

195 200 205 

Pro Arg Pro Val Thr Val Pro Val His Leu Met Gin Pro His Pro Met 

210 215 220 

Thr Glu Tyr Arg Asn Ala Phe Leu He Tyr Leu. Glu Gly Val Met Gly 
225 230 235 240 

Val Gly Lys Thr Thr Leu Leu Asn Ser Met Thr Gly Met Val Pro Gin 

245 250 255 

Glu Asn Val Leu Ser Cys Pro Glu Pro Met Lys Phe Trp Thr Cys Val 

260 265 270 

Tyr Ser Asn Cys Leu Lys Glu Gin Arg Ser He Val Lys Gin Gly Th 

275 280 285 

His Gly Lys Leu He Thr Ser Ala Arg Val Tyr Ala Cys Gin Ser Lys 

290 295 300 

Phe Ala Leu Pro Phe Arg Ala Thr Ala Ala Gly He Gly Arg Asn Leu 
305 310 315 320 

Gin Pro Trp Leu Val Gly Asn Gly Ser Thr Lys Pro Ala Asn Trp He 

325 330 335 

Val Phe Asp Arg His Leu Leu Ser Ala Thr Val Val Phe Pro Leu Val 

340 345 350 

His Val Lys Tyr Asn Arg Leu Thr Pro Asp His Leu Phe Gin He Leu 

355 360 365 

Ser Leu Phe Ser Ala His Asp Gly Asp Val Val Val Leu Leu Thr Leu 

370 375 380 

Asn Ser Ser Glu Ala His Arg Arg He Gin Ser Arg Gly Arg Lys Glu 
385 390 395 400 

Glu Lys Gly He Thr Gin Asn Tyr Leu Arg Gin Val Ala Trp Ala Tyr 

405 410 415 

His Ala Val Phe Cys Thr Trp Val Met Met Gin Tyr Leu Thr Pro Glu 

420 425 430 

Gin Met Val Gin Leu Cys Val Gin Thr Val Ser He Glu Asp He Cys 

435 440 445 

Asn Met Asn Ser Arg Leu Thr His Arg Phe Leu Thr Leu Thr Lys Leu 

450 455 460 

His Glu Gin Ser Met He Pro Met Val Ala Glu Met Leu Val Ser Val 
465 470 475 480 

Lys Glu His Val Thr Leu Met Glu Val Cys Leu Gly Leu Phe Lys Glu 

485 490 495 

Leu Arg Lys Leu Gin He Leu He Val Asp Ala Gly Glu His Leu Asp 

500 505 510 

Asp Ala Cys Gly Leu Trp Gly Asn He Tyr Gly Gin Val Met Ser Asn 
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515 520 525 

Glu Ala He Lys Pro Arg Ala Val Asn Trp Pro Ala Leu Glu Ser Tyr 

530 535 540 

He Gin Thr Leu Thr Lys Leu Glu Gly Asn Gly Ala Tyr 
545 550 555 



<210> 38 
<211> 2115 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) ■ - (2115) 

<400> 38 

atg gcg cgt att agt ttt ata ttt ttt ttt act ata ata agg tgt tct 48 
Met Ala Arg He Ser Phe He Phe Phe Phe Thr He lie Arg Cys Ser 



1 



5 10 15 



gta acg gac aaa tac gtg tat gat gaa aaa tea aat gta gag ttg gaa 
Val Thr Asp Lys Tyr Val Tyr Asp Glu Lys Ser Asn Val Glu Leu Glu 
20 25 30 



gaa 



cca ttg tct gtg act ttg gaa aaa aga aaa agt ttg ctg cga tea 
Glu Pro Leu Ser Val Thr Leu Glu Lys Arg Lys Ser Leu Leu Arg Ser 
65 70 75 80 



cca ggt ttt act ggg aat ttt aat gta atg aca cat get tta aca caa 
Pro Gly Phe Thr Gly Asn Phe Asn Val Met Thr His Ala Leu Thr Gin 
115 120 125 

ggc gtt tta tta acg aaa cgt gaa tta ttt acc aac tea aca aac att 
Gly Val Leu Leu Thr Lys Arg Glu Leu Phe Thr Asn Ser Thr Asn He 
130 135 140 

atg gat ctt ttt tac gcg gaa aaa ata aat gca gaa atg ttc aag att 
Met Asp Leu Phe Tyr Ala Glu Lys He Asn Ala Glu Met Phe Lys He 
145 150 155 160 

act ttt gat tat tec aat gta ata att tec ggc att ata aca gaa aat 
Thr Phe Asp Tyr Ser Asn Val He He Ser Gly He He Thr Glu Asn 
165 170 175 
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ttt aat gga aca att tac caa att aat tgg aga aac gta age aaa gaa 144 

Phe Asn Gly Thr He Tyr Gin He Asn Trp Arg Asn Val Ser Lys Glu 
35 40 45 

tta aca tct att gta atg gaa gat gcg tgg tac gat tct tta ctg ctg 192 

Leu Thr Ser He Val Met Glu Asp Ala Trp Tyr Asp Ser Leu Leu Leu 
50 55 60 



240 



288 



agt ata gtt aat gtg cat aat aat gac tat aca ttt tgt aaa tct tea 

Ser He Val Asn Val His Asn Asn Asp Tyr Thr Phe Cys Lys Ser Ser 
85 90 95 

agt gac cat gta att aac ctt acg gtg gat ttt aat tac agt tct ttg 336 

Ser Asp His Val He Asn Leu Thr Val Asp Phe Asn Tyr Ser Ser Leu 

100 105 HO 



384 



432 



480 



528 
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tgg ata tta gta tea gtt aca aat tea agt gta aaa tea aat atg caa 576 

Trp lie Leu Val'Ser Val Thr Asn Ser Ser Val Lys Ser Asn Met Gin 

180 185 190 

tgt gtt gca ctg ttg ttt ggt gtt cct age aca ttt cca gcg tta aaa 624 

Cys Val Ala Leu Leu Phe Gly Val Pro Ser Thr Phe Pro Ala Leu Lys 

195 200 205 

ggc tat gta tct tat cgt gat tta ttg gtc gtt aaa aat tea aac tat 672 

Gly Tyr Val Ser Tyr Arg Asp Leu Leu Val Val Lys Asn Ser Asn Tyr 

210 215 220 

get ctg ggt gta att gcg ccg aaa tct tat aat act ctt gat ttg gca 720 

Ala Leu Gly Val He Ala Pro Lys Ser Tyr Asn Thr Leu Asp Leu Ala 
225 230 235 240 

ttt tta cca aaa aat ttt aca gaa atg ttt gtt tea gtt ata gat agt 768 

Phe Leu Pro Lys Asn Phe Thr Glu Met Phe Val Ser Val He Asp Ser 

245 250 255 

ccg ctt aat gec att gat tat etc aaa gga aaa ctt ttg get att gaa 816 

Pro Leu Asn Ala He Asp Tyr Leu Lys Gly Lys Leu Leu Ala lie Glu 

260 265 270 

gca aag ggt gee tgt caa aat cct tea aat gaa aat gat ate tta tct 864 

Ala Lys Gly Ala Cys Gin Asn Pro Ser Asn Glu Asn Asp He Leu Ser 

275 280 285 

ttc ttt ttt gaa gtt act gca gta aat ttt tta ttt ata aaa aat ctt 912 

Phe Phe Phe Glu Val Thr Ala Val Asn Phe Leu Phe He Lys Asn Leu 

290 295 300 

caa aag cag caa tta gtt aac gtt ggg tgt gtt gta egg cat gtt gca 960 

Gin Lys Gin Gin Leu Val Asn Val Gly Cys Val Val Arg His Val Ala 
305 310 315 320 

gcg tta gaa agt tta atg cat tta tta agg etc tgc tac cca acg ttt 1008 

Ala Leu Glu Ser Leu Met His Leu Leu Arg Leu Cys Tyr Pro Thr Phe 

325 330 335 

aag tta tat gag eta aat ttg gaa acg ttg teg cat att gca gag tec 1056 

Lys Leu Tyr Glu Leu Asn Leu Glu Thr Leu Ser His He Ala Glu Ser 

340 345 350 

cag gtt ttt aac ttg ccg gca aat age atg tta teg tta tea gta aat 1104 

Gin Val Phe Asn Leu Pro Ala Asn Ser Met Leu Ser Leu Ser Val Asn 

355 360 365 

gat caa gag gtc gta ttt tea atg ttt aaa att gta tat aat aca cct 1152 

Asp Gin Glu Val Val Phe Ser Met Phe Lys He Val Tyr Asn Thr Pro 

370 375 380 

aag gtt ggt ggt aaa att eta aac gaa att gta tat att act aat tac 1200 

Lys Val Gly Gly Lys He Leu Asn Glu He Val Tyr He Thr Asn Tyr 
385 390 395 400 

atg tac ace aaa tac agt gaa aat tat cag eta ace aac aca ttt agg 1248 

Met Tyr Thr Lys Tyr Ser Glu Asn Tyr Gin Leu Thr Asn Thr Phe Arg 

405 410 415 



100 



egg aat gta 
Arg Asn Val 



gtc aca gat 
Val Thr Asp 
435 

tgt aat aat 
Cys Asn Asn 
450 

gat gac ate 
Asp Asp He 
465 

agg ttt gat 
Arg Phe Asp 



tea aaa aga 
Ser Lys Arg 



egg ctg ttg 
Arg Leu Leu 
515 

ata aat tgt 
He Asn Cys 
530 

ctt acg aac 
Leu Thr Asn 
545 

cgc gtt tac 
Arg Val Tyr 



teg gec gtt 
Ser Ala Val 



eta cac att 
Leu His He 
595 

ccg ctg tgt 
Pro Leu Cys 
610 

cag acc atg 
Gin Thr Met 
625 

tec ccg tat 
Ser Pro Tyr 



ttg tgg tta 




atg aac atg 
Met Asn Met 
420 

tct tct gtg 
Ser Ser Val 



gta gag ata 
Val Glu He 



acc ata ttt 
Thr He Phe 
470 

tta gat gaa 
Leu Asp Glu 
485 

acc ggc teg 
Thr Gly Ser 
500 

cac get ttt 
His Ala Phe 



acc agg ctg 
Thr Arg Leu 



ate acg tac 
He Thr Tyr 
550 

gag gtt teg 
Glu Val Ser 
565 

tat cca aat 
Tyr Pro Asn 
580 

ccg ata ttg 
Pro He Leu 



gat teg att 
Asp Ser He 



atg tac gtg 
Met Tyr Val 
630 

tct cct ttc 
Ser Pro Phe 
645 

atg aac aac 



tat gaa gtt 
Tyr Glu Val 
425 

ttt tat ccg 
Phe Tyr Pro 
440 

tea tat atg 
Ser Tyr Met 

455 

cgc gtg ttt 
Arg Val Phe 



aac aag eta 
Asn Lys Leu 



gaa ctg gca 
Glu Leu Ala 
505 

cac gec acg 
His Ala Thr 
520 

gca tgg aag 
Ala Trp Lys 
535 

gtc ata age 
Val He Ser 



gag gtg ttc 
Glu Val Phe 



tgt tct cat 
Cys Ser His 
585 

tat aac ttt 
Tyr Asn Phe 
600 

gtt ttg age 
Val Leu Ser 
615 

tea aac ccc 
Ser Asn Pro 



ttt gac aac 
Phe Asp Asn 



ggt acc gtt 



tta aca aca 
Leu Thr Thr 



tat ata ttg 
Tyr He Leu 



att aat caa 
He Asn Gin 
460 

teg cct tgt 
Ser Pro Cys 
475 

cga tec gac 
Arg Ser Asp 
490 

caa gga gcg 
Gin Gly Ala 



cgt ata aac 
Arg He Asn 



caa gta acc 
Gin Val Thr 
540 

tec gtg cgc 
Ser Val Arg 
555 

ctt aac age 
Leu Asn Ser 
570 

ttt aca ccg 
Phe Thr Pro 



tec gec ccg 
Ser Ala Pro 



tat gat gag 
Tyr Asp Glu 
620 

aca gtt caa 
Thr Val Gin 
635 

gat aat ttt 
Asp Asn Phe 
650 

gtg gaa att 
101 




att aaa tta 
He Lys Leu 
430 

ttc aca tec 
Phe Thr Ser 
445 

ata gca aag 
He Ala Lys 



ttt tta age 
Phe Leu Ser 



gca ccg cag 
Ala Pro Gin 
495 

tct ggt ttt 
Ser Gly Phe 
510 

gag ttt tea 

Glu Phe Ser 
525 

gcg ctg atg 

Ala Leu Met 



cct gat cac 
Pro Asp His 



gcg atg ttt 
Ala Met Phe 
575 

ccc ggt acc 
Pro Gly Thr 
590 

agg ata gga 
Arg He Gly 
605 

aac cag gga 
Asn Gin Gly 



gee aac ctg 
Ala Asn Leu 



cat ate cat 
His He His 
655 

egg gga ttg 



aac 1296 
Asn 



atg 1344 
Met 



ccc 1392 
Pro 



tta 1440 

Leu 

480 

acg 1488 
Thr 



tgg 153 6 
Trp 



gtc 1584 
Val 



ccg 1632 
Pro 



get 1680 

Ala 

560 

gtc 1728 
Val 



gec 1776 
Ala 



tgt 1824 
Cys 



eta 1872 
Leu 



ttt 1920 

Phe 

640 

tat 1968 
Tyr 



tac 2016 



Leu Trp Leu Met Asn Asn Gly Thr Val Val Glu lie Arg Gly Leu Tyr 
660 665 670 

aga aga cac gca etc agt get att gca tta gta ttt get ttt att gga 2064 
Arg Arg His Ala Leu Ser Ala lie Ala Leu Val Phe Ala Phe lie Gly 
675 680 685 

aca atg tct get ctg tat ttt ctg ttt aag ctg ttt tec ate ctg gee 2112 
Thr Met Ser Ala Leu Tyr Phe Leu Phe Lys Leu Phe Ser He Leu Ala 
690 695 700 

taa 2115 

705 



<210> 39 
<211> 704 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 



<400> 39 
Met Ala Arg He 
1 

Val Thr Asp Lys 
20 

Phe Asn Gly Thr 
35 

Leu Thr Ser He 
50 

Glu Pro Leu Ser 
65 

Ser He Val Asn 

Ser Asp His Val 
100 

Pro Gly Phe Thr 
115 

Gly Val Leu Leu 
130 

Met Asp Leu Phe 
145 

Thr Phe Asp Tyr 

Trp He Leu Val 
180 

Cys Val Ala Leu 
195 

Gly Tyr Val Ser 
210 

Ala Leu Gly Val 
225 

Phe Leu Pro Lys 

Pro Leu Asn Ala 
260 

Ala Lys Gly Ala 
275 

Phe Phe Phe Glu 
290 

Gin Lys Gin Gin 



Ser Phe He Phe 
5 

Tyr Val Tyr Asp 

He Tyr Gin He 
40 

Val Met Glu Asp 
55 

Val Thr Leu Glu 
70 

Val His Asn Asn 
85 

He Asn Leu Thr 

Gly Asn Phe Asn 
120 

Thr Lys Arg Glu 
135 

Tyr Ala Glu Lys 
150 

Ser Asn Val He 
165 

Ser Val Thr Asn 

Leu Phe Gly Val 
200 

Tyr Arg Asp Leu 
215 

He Ala Pro Lys 
230 

Asn Phe Thr Glu 
245 

He Asp Tyr Leu 

Cys Gin Asn Pro 
280 

Val Thr Ala Val 
295 

Leu Val Asn Val 



Phe Phe Thr He 
10 

Glu Lys Ser Asn 
25 

Asn Trp Arg Asn 

Ala Trp Tyr Asp 
60 

Lys Arg Lys Ser 
75 

Asp Tyr Thr Phe 
90 

Val Asp Phe Asn 
105 

Val Met Thr His 

Leu Phe Thr Asn 
140 

He Asn Ala Glu 
155 

He Ser Gly He 
170 

Ser Ser Val Lys 
185 

Pro Ser Thr Phe 

Leu Val Val Lys 
220 

Ser Tyr Asn Thr 
235 

Met Phe Val Ser 
250 

Lys Gly Lys Leu 
265 

Ser Asn Glu Asn 

Asn Phe Leu Phe 
300 

Gly Cys Val Val 



He Arg Cys Ser 
15 

Val Glu Leu Glu 
30 

Val Ser Lys Glu 
45 

Ser Leu Leu Leu 

Leu Leu Arg Ser 
80 

Cys Lys Ser Ser 
95 

Tyr Ser Ser Leu 
110 

Ala Leu Thr Gin 
125 

Ser Thr Asn He 

Met Phe Lys He 
160 

He Thr Glu Asn 
175 

Ser Asn Met Gin 
190 

Pro Ala Leu Lys 
205 

Asn Ser Asn Tyr 

Leu Asp Leu Ala 
240 

Val He Asp Ser 
255 

Leu Ala He Glu 
270 

Asp He Leu Ser 
285- 

Ile Lys Asn Leu 
Arg His Val Ala 



102 



305 310 315 320 

Ala Leu Glu Ser Leu Met His Leu Leu Arg Leu Cys Tyr Pro Thr Phe 

325 330 335 

Lys Leu Tyr Glu Leu Asn Leu Glu Thr Leu Ser His He Ala Glu Ser 

340 345 350 

Gin Val Phe Asn Leu Pro Ala Asn Ser Met Leu Ser Leu Ser Val Asn 

355 360 365 

Asp Gin Glu Val Val Phe Ser Met Phe Lys He Val Tyr Asn Thr Pro 

370 375 380 

Lys Val Gly Gly Lys He Leu Asn Glu He Val Tyr He Thr Asn Tyr 
385 390 395 400 

Met Tyr Thr Lys Tyr Ser Glu Asn Tyr Gin Leu Thr Asn Thr Phe Arg 

405 410 415 

Arg Asn Val Met Asn Met Tyr Glu Val Leu Thr Thr He Lys Leu Asn 

420 425 430 

Val Thr Asp Ser Ser Val Phe Tyr Pro Tyr He Leu Phe Thr Ser Met 

435 440 445 

Cys Asn Asn Val Glu He Ser Tyr Met He Asn Gin He Ala Lys Pro 

450 455 460 

Asp Asp He Thr He Phe Arg Val Phe Ser Pro Cys Phe Leu Ser Leu 
465 470 475 480 

Arg Phe Asp Leu Asp Glu Asn Lys Leu Arg Ser Asp Ala Pro Gin Thr 

485 490 495 

Ser Lys Arg Thr Gly Ser Glu Leu Ala Gin Gly Ala Ser Gly Phe Trp 

500 505 510 

Arg Leu Leu His Ala Phe His Ala Thr Arg He Asn Glu Phe Ser Val 

515 520 525 

He Asn Cys Thr Arg Leu Ala Trp Lys Gin Val Thr Ala Leu Met Pro 

530 535 540 

Leu Thr Asn He Thr Tyr Val He Ser Ser Val Arg Pro Asp His Ala 
545 550 555 560 

Arg Val Tyr Glu Val Ser Glu Val Phe Leu Asn Ser Ala Met Phe Val 

565 570 575 

Ser Ala Val Tyr Pro Asn Cys Ser His Phe Thr Pro Pro Gly Thr Ala 

580 585 590 

Leu His He Pro He Leu Tyr Asn Phe Ser Ala Pro Arg He Gly Cys 

595 600 605 

Pro Leu Cys Asp Ser He Val Leu Ser Tyr Asp Glu Asn Gin Gly Leu 

610 615 620^ 

Gin Thr Met Met Tyr Val Ser Asn Pro Thr Val Gin Ala Asn Leu Phe 
625 630 635 640 

Ser Pro Tyr Ser Pro Phe Phe Asp Asn Asp Asn Phe His He His Tyr 

645 650 655 

Leu Trp Leu Met Asn Asn Gly Thr Val Val Glu He Arg Gly Leu Tyr 

660 665 670 

Arg Arg His Ala Leu Ser Ala He Ala Leu Val Phe Ala Phe He Gly 

675 680 685 

Thr Met Ser Ala Leu Tyr Phe Leu Phe Lys Leu Phe Ser He Leu Ala 
690 695 700 



<210> 40 
<211> 1209 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (1209) 
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<400> 40 

atg ata aaa ate tea gat tta aag get cgc tta gtt ggg ggg gcg gtg 

Met lie Lys lie Ser Asp Leu Lys Ala Arg Leu Val Gly Gly Ala Val 

15 10 15 

cag eta tec aac ggc gag tac gtc tgt cat gtc gtg tac teg teg gca 

Gin Leu Ser Asn Gly Glu Tyr Val Cys His Val Val Tyr Ser Ser Ala 

20 25 30 

ctt gec gca atg gtc gga ctt ccg gga ccg gee gta ccc ctt ccc ctg 

Leu Ala Ala Met Val Gly Leu Pro Gly Pro Ala Val Pro Leu Pro Leu 

35 40 45 

ttg ttt aaa aaa ttc age acc ate tat tea aac atg atg cca ttg tac 

Leu Phe Lys Lys Phe Ser Thr lie Tyr Ser Asn Met Met Pro Leu Tyr 

50 55 60 

gee cca aaa cgt ccg gaa tta tea atg ctg cga att atg gtg tea ccc 

Ala Pro Lys Arg Pro Glu Leu Ser Met Leu Arg lie Met Val Ser Pro 

65 70 75 80 

cat ccc tac gcg eta aac age tgc ttg tgc gtg gga acg gac gaa ggc 

His Pro Tyr Ala Leu Asn Ser Cys Leu Cys Val Gly Thr Asp Glu Gly 

85 90 95 

gag aga ggc gta tea ttg ttc aga gac cca gta att aga agt tct gac 

Glu Arg Gly Val Ser Leu Phe Arg Asp Pro Val lie Arg Ser Ser Asp 

100 105 110 

ttt gag gac aca cca att acc gta aac tea aaa tta gtg att gcg agt 

Phe Glu Asp Thr Pro He Thr Val Asn Ser Lys Leu Val He Ala Ser 

115 120 125 

aac tct ctt ttt ttg cac tgc agg cca ttt tec gtc ccg gcg acc gtc 

Asn Ser Leu Phe Leu His Cys Arg Pro Phe Ser Val Pro Ala Thr Val 

130 135 140 

aag acc cca ccg gta act ctt aca aat aac aaa cag ata acc ata aac 

Lys Thr Pro Pro Val Thr Leu Thr Asn Asn Lys Gin He Thr He Asn 

145 150 155 160 

gag ctg gca aac acc acg caa gaa tac gat cca aat gcg cca ccg acc 

Glu Leu Ala Asn Thr Thr Gin Glu Tyr Asp Pro Asn Ala Pro Pro Thr 

165 170 175 

tta tgt age gcg ctt ccg ccg gat aat aaa aaa ctt cga tec ate etc 

Leu Cys Ser Ala Leu Pro Pro Asp Asn Lys Lys Leu Arg Ser He Leu 

180 185 190 

aag caa ccg cca gcg acc age gag tea aat gta cag teg gac tgt tta 

Lys Gin Pro Pro Ala Thr Ser Glu Ser Asn Val Gin Ser Asp Cys Leu 

195 200 205 

ctg gca gac ata ttt ttt gec atg ggt teg cgc cag cca caa ata ggc 

Leu Ala Asp He Phe Phe Ala Met Gly Ser Arg Gin Pro Gin He Gly 

210 215 220 

gag age ccg att acc gcg ttt aac acc gtg aca att atg cag cgc gec 

Glu Ser Pro He Thr Ala Phe Asn Thr Val Thr He Met Gin Arg Ala 

225 230 235 240 
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aac aac tct ata atg ttt tta cct aat tta aaa eta aag cct ate caa 768 

Asn Asn Ser He Met Phe Leu Pro Asn Leu Lys Leu Lys Pro lie Gin 
245 250 255 

cac ctt ttt tta aag cac gtc eta ttg cag cgt ttg ggt ttg gaa aat 816 

His Leu Phe Leu Lys His Val Leu Leu Gin Arg Leu Gly Leu Glu Asn 
260 265 270 

att ctg ttt cac ttt aaa atg eta tac gcg aat acg tgt aag gcg gcg 864 

He Leu Phe His Phe Lys Met Leu Tyr Ala Asn Thr Cys Lys Ala Ala 
275 280 285 

ggc ccg tat cag aga gaa tac ttc gaa tec atg etc teg cgc gta aaa 912 

Gly Pro Tyr Gin Arg Glu Tyr Phe Glu Ser Met Leu Ser Arg Val Lys 
290 295 300 

cag cgc ctg gaa gac atg gtg ttt tgt tta aac tec ata gag tec cac 960 

Gin Arg Leu Glu Asp Met Val Phe Cys Leu Asn Ser He Glu Ser His 
305 310 315 320 

gac ttt cag aag gat ttt cga gtt ttg teg egg get ccg cag cga tta 1008 

Asp Phe Gin Lys Asp Phe Arg Val Leu Ser Arg Ala Pro Gin Arg Leu 
325 330 335 

tta acc gcg ace gac aaa tat ttt tta atg ttt cct ccc caa aat cga 1056 

Leu Thr Ala Thr Asp Lys Tyr Phe Leu Met Phe Pro Pro Gin Asn Arg 
340 345 350 

gag ctg gec att caa gtt ggc gca gag gtg att gaa tea ata tgt gac 1104 

Glu Leu Ala He Gin Val Gly Ala Glu Val He Glu Ser He Cys Asp 
355 360 365 

ggc acg cct eta tec gag gtt etc get aat etc agt ccc cgc gta acc 1152 

Gly Thr Pro Leu Ser Glu Val Leu Ala Asn Leu Ser Pro Arg Val Thr 
370 375 380 

ate caa aaa gaa acg gga aac aac ctt tta aag ttt tac get tta tta 1200 

He Gin Lys Glu Thr Gly Asn Asn Leu Leu Lys Phe Tyr Ala Leu Leu 
385 390 395 400 

aca gtt tag 1209 
Thr Val 



<210> 41 
<211> 402 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 41 

Met He Lys He Ser Asp Leu Lys Ala Arg Leu Val Gly Gly Ala Val 

1 5 10 15 

Gin Leu Ser Asn Gly Glu Tyr Val Cys His Val Val Tyr Ser Ser Ala 

20 25 30 

Leu Ala Ala Met Val Gly Leu Pro Gly Pro Ala Val Pro Leu Pro Leu 

35 40 45 

Leu Phe Lys Lys Phe Ser Thr He Tyr Ser Asn Met Met Pro Leu Tyr 

50 55 60 

Ala Pro Lys Arg Pro Glu Leu Ser Met Leu Arg He Met Val Ser Pro 
65 70 75 80 

His Pro Tyr Ala Leu Asn Ser Cys Leu Cys Val Gly Thr Asp Glu Gly 

105 



85 90 95 

Glu Arg Gly Val Ser Leu Phe Arg Asp Pro Val He Arg Ser Ser Asp 

100 105 HO 

Phe Glu Asp Thr Pro He Thr Val Asn Ser Lys Leu Val He Ala Ser 

115 120 125 

Asn Ser Leu Phe Leu His Cys Arg Pro Phe Ser Val Pro Ala Thr Val 

130 135 140 

Lys Thr Pro Pro Val Thr Leu Thr Asn Asn Lys Gin He Thr He Asn 
145 150 155 160 

Glu Leu Ala Asn Thr Thr Gin Glu Tyr Asp Pro Asn Ala Pro Pro Thr 

165 170 175 

Leu Cys Ser Ala Leu Pro Pro Asp Asn Lys Lys Leu Arg Ser He Leu 

180 185 190 

Lys Gin Pro Pro Ala Thr Ser Glu Ser Asn Val Gin Ser Asp Cys Leu 

195 200 205 

Leu Ala Asp He Phe Phe Ala Met Gly Ser Arg Gin Pro Gin He Gly 

210 215 220 

Glu Ser Pro He Thr Ala Phe Asn Thr Val Thr He Met Gin Arg Ala 
225 230 235 240 

Asn Asn Ser He Met Phe Leu Pro Asn Leu Lys Leu Lys Pro He Gin 

245 250 255 

His Leu Phe Leu Lys His Val Leu Leu Gin Arg Leu Gly Leu Glu Asn 

260 265 270 

lie Leu Phe His Phe Lys Met Leu Tyr Ala Asn Thr Cys Lys Ala Ala 

275 280 285 

Gly Pro Tyr Gin Arg Glu Tyr Phe Glu Ser Met Leu Ser Arg Val Lys 

290 295 300 

Gin Arg Leu Glu Asp Met Val Phe Cys Leu Asn Ser He Glu Ser His 
305 310 315 320 

Asp Phe Gin Lys Asp Phe Arg Val Leu Ser Arg Ala Pro Gin Arg Leu 

325 330 335 

Leu Thr Ala Thr Asp Lys Tyr Phe Leu Met Phe Pro Pro Gin Asn Arg 

340 345 350 

Glu Leu Ala He Gin Val Gly Ala Glu Val He Glu Ser He Cys Asp 

355 360 365 

Gly Thr Pro Leu Ser Glu Val Leu Ala Asn Leu Ser Pro Arg Val Thr 

370 375 380 

He Gin Lys Glu Thr Gly Asn Asn Leu Leu Lys Phe Tyr Ala Leu Leu 
385 390 395 400 

Thr Val 



<210> 42 
<211> 2199 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) - . (2199) 

<400> 42 

atg ctg ttg cag ggc cct gtt ctt ctg ccc gcg tgt cct get acg gtg 

Met Leu Leu Gin Gly Pro Val Leu Leu Pro Ala Cys Pro Ala Thr Val 
15 10 15 

gca gcg aac gec ccg tea ccg gec aat tec gat ttt aaa act caa ctg 
Ala Ala Asn Ala Pro Ser Pro Ala Asn Ser Asp Phe Lys Thr Gin Leu 
20 25 30 



48 



96 



106 



gca att ttt tgc tgc ctg gcc acc aat aat gag att ctt gaa aac gtc 144 

Ala lie Phe Cys Cys Leu Ala Thr Asn Asn Glu He Leu Glu Asn Val 

35 40 45 

teg eta gag gcc ttg gac agg get atg caa aca gag acc acc ttt tac 192 

Ser Leu Glu Ala Leu Asp Arg Ala Met Gin Thr Glu Thr Thr Phe Tyr 
50 55 60 

gcg tgc cgt gcg etc egg cgt ctg gta ttg gga gaa gga eta tac cct 240 

Ala Cys Arg Ala Leu Arg Arg Leu Val Leu Gly Glu Gly Leu Tyr Pro 

65 70 75 80 

ttt ata cac cgc caa ggt ggt ate gtt ggc aag acc ggt aac gag tac 288 

Phe He His Arg Gin Gly Gly He Val Gly Lys Thr Gly Asn Glu Tyr 

85 90 95 

gcg gga ccg ggt tta ata att gat gat get ate gga tgt aca ttt tec 336 

Ala Gly Pro Gly Leu He He Asp Asp Ala He Gly Cys Thr Phe Ser 

100 105 110 

cat ate gag acg cat acg ttt tta ccc acc gtt ttc acc tac gaa ctg 384 

His He Glu Thr His Thr Phe Leu' Pro Thr Val Phe Thr Tyr Glu Leu 

115 120 125 

tec gac acg gtt ttg gtt caa age gac gaa cga ate eta egg teg ctg 432 

Ser Asp Thr Val Leu Val Gin Ser Asp Glu Arg He Leu Arg Ser Leu 

130 135 140 

tac tgt teg cca tta atg gta tgt ggc gta aat tat caa age atg ttt 480 

Tyr Cys Ser Pro Leu Met Val Cys Gly Val Asn Tyr Gin Ser Met Phe 

145 150 155 160 

cga ate etc tgt cgt tac ctt caa ata tgg gaa ttc gag gaa tgc ttt 528 

Arg He Leu Cys Arg Tyr Leu Gin He Trp Glu Phe Glu Glu Cys Phe 

165 170 175 

get gcc ttt acg cgc aca ctg ccg gaa cat ctg ate ggc acg tgt tat 576 

Ala Ala Phe Thr Arg Thr Leu Pro Glu His Leu He Gly Thr Cys Tyr 

180 185 190 

cag aat tac ttt aaa ctt tta gag cca ttc aaa ate etc acg eta get 624 

Gin Asn Tyr Phe Lys Leu Leu Glu Pro Phe Lys He Leu Thr Leu Ala 

195 200 205 

egg tgt ccg ccc ccg tgt gca aaa ctg cat ctg aat tac tta aaa ttt 672 

Arg Cys Pro Pro Pro Cys Ala Lys Leu His Leu Asn Tyr Leu Lys Phe 

210 215 220 

aac att ttg ggt ttt act teg gat tgg att teg cat ccc gaa ctg cat 720 

Asn He Leu Gly Phe Thr Ser Asp Trp He Ser His Pro Glu Leu His 

225 230 235 240 



cgc gtc caa acg gta att att cac aac ate gaa tec aat ccg gta ctt 
Arg Val Gin Thr Val He He His Asn He Glu Ser Asn Pro Val Leu 
245 250 255 



768 



tta aaa aac etc tea aag caa aac aaa ttt cag gac ate aaa gtg gcg 816 
Leu Lys Asn Leu Ser Lys Gin Asn Lys Phe Gin Asp He- Lys Val Ala 
260 265 270 



age gaa tta ate ata gac tac caa aac ata gta aat caa teg ctt gac 
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Ser Glu Leu He He Asp Tyr Gin Asn He Val Asn Gin Ser Leu Asp 

275 280 285 

gta aat eta cag gtg aaa att aac aaa aag gat ccc ggc aaa aaa ccg 912 

Val Asn Leu Gin Val Lys He Asn Lys Lys Asp Pro Gly Lys Lys Pro 

290 295 300 

tac aag gta gtt gtt gtg acg cca aag tea acg tac tat eta acg ttt 960 

Tyr Lys Val Val Val Val Thr Pro Lys Ser Thr Tyr Tyr Leu Thr Phe 

305 310 315 320 

cca ccc gag gta cca ata ttc cgc gtc gca atg tgc atg tct gtg gcg 1008 

Pro Pro Glu Val Pro He Phe Arg Val Ala Met Cys Met Ser Val Ala 

325 330 335 

gaa cac gtg tgc cat teg tgc gac agg ctg tat cct aac acc gag ttt 1056 

Glu His Val Cys His Ser Cys Asp Arg Leu Tyr Pro Asn Thr Glu Phe 

340 345 350 

c tg ggg cca ggc gaa acg cct cgc gtc eta gag gee atg ttt teg cga 1104 

Leu Gly Pro Gly Glu Thr Pro Arg Val Leu Glu Ala Met Phe Ser Arg 

355 360 365 

att caa tac gca cca aaa gat aga gat tac aat ttt att ttt aac gec 1152 

lie Gin Tyr Ala Pro Lys Asp Arg Asp Tyr Asn Phe He Phe Asn Ala 

370 375 380 



gat caa aac cca gat cga tat gaa caa gcg cgt cac gac cac cag acc 
Asp Gin Asn Pro Asp Arg Tyr Glu Gin Ala Arg His Asp His Gin Thr 
385 390 395 400 



1200 



gaa cct ctt cca gac atg ttc gat cca gtt aag cac atg agt tta cac 1248 

Glu Pro Leu Pro Asp Met Phe Asp Pro Val Lys His Met Ser Leu His 

405 410 415 

aac ttc aaa ate tea gtt ttc aat acc aat atg gta att aat acc aaa 1296 

Asn Phe Lys He Ser Val Phe Asn Thr Asn Met Val He Asn Thr Lys 
420 425 430 

ate acg tgc tgg tct etc gee ggt acc ttc gag tea att ate gac ate 1344 

He Thr Cys Trp Ser Leu Ala Gly Thr Phe Glu Ser He He Asp He 
435 440 445 

ccc agg etc act aat aac ttt gta atg aaa aaa ttc tec gtg aaa gaa 1392 

Pro Arg Leu Thr Asn Asn Phe Val Met Lys Lys Phe Ser Val Lys Glu 
450 455 460 

ccg tea ttc acc gtg agt gtg ttc tac tec gac aac ctg tgc aac ggg 1440 

Pro Ser Phe Thr Val Ser Val Phe Tyr Ser Asp Asn Leu Cys Asn Gly 
465 470 475 480 

get gca att aac gtt aac ata age ggg gac atg ctg cac ttt atg ttc 1488 

Ala Ala He Asn Val Asn He Ser Gly Asp Met Leu His Phe Met Phe 

485 490 495 

get atg gga aac etc agg tgc ttt tta ccg gtg aag cac att ttc ccg 1536 

Ala Met Gly Asn Leu Arg Cys Phe Leu Pro Val Lys His He Phe Pro 
500 505 510 



gtt teg att gcg aac tgg aac tct acg tta gac etc cac ggg ctt gaa 
Val Ser He Ala Asn Trp Asn Ser Thr Leu Asp Leu His Gly Leu Glu 



1584 
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515 520 525 

aac caa tac ata gtt aga egg ggg egg cga gac gtt ttc tgg acc act 1632 

Asn Gin Tyr He Val Arg Arg Gly Arg Arg Asp Val Phe Trp Thr Thr 
530 535 540 

aat ttc cct tct gtg gtt tct agt aaa gac gga tgc aac gtt tec tgg 1680 

Asn Phe Pro Ser Val Val Ser Ser Lys Asp Gly Cys Asn Val Ser Trp 
545 550 555 560 

ttt aag gcg gca act gcg aca att tct aaa ata tac ggt agg ccc tta 1728 

Phe Lys Ala Ala Thr Ala Thr He Ser Lys lie Tyr Gly Arg Pro Leu 

565 570 575 

ctt aaa aaa ctt teg gat gaa ttg aac cca ata tta age gtt cca tat 1776 

Leu Lys Lys Leu Ser Asp Glu Leu Asn Pro He Leu Ser Val Pro Tyr 

580 585 590 

get aga ata gac cag gtg aaa aac aca ata ttc aca acg ctg gag aca 

Ala Arg He Asp Gin Val Lys Asn Thr He Phe Thr Thr Leu Glu Thr 
595 600 605 

aga aat aaa gcg caa ata caa act eta cac aag cgt ttt ate gag tgt 

Arg Asn Lys Ala Gin He Gin Thr Leu His Lys Arg Phe He Glu Cys 
610 615 620 

ttg gtg gaa tgt tgt teg ttt ttg cgt ttg gat ctg gga gee ctg aac 

Leu Val Glu Cys Cys Ser Phe Leu Arg Leu Asp Leu Gly Ala Leu Asn 
625 630 635 640 

cgc gec gee cgc ctt ggt acg ttt gac ttt tea aaa cgc ate att agt 

Arg Ala Ala Arg Leu Gly Thr Phe Asp Phe Ser Lys Arg He He Ser 

645 650 655 

cac acc aaa age aaa cac gaa tgc get att ctg gga tat aaa aaa tgc 2016 

His Thr Lys Ser Lys His Glu Cys Ala lie Leu Gly Tyr Lys Lys Cys 

660 665 670 

aac etc att cca aag ata tac gtg cga agt aaa aaa ata egg tta gac 2064 

Asn Leu He Pro Lys He Tyr Val Arg Ser Lys Lys He Arg Leu Asp 
675 680 685 

gag tta gga cgc aac gec aat ttt atg tct ttc ata get acg acc ggt 2112 

Glu Leu Gly Arg Asn Ala Asn Phe Met Ser Phe He Ala Thr Thr Gly 
690 695 700 

cac gcg ttt tea aac tta aaa ccg cag gta att cga cac act att aga 

His Ala Phe Ser Asn Leu Lys Pro Gin Val He Arg His Thr He Arg 
705 710 715 720 

cgt ttg gga ctt cac tgg aga cat aaa gca aaa ata taa 

Arg Leu Gly Leu His Trp Arg His Lys Ala Lys He 

725 730 



1824 



1872 



1920 



1968 



2160 



2199 



<210> 43 

<211> 732 

<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 

<400> 43 
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Met 


Leu 


Leu 


Gin 


Gly 


Pro 


Val 


Leu 


Leu 


Pro 


Ala 


Cys 


Pro 


Ala 


Thr 


Val 


1 








5 










10 










15 




Ala 


Ala 


Asn 


Ala 


Pro 


Ser 


Pro 


Ala 


Asn 


Ser 


Asp 


Phe 


Lys 


Thr 


Gin 


Leu 








20 










25 










30 






Ala 


He 


Phe 


Cys 


Cys 


Leu 


Ala 


Thr 


Asn 


Asn 


Glu 


lie 


Leu 


Glu 


Asn 


Val 






35 










40 










45 








Ser 


Leu 


Glu 


Ala 


Leu 


Asp 


Arg 


Ala 


Met 


Gin 


Thr 


Glu 


Thr 


Thr 


Phe 


Tyr 




50 










55 










60 










Ala 


Cys 


Arg 


Ala 


Leu 


Arg 


Arg 


Leu 


Val 


Leu 


Gly 


Glu 


Gly 


Leu 


Tyr 


Pro 


65 










70 










75 










80 


Phe 


He 


His 


Arg 


Gin 


Gly 


Gly 


He 


Val 


Gly 


Lys 


Thr 


Gly 


Asn 


Glu 


Tyr 










85 










90 










95 




Ala 


Gly 


Pro 


Gly 


Leu 


He 


He 


Asp 


Asp 


Ala 


He 


Gly 


Cys 


Thr 


Phe 


Ser 








100 










105 










110 






His 


He 


Glu 


Thr 


His 


Thr 


Phe 


Leu 


Pro 


Thr 


Val 


Phe 


Thr 


Tyr 


Glu 


Leu 






115 










120 










125 








Ser 


Asp 


Thr 


Val 


Leu 


Val 


Gin 


Ser 


Asp 


Glu 


Arg 


lie 


Leu 


Arg 


Ser 


Leu 




130 










135 










140 










Tyr 


Cys 


Ser 


Pro 


Leu 


Met 


Val 


Cys 


Gly 


Val 


Asn 


Tyr 


Gin 


Ser 


Met 


Phe 


145 










150 










155 










160 


Arg 


He 


Leu 


Cys 


Arg 


Tyr 


Leu 


Gin 


He 


Trp 


Glu 


Phe 


Glu 


Glu 


Cys 


Phe 










165 










170 










175 




Ala 


Ala 


Phe 


Thr 


Arg 


Thr 


Leu 


Pro 


Glu 


His 


Leu 


He 


Gly 


Thr 


Cys 


Tyr 








180 










185 










190 






Gin 


Asn 


Tyr 


Phe 


Lys 


Leu 


Leu 


Glu 


Pro 


Phe 


Lys 


He 


Leu 


Thr 


Leu 


Ala 






195 










200 










205 








Arg 


Cys 


Pro 


Pro 


Pro 


Cys 


Ala 


Lys 


Leu 


His 


Leu 


Asn 


Tyr 


Leu 


Lys 


Phe 




210 










215 










220 










Asn 


He 


Leu 


Gly 


Phe 


Thr 


Ser 


Asp 


Trp 


lie 


Ser 


His 


Pro 


Glu 


Leu 


His 


225 










230 










235 










240 


Arg 


Val 


Gin 


Thr 


Val 


lie 


He 


His 


Asn 


lie 


Glu 


Ser 


Asn 


Pro 


Val 


Leu 








245 










250 










255 




Leu 


Lys 


Asn 


Leu 


Ser 


Lys 


Gin 


Asn 


Lys 


Phe 


Gin 


Asp 


He 


Lys 


Val 


Ala 








260 










265 










270 






Ser 


Glu 


Leu 


He 


He 


Asp 


Tyr 


Gin 


Asn 


He 


Val 


Asn 


Gin 


Ser 


Leu 


Asp 






275 










280 










285 








Val 


Asn 


Leu 


Gin 


Val 


Lys 


He 


Asn 


Lys 


Lys 


Asp 


Pro 


Gly 


Lys 


Lys 


Pro 




290 










295 










300 










Tyr 


Lys 


Val 


Val 


Val 


Val 


Thr 


Pro 


Lys 


Ser 


Thr 


Tyr 


Tyr 


Leu 


Thr 


Phe 


305 










310 










315 










320 


Pro 


Pro 


Glu 


Val 


Pro 


lie 


Phe 


Arg 


Val 


Ala 


Met 


Cys 


Met 


Ser 


Val 


Ala 










325 










330 










335 




Glu 


His 


Val 


Cys 


His 


Ser 


Cys 


Asp 


Arg 


Leu 


Tyr 


Pro 


Asn 


Thr 


Glu 


Phe 








340 










345 










350 






Leu 


Gly 


Pro 


Gly 


Glu 


Thr 


Pro 


Arg 


Val 


Leu 


Glu 


Ala 


Met 


Phe 


Ser 


Arg 






355 










360 










365 








lie 


Gin 


Tyr 


Ala 


Pro 


Lys 


Asp 


Arg 


Asp 


Tyr 


Asn 


Phe 


He 


Phe 


Asn 


Ala 




370 










3 75 










3 80 










Asp 


Gin 


Asn 


Pro 


Asp 


Arg 


Tyr 


Glu 


Gin 


Ala 


Arg 


His 


Asp 


His 


Gin 


Thr 


385 










390 










3 95 










4 00 


Glu 


Pro 


Leu 


Pro 


Asp 


Met 


Phe 


Asp 


Pro 


Val 


Lys 


His 


Met 


Ser 


Leu 


His 










4 05 










410 










415 




Asn 


Phe 


Lys 


He 


Ser 


Val 


Phe 


Asn 


Thr 


Asn 


Met 


Val 


lie 


Asn 


Thr 


Lys 








420 










425 










430 






He 


Thr 


Cys 


Trp 


Ser 


Leu 


Ala 


Gly 


Thr 


Phe 


Glu 


Ser 


lie 


He 


Asp 


He 






435 










440 










445 








Pro 


Arg 


Leu 


Thr 


Asn 


Asn 


Phe 


Val 


Met 


Lys 


Lys 


Phe 


Ser 


Val 


Lys 


Glu 




450 










455 










460 










Pro 


Ser 


Phe 


Thr 


Val 


Ser 


Val 


Phe 


Tyr 


Ser 


Asp 


Asn 


Leu 


Cys 


Asn 


Gly 


465 










470 










475 










480 


Ala 


Ala 


He 


Asn 


Val 


Asn 


lie 


Ser 


Gly 


Asp 


Met 


Leu 


His 


Phe 


Met 


Phe 



110 



485 490 495 

Ala Met Gly Asn Leu Arg Cys Phe Leu Pro Val Lys His lie Phe Pro 

500 505 510 

Val Ser lie Ala Asn Trp Asn Ser Thr Leu Asp Leu His Gly Leu Glu 

515 520 525 

Asn Gin Tyr He Val Arg Arg Gly Arg Arg Asp Val Phe Trp Thr Thr 

530 535 540 

Asn Phe Pro Ser Val Val Ser Ser Lys Asp Gly Cys Asn Val Ser Trp 
545 550 555 560 

Phe Lys Ala Ala Thr Ala Thr He Ser Lys He Tyr Gly Arg Pro Leu 

565 570 575 

Leu Lys Lys Leu Ser Asp Glu Leu Asn Pro He Leu Ser Val Pro Tyr 

580 585 590 

Ala Arg He Asp Gin Val Lys Asn Thr lie Phe Thr Thr Leu Glu Thr 

595 600 605 

Arg Asn Lys Ala Gin lie Gin Thr Leu His Lys Arg Phe He Glu Cys 

610 615 620 

Leu Val Glu Cys Cys Ser Phe Leu Arg Leu Asp Leu Gly Ala Leu Asn 
625 630 635 640 

Arg Ala Ala Arg Leu Gly Thr Phe Asp Phe Ser Lys Arg He He Ser 

645 650 655 

His Thr Lys Ser Lys His Glu Cys Ala He Leu Gly Tyr Lys Lys Cys 

660 665 670 

Asn Leu lie Pro Lys He Tyr Val Arg Ser Lys Lys He Arg Leu Asp 

675 680 685 

Glu Leu Gly Arg Asn Ala Asn Phe Met Ser Phe He Ala Thr Thr Gly 

690 695 700 

His Ala Phe Ser Asn Leu Lys Pro Gin Val He Arg His Thr He Arg 
705 710 715 720 

Arg Leu Gly Leu His Trp Arg His Lys Ala Lys lie 
725 730 



<210> 44 
<211> 4137 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (4137) 



<400> 44 

atg gag gcg get ctg gag gta cga ccc ttt cca tat atg gec acg gag 
Met Glu Ala Ala Leu Glu Val Arg Pro Phe Pro Tyr Met Ala Thr Glu 
15 10 15 



:aa ttt gag ggt ctt ttg ggc gtg tat acc aac gta att cag ttt gtg 
3ln Phe Glu Gly Leu Leu Gly Val Tyr Thr Asn Val He Gin Phe Val 



G] 

50 55 60 



aag ttt ctg gaa acg teg eta gcg gtt gcg tgc gtt aac acc gag ttt 

111 



48 



96 



gec aat ctg ctt cgt caa atg aaa gag teg gca gec age gga ctt ttt 
Ala Asn Leu Leu Arg Gin Met Lys Glu Ser Ala Ala Ser Gly Leu Phe 
20 25 30 

aaa agt ttt caa ctg ctg ttg gga aaa gac gec agg gag gga gga gtt 144 
Lys Ser Phe Gin Leu Leu Leu Gly Lys Asp Ala Arg Glu Gly Gly Val 
35 40 45 



192 



240 




Lys Phe Leu Glu Thr Ser Leu Ala Val Ala Cys Val Asn Thr Glu Phe 



65 



70 75 80 



aag gac etc aaa cga atg acg gat gga aaa att cag ttt aag gta tct 
Lys Asp Leu Lys Arg Met Thr Asp Gly Lys He Gin Phe Lys Val Ser 
85 90 95 

gta ccg acc ate gcg tat ggg gac ggc agg egg ccc aca aaa caa aaa 
Val Pro Thr He Ala Tyr Gly Asp Gly Arg Arg Pro Thr Lys Gin Lys 
100 105 HO 

caa tac att ate atg aag gec tgc aat aag cat cac att ggt gec gag 
Gin Tyr He He Met Lys Ala Cys Asn Lys His His He Gly Ala Glu 
115 120 125 

ata gag ctg teg act gat gac ate gag ctg eta ttc att gac aga gaa 
He Glu Leu Ser Thr Asp Asp He Glu Leu Leu Phe He Asp Arg Glu 
130 135 140 

acc cca etc gat tac aca gaa tac gec ggg gec gta aag acg att acc 
Thr Pro Leu Asp Tyr Thr Glu Tyr Ala Gly Ala Val Lys Thr He Thr 
145, 150 155 160 

gec tct etc cag ttt ggc gtg gac gcg ctg gag agg ggc ctg gta gat 
Ala Ser Leu Gin Phe Gly Val Asp Ala Leu Glu Arg Gly Leu Val Asp 
165 170 175 

acc gta ttg aat gtt aag ctt agg tec gec ccg ccg atg ttt att eta 
Thr Val Leu Asn Val Lys Leu Arg Ser Ala Pro Pro Met Phe He Leu 
180 185 190 

aaa aca eta tea gac ccg gtc tac acc gaa egg ggt eta aag aag get 
Lys Thr Leu Ser Asp Pro Val Tyr Thr Glu Arg Gly Leu Lys Lys Ala 
195 200 205 

gtt aag tea gac atg gtg tec atg ttc aaa age tac ctt atg gat aac 
Val Lys Ser Asp Met Val Ser Met Phe Lys Ser Tyr Leu Met Asp Asn 
210 215 220 

teg ttt ttc etc gac aaa tea gac ate gec gtc aag gga aag cag tac 
Ser Phe Phe Leu Asp Lys Ser Asp He Ala Val Lys Gly Lys Gin Tyr 
225 230 235 240 

gtg ctg teg gtt etc tec gac atg gtg ggg gcg gtg tgt cac gaa acg 
Val Leu Ser Val Leu Ser Asp Met Val Gly Ala Val Cys His Glu Thr 
245 250 255 

gtt ttt aag ggg acg aat acg tat ctg tct gca teg gga gag cca att 
Val Phe Lys Gly Thr Asn Thr Tyr Leu Ser Ala Ser Gly Glu Pro He 
260 265 - 270 

gec gga gtc atg gag acc acg gaa aac gta atg cga aaa ctg tta aac 
Ala Gly Val Met Glu Thr Thr Glu Asn Val Met Arg Lys Leu Leu Asn 
275 280 285 

atg eta ggt cag gtt gac ggg ggc atg tec ggt ccg gcg tct tac gec 
Met Leu Gly Gin Val Asp Gly Gly Met Ser Gly Pro Ala Ser Tyr Ala 
290 295 300 

aat tac gtt gtc agg ggc gaa aat etc gta acc gec gtg acg tac ggt 
Asn Tyr Val Val Arg Gly Glu Asn Leu Val Thr Ala Val Thr Tyr Gly 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



112 



305 



310 



315 



320 



cgc gtc atg cga acg ttt gac cag ttc atg aaa cgc ata gtg gac cgt 
Arg Val Met Arg Thr Phe Asp Gin Phe Ret Lys Arg lie Val Asp Arg 
325 330 335 

ccc aac gcg cag ccc age gtc gac gat gat egg gac gcg gtg gcg gac 
Pro Asn Ala Gin Pro Ser Val Asp Asp Asp Arg Asp Ala Val Ala Asp 
340 345 350 

ggg cag gac tec etc gee aaa aca ccg ate gcg gcg get gtt ate cag 
Gly Gin Asp Ser Leu Ala Lys Thr Pro lie Ala Ala Ala Val lie Gin 
355 360 365 

att gga gat aaa ctg gtg get ttg gaa agt tta cag cga atg tac aac 
lie Gly Asp Lys Leu Val Ala Leu Glu Ser Leu Gin Arg Met Tyr Asn 
370 375 380 

gag acc cag ttt ccc ttc ccg tta aac agg cgt atg cac tat act tat 
Glu Thr Gin Phe Pro Phe Pro Leu Asn Arg Arg Met His Tyr Thr Tyr 
385 390 395 400 

ttt ttc ccc att ggg ttg cac atg ccc cgt cct cag tat tec acg teg 
Phe Phe Pro He Gly Leu His Met Pro Arg Pro Gin Tyr Ser Thr Ser 
405 410 415 

gec aca att aag ggg gtc gag cac cca gcg gaa caa tec gtc gaa acg 
Ala Thr He Lys Gly Val Glu His Pro Ala Glu Gin Ser Val Glu Thr 
420 425 430 

tgg att gta aat aaa aac aac gta ctg ctg agt ttc aat tat caa aac 
Trp He Val Asn Lys Asn Asn Val Leu Leu Ser Phe Asn Tyr Gin Asn 
435 440 445 

gcg etc aag tec ate tgt cat ccg cgc atg cac aac ccg atg ccg tgt 
Ala Leu Lys Ser He Cys His Pro Arg Met His Asn Pro Met Pro Cys 
450 455 460 

ggg caa get ttg ggt cag gcg ttt cca gat ccc gga cac gtt cac agg 
Gly Gin Ala Leu Gly Gin Ala Phe Pro Asp Pro Gly His Val His Arg 
465 470 475 480 

tac gga caa aga tct gag cac ccc ccg aac atg aac eta tac ggg ctg 
Tyr Gly Gin Arg Ser Glu His Pro Pro Asn Met Asn Leu Tyr Gly Leu 
485 490 495 

gtg tac aac tat tac cag ggc aaa aac gtg gca cac gtt ccg gat gtc 
Val Tyr Asn Tyr Tyr Gin Gly Lys Asn Val Ala His Val Pro Asp Val 
500 505 510 

gee eta aag gcg acg atg acc aca gat gag ttg ctg cac cca acc teg 
Ala Leu Lys Ala Thr Met Thr Thr Asp Glu Leu Leu His Pro Thr Ser 
515 520 525 

cac gag acg ctt cgt ttg gag gtt cac ccg atg ttt gat ttt ttc gtt 
His Glu Thr Leu Arg Leu Glu Val His Pro Met Phe Asp Phe Phe Val 
530 535 ' 540 

cat cag cag cct ggt gcg caa gec gcg tat agg gcg acc cac agg act 
His Gin Gin Pro Gly Ala Gin Ala Ala Tyr Arg Ala Thr His Arg Thr 
545 550 555 560 



atg gtg ggt aac att cca caa ccc ctg gcg ccc aac gag ttt caa aac 1728 
Met Val Gly Asn lie Pro Gin Pro Leu Ala Pro Asn Glu Phe Gin Asn 
565 570 575 

age aga ggc ctg cag ttt gac aga gcg gcg gec gtg get cac gtg ctg 1776 
Ser Arg Gly Leu Gin Phe Asp Arg Ala Ala Ala Val Ala His Val Leu 
580 585 590 

gac cag tea acg atg gaa att ate caa gat acg gcg ttt gac acg teg 1824 
Asp Gin Ser Thr Met Glu lie He Gin Asp Thr Ala Phe Asp Thr Ser 
595 600 605 

tac cca eta etc tgt tat gtc ate gaa tgc etc att cac gga cag gaa 1872 
Tyr Pro Leu Leu Cys Tyr Val He Glu Cys Leu He His Gly Gin Glu 
610 615 620 



gac aaa ttt ttg att aat tct cct tta att gca tta ace att gaa act 
Asp Lys Phe Leu He Asn Ser Pro Leu He Ala Leu Thr He Glu Thr 
625 630 635 640 



1920 



1968 



tac tgg aac aat gee gga aaa ctg gcg ttt att aac age ttc cct atg 
Tyr Trp Asn Asn Ala Gly Lys Leu Ala Phe He Asn Ser Phe Pro Met 
645 650 655 

ctg cga ttt ate tgc gtt cac ctg ggc aac ggt agt att tct aag gac 
Leu Arg Phe He Cys Val His Leu Gly Asn Gly Ser He Ser Lys Asp 
660 665 670 

gtg tac gee cat tac cga aaa gtt ttt ggc gaa etc gtg gtt ttg cag 
Val Tyr Ala His Tyr Arg Lys Val Phe Gly Glu Leu Val Val Leu Gin 
675 680 685 

cag gcg etc teg aaa ate gcg ggc cac gag gtg gtg ggg cgc agg ccc 
Gin Ala Leu Ser Lys He Ala Gly His Glu Val Val Gly Arg Arg Pro 
690 695 , 700 

gcg tec gag ctg att aac tgt ctt cag gac ccc aat ctt ttg ccg ccc 
Ala Ser Glu Leu He Asn Cys Leu Gin Asp Pro Asn Leu Leu Pro Pro 
705 710 715 720 

ttt get tac aat gac gtt ttt ace aac ctg etc agg cag tec teg egg 
Phe Ala Tyr Asn Asp Val Phe Thr Asn Leu Leu Arg Gin Ser Ser Arg 
725 730 735 

cac ccc atg gta etc ata ggc gac gag ggg tac gaa acg gaa aat gac 
His Pro Met Val Leu He Gly Asp Glu Gly Tyr Glu Thr Glu Asn Asp 
740 745 750 

agg gat acg tac ate aac gtc aga gga aaa atg gag gac eta gtc ggt 
Arg Asp Thr Tyr He Asn Val Arg Gly Lys Met Glu Asp Leu Val Gly 
755 760 765 

gac atg gtt aac att tac gag acc aga aac aac gcg gat cat gac ggc 2352 
Asp Met Val Asn He Tyr Glu Thr Arg Asn Asn Ala Asp His Asp Gly 
770 775 780 



2016 



2064 



2112 



2160 



2208 



2256 



2304 



cgc cac gtc ctt gac gtc ggt ccc ttt aat gaa aac gaa cag cac atg 
Arg His Val Leu Asp Val Gly Pro Phe Asn Glu Asn Glu Gin His Met 
785 790 795 800 



2400 



114 



get gtg ctg gaa aag ctt ttt tat tac gtg gtt ctg cca gec tgt acc 2448 
Ala Val Leu Glu Lys Leu Phe Tyr Tyr Val Val Leu Pro Ala Cys Thr 
805 810 815 

aac ggt cac gtc tgc ggc atg ggc gtc gat ttt gac aac gtg gec ctg 2496 
Asn Gly His Val Cys Gly Met Gly Val Asp Phe Asp Asn Val Ala Leu 
820 825 830 

gec ctg acg tac aac ggc cca gtg ttt get gac gtc gtg aac ccc gac 2544 
Ala Leu Thr Tyr Asn Gly Pro Val Phe Ala Asp Val Val Asn Pro Asp 
835 840 845 

gat gag att ttg gac cac ctg gag aac ggg acg etc cgc gag atg etc 2592 
Asp Glu lie Leu Asp His Leu Glu Asn Gly Thr Leu Arg Glu Met Leu 
850 855 860 

gag get teg gat ata cac ccc acc gtt gac atg att cga act ctt tgc 2640 
Glu Ala Ser Asp lie His Pro Thr Val Asp Met lie Arg Thr Leu Cys 
865 870 875 880 

acg teg ttt etc acc tgc ccg ttt gtt acc cag gee tec cgt gtt gtg 2688 
Thr Ser Phe Leu Thr Cys Pro Phe Val Thr Gin Ala Ser Arg Val Val 
885 890 895 

act cag egg gac ccc gcg caa ctg ttg acc act cac gac gac ggg aga 2736 
Thr Gin Arg Asp Pro Ala Gin Leu Leu Thr Thr His Asp Asp Gly Arg 
900 905 910 

tac gtg age cag act gtc etc gtt aac ggg ttc gcg gcg ttt get ate 2784 
Tyr Val Ser Gin Thr Val Leu Val Asn Gly Phe Ala Ala Phe Ala lie 
915 920 925 

gca gat agg tct cgt gac gtt gec gag acc atg ttt tac ccg gtg ccg 2832 
Ala Asp Arg Ser Arg Asp Val Ala Glu Thr Met Phe Tyr Pro Val Pro 
930 935 940 

ttc acc aag ctg tac age gat ccc ctg gtg gcg gec acg etc cac ccg 2880 
Phe Thr Lys Leu Tyr Ser Asp Pro Leu Val Ala Ala Thr Leu His Pro 
945 950 955 960 

ctg gtc gca aat tac gtg acg cgc ctg ccg gee cag cgc gtg ccg gtc 2928 
Leu Val Ala Asn Tyr Val Thr Arg Leu Pro Ala Gin Arg Val Pro Val 
965 970 975 

gcg ttt aac gtc ccc ccg gec etc atg gec gag tac gag gag tgg cac 2976 
Ala Phe Asn Val Pro Pro Ala Leu Met Ala Glu Tyr Glu Glu Trp His 
980 985 990 

aag tct cca atg ctg gec tac get aac acc tgc ccg atg acg ccc acg 3024 
Lys Ser Pro Met Leu Ala Tyr Ala Asn Thr Cys Pro Met Thr Pro Thr 
995 1000 1005 

teg ttg age acc ctg gcg age atg cac atg aag ctg tec gcg ccg ggg 3072 
Ser Leu Ser Thr Leu Ala Ser Met His Met Lys Leu Ser Ala Pro Gly 
1010 1015 1020 

ttc ate tgc cac gca aag cac aag att cac ccg ggc ttt gcg atg acc 3120 
Phe lie Cys His Ala Lys His Lys lie His Pro Gly Phe Ala Met Thr 
1025 1030 1035 1040 

gec gtc cga acc gat gag gtg ttg gcg gag aac ttg eta ttt agt gec 3168 



115 



.1, us I .\\ 



Ala Val Arg Thr Asp Glu Val Leu Ala Glu Asn Leu Leu Phe Ser Ala 
1045 1050 1055 

agg gcc teg acg tec atg ttt tta ggg cag cca teg gtt atg cgt egg 3216 

Arg Ala Ser Thr Ser Met Phe Leu Gly Gin Pro Ser Val Met Arg Arg 
1060 1065 1070 

gaa gtc agg gcg gac gca gtc acg ttt gag gtg aat cat gag ttg gca 3264 

Glu Val Arg Ala Asp Ala Val Thr Phe Glu Val Asn His Glu Leu Ala 
1075 1080 1085 

teg ctg gac atg gcg etc ggt tat tct tec ace ate acg ccc gcc cac 3312 

Ser Leu Asp Met Ala Leu Gly Tyr Ser Ser Thr He Thr Pro Ala His 

1090 1095 1100 

gtt gcg gcg att ace teg gac atg ggc gtt cac tgt cag gac atg ttt 3360 

Val Ala Ala He Thr Ser Asp Met Gly Val His Cys Gin Asp Met Phe 
1105 HIO 1H5 1120 

etc atg ttt ccc ggg gac teg tac cag gac agg ace etc aac gac tac 3408 

Leu Met Phe Pro Gly Asp Ser Tyr Gin Asp Arg Thr Leu Asn Asp Tyr 
1125 1130 1135 

gtt aaa caa aaa gcc gga tgc caa cga ttc ggt ggt cct ggc cag att 3456 

Val Lys Gin Lys Ala Gly Cys Gin Arg Phe Gly Gly Pro Gly Gin He 
1140 1145 1150 

cgt gag ccc gtc get tac gtt gcg ggg gtg ccg cac teg gac aac ata 3504 

Arg Glu Pro Val Ala Tyr Val Ala Gly Val Pro His Ser Asp Asn He 
1155 1160 - 1165 

ccg ggt etc age cac gga cag ctg gcc acg tgt gag att gtt ttg acg 3552 

Pro Gly Leu Ser His Gly Gin Leu Ala Thr Cys Glu He Val Leu Thr 

1170 1175 1180 

ccc gtt act gca gac gtt acc tat ttt caa acc ccc aac agt ccc egg 3600 

Pro Val Thr Ala Asp Val Thr Tyr Phe Gin Thr Pro Asn Ser Pro Arg 
1185 H90 1195 1200 

gga egg gca tec tgc gtg ate teg tgt gac gcg tac aac aac gaa age 3648 

Gly Arg Ala Ser Cys Val He Ser Cys Asp Ala Tyr Asn Asn Glu Ser 
1205 1210 1215 



gcg gaa cgt ttg etc ttt gac cac tec ate ccg gat tct gcc tac gaa 
Ala Glu Arg Leu Leu Phe Asp His Ser He Pro Asp Ser Ala Tyr Glu 
1220 1225 1230 



3696 



tac cgc act acg gtt aac cca tgg gcg teg cag cag ggc tec etc gga 3744 

Tyr Arg Thr Thr Val Asn Pro Trp Ala Ser Gin Gin Gly Ser Leu Gly 
1235 1240 1245 

gac gtg ctg tac aac tea acc teg cgc cag gtc gca gtg cca ggg atg 3792 

Asp Val Leu Tyr Asn Ser Thr Ser Arg Gin Val Ala Val Pro Gly Met 
1250 1255 1260 

tac agt ccg tgt cgc cag ttt ttc cac aag gac get att ttg cgt aac 3 840 

Tyr Ser Pro Cys Arg Gin Phe Phe His Lys Asp Ala He Leu Arg Asn 
1265 1270 1275 1280 

aat egg ggc ctg aac aca eta gtc acg gaa tac gcg gcc cgc etc acg 3888 

Asn Arg Gly Leu Asn Thr Leu Val Thr Glu Tyr Ala Ala Arg Leu Thr 



116 



1285 



1290 



1295 



gga acg ccg gcg ace age gcg acg gac ctg cag tac gtg gtg gtc aac 3 936 

Gly Thr Pro Ala Thr Ser Ala Thr Asp Leu Gin Tyr Val Val Val Asn 
1300 1305 1310 

gga acg gat gtg ttt eta gaa caa ccg tgc cag ttt eta caa gaa gcg 3984 

Gly Thr Asp Val Phe Leu Glu Gin Pro Cys Gin Phe Leu Gin Glu Ala 
1315 1320 1325 

ttt ccc acg etc gec gee agt cac agg tct ctg ctg gac gaa tat atg 4032 

Phe Pro Thr Leu Ala Ala Ser His Arg Ser Leu Leu Asp Glu Tyr Met 
1330 1335 1340 

teg aat aag etc acg cac gec cct gtg cac atg gga cat tat atg att 4080 

Ser Asn Lys Leu Thr His Ala Pro Val His Met Gly His Tyr Met He 
1345 1350 1355 1360 

gag gaa gtg gec cct atg aaa aga eta tta aag ate gga aac aag gtc 4128 

Glu Glu Val Ala Pro Met Lys Arg Leu Leu Lys lie Gly Asn Lys Val 
1365 1370 1375 



gec tat tag 
Ala Tyr 
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<210> 45 
<211> 1378 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 45 

Met Glu Ala Ala Leu Glu Val Arg Pro Phe Pro Tyr Met Ala Thr Glu 

15 10 15 

Ala Asn Leu Leu Arg Gin Met Lys Glu Ser Ala Ala Ser Gly Leu Phe 

20 25 30 

Lys Ser Phe Gin Leu Leu Leu Gly Lys Asp Ala Arg Glu Gly Gly Val 

35 40 45 

Gin Phe Glu Gly Leu Leu Gly Val Tyr Thr Asn Val He Gin Phe Val 

50 55 60 

Lys Phe Leu Glu Thr Ser Leu Ala Val Ala Cys Val Asn Thr Glu Phe 
65 70 75 80 

Lys Asp Leu Lys Arg Met Thr Asp Gly Lys He Gin Phe Lys Val Ser 

85 90 95 

Val Pro Thr He Ala Tyr Gly Asp Gly Arg Arg Pro Thr Lys Gin Lys 

100 105 HO 

Gin Tyr He He Met Lys Ala Cys Asn Lys His His He Gly Ala Glu 

115 120 125 

He Glu Leu Ser Thr Asp Asp He Glu Leu Leu Phe He Asp Arg Glu 

130 135 140 

Thr Pro Leu Asp Tyr Thr Glu Tyr Ala Gly Ala Val Lys Thr lie Thr 
145 150 155 160 

Ala Ser Leu Gin Phe Gly Val Asp Ala Leu Glu Arg Gly Leu Val Asp 

165 170 175 

Thr Val Leu Asn Val Lys Leu Arg Ser Ala Pro Pro Met Phe He Leu 

180 185 190 

Lys Thr Leu Ser Asp Pro Val Tyr Thr Glu Arg Gly Leu Lys Lys Ala 

195 200 205 

Val Lys Ser Asp Met Val Ser Met Phe Lys Ser Tyr Leu Met Asp Asn 

210 215 220 

Ser Phe Phe Leu Asp Lys Ser Asp He Ala Val Lys Gly Lys Gin Tyr 
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1205 1210 1215 

Ala Glu Arg Leu Leu Phe Asp His Ser He Pro Asp Ser Ala Tyr Glu 

1220 1225 1230 

Tyr Arg Thr Thr Val Asn Pro Trp Ala Ser Gin Gin Gly Ser Leu Gly 

1235 1240 1245 

Asp Val Leu Tyr Asn Ser Thr Ser Arg Gin Val Ala Val Pro Gly Met 

1250 1255 1260 

Tyr Ser Pro Cys Arg Gin Phe Phe His Lys Asp Ala He Leu Arg Asn 
1265 1270 1275 1280 

Asn Arg Gly Leu Asn Thr Leu Val Thr Glu Tyr Ala Ala Arg Leu Thr 

1285 1290 1295 

Gly Thr Pro Ala Thr Ser Ala Thr Asp Leu Gin Tyr Val Val Val Asn 

1300 1305 1310 

Gly Thr Asp Val Phe Leu Glu Gin Pro Cys Gin Phe Leu Gin Glu Ala 

1315 1320 1325 

Phe Pro Thr Leu Ala Ala Ser His Arg Ser Leu Leu Asp Glu Tyr Met 

1330 1335 1340 

Ser Asn Lys Leu Thr His Ala Pro Val His Met Gly His Tyr Met He 
1345 1350 1355 1360 

Glu Glu Val Ala Pro Met Lys Arg Leu Leu Lys He Gly Asn Lys Val 
1365 1370 1375 

Ala Tyr 



<210> 46 

<211> 918 

<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (918) 

<400> 46 

atg gcc etc gat aag age ate gtt gtc teg gtg acg tct aga tta ttc 48 

Met Ala Leu Asp Lys Ser He Val Val Ser Val Thr Ser Arg Leu Phe 
1 5 10 15 



gcc gac gag ata gca aat ctt cag tea aag ata gga tgc att ttg cct 

Ala Asp Glu He Ala Asn Leu Gin Ser Lys He Gly Cys He Leu Pro 
20 25 30 

etc aga gac gcc cac cgt ctg cag aat ata cag gcg ctg ggt ctg ggg 

Leu Arg Asp Ala His Arg Leu Gin Asn He Gin Ala Leu Gly Leu Gly 
35 40 45 



tat ttg gac aaa tgc act etc gcc gtg ttg gaa gag gtc ggt ccc aac 

Tyr Leu Asp Lys Cys Thr Leu Ala Val Leu Glu Glu Val Gly Pro Asn 

65 70 75 80 

agt tta egg eta acg cgc att gat ccc atg gac aat tat caa ata aaa 

Ser Leu Arg Leu Thr Arg He Asp Pro Met Asp Asn Tyr Gin He Lys 

85 90 95 

aac gcg tac caa ccg gcc ttc cat tgg gat aac tac tea gaa ttg gta 

Asn Ala Tyr Gin Pro Ala Phe His Trp Asp Asn Tyr Ser Glu Leu Val 



96 



144 



aac ctg tgc tct agg gat tec gcg gtg gat ttt att cag gca tat cac 192 
Asn Leu Cys Ser Arg Asp Ser Ala Val Asp Phe He Gin Ala Tyr His 
50 55 60 
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288 



336 



120 



100 



105 HO 



130 



eta ctg acc tea ttg gtg cgc cac aac aaa cac gaa tta gtc gag att 
Leu Leu Thr Ser Leu Val Arg His Asn Lys His Glu Leu Val Glu He 
230 235 240 



225 



290 

aac tag 

Asn 

305 



384 



gtt ata cca ccg gtc ttt ggg cgc aaa gat gcg acc gtc tea ctg gag 
Val He Pro Pro Val Phe Gly Arg Lys Asp Ala Thr Val Ser Leu Glu 
115 120 125 

tct aac ggg ttt gat gtg gtt ttc cct gec gtg gtg cca gaa cca ctg 432 
Ser Asn Gly Phe Asp Val Val Phe Pro Ala Val Val Pro Glu Pro Leu 

135 140 



480 



get caa aca gtg ctt cag aag ctg ctg ctg tat aac ata tac tac aga 

Ala Gin Thr Val Leu Gin Lys Leu Leu Leu Tyr Asn He Tyr Tyr Arg 

145 150 155 160 

gtg gcg gag acg acg ccc acc gac gtc aac eta gec gag gtg acg ctg 

Val Ala Glu Thr Thr Pro Thr Asp Val Asn Leu Ala Glu Val Thr Leu 
165 170 175 

tac acg acc aat ate act tac atg ggt cgc aac tac gec ctg gac gtg 

Tyr Thr Thr Asn He Thr Tyr Met Gly Arg Asn Tyr Ala Leu Asp Val 
180 185 190 

gac ccc gtt ggg teg age tea get atg egg atg ctg gac gac ctg tec 

Asp Pro Val Gly Ser Ser Ser Ala Met Arg Met Leu Asp Asp Leu Ser 
195 200 205 

att tac ctg tgc gtt ttg tec gcg tta att ccg cgc ggg tgc gta agg 672 

He Tyr Leu Cys Val Leu Ser Ala Leu He Pro Arg Gly Cys Val Arg 

210 215 220 
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ttc gag ggg gtg gtg cca cct gag gta cag gec ctg gat etc aac aac 

Phe Glu Gly Val Val Pro Pro Glu Val Gin Ala Leu Asp Leu Asn Asn 

245 250 255 

gta age gtg gec gac gac ata acg cgc atg ggt gec etc ata acc tat 

Val Ser Val Ala Asp Asp He Thr Arg Met Gly Ala Leu He Thr Tyr 
260 265 270 

eta cga agt etc agt tct ata ttt aat ctg ggc cgc aga ttt cac gtt 

Leu Arg Ser Leu Ser Ser He Phe Asn Leu Gly Arg Arg Phe His Val 
275 280 285 

tac gcg ttc tea teg gac acg aat acc get tec tgt tgg tgt gca tat 912 

Tyr Ala Phe Ser Ser Asp Thr Asn Thr Ala Ser Cys Trp Cys Ala Tyr 

295 300 
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Ser 


Ser 


He 


Phe 


Asn 


Leu 


Gly 


Arg 


Arg 


Phe 


His 


Val 




275 










280 










285 








Tyr 


Ala 


Phe 


Ser 


Ser 


Asp 


Thr 


Asn 


Thr 


Ala 


Ser 


Cys 


Trp 


Cys 


Ala 


Tyr 



290 295 300 

Asn 
305 



<210> 48 
<211> 810 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) - . (810) 

<400> 48 

atg teg ate ccc aaa att atg acg gtg tec aga gac aac gag ggt acg 

Met Ser He Pro Lys He Met Thr Val Ser Arg Asp Asn Glu Gly Thr 
15 10 15 

gtg tgt gaa gtc gcg gtg gac aac gga cga cac aga gcg atg att tat 
Val Cys Glu Val Ala Val Asp Asn Gly Arg His Arg Ala Met He Tyr 
20 25 30 



122 




tac cct aag acc acc aac tta gca aac gag cgc gcg gac gtt gtt aag 144 

Tyr Pro Lys Thr Thr Asn Leu Ala Asn Glu Arg Ala Asp Val Val Lys 

35 40 45 

gaa get ttt gat acc gaa acc cca gtg gac att gta aag caa att gtt 192 

Glu Ala Phe Asp Thr Glu Thr Pro Val Asp He Val Lys Gin He Val 

50 55 60 

aac gag ggc eta get ata tec aaa aaa aat tgc gtc cgt ttg gcg ttg 240 

Asn Glu Gly Leu Ala He Ser Lys Lys Asn Cys Val Arg Leu Ala Leu 

65 70 75 80 

tat tta tat ttt tat ttg cag tac gtg tgc ttt get ctg etc etc act 288 

Tyr Leu Tyr Phe Tyr Leu Gin Tyr Val Cys Phe Ala Leu Leu Leu Thr 

85 90 95 

tgg cag tta aac ccg tac atg gac cca ccg ggt ctg gtg ttt gcg gtt 336 

Trp Gin Leu Asn Pro Tyr Met Asp Pro Pro Gly Leu Val Phe Ala Val 

100 105 110 

aac ccc atg ggt cca aaa cat gtc acg aaa eta ccg cac ccg get att 384 

Asn Pro Met Gly Pro Lys His Val Thr Lys Leu Pro His Pro Ala He 

115 120 125 

gtt gcg gta ggt tgt ggg gca gac gec ate tgt aag aac tgt age gtc 432 

Val Ala Val Gly Cys Gly Ala Asp. Ala He Cys Lys Asn Cys Ser Val 

130 135 140 

ccc gat ate aaa acg gag ctt gga atg gtt tac cac aac ggg tct age 480 

Pro Asp He Lys Thr Glu Leu Gly Met Val Tyr His Asn Gly Ser Ser 

145 150 155 160 

gat tct ggt cag cgc gca cac tat ggg ctg gec ctg tta aag gcg gec 528 

Asp Ser Gly Gin Arg Ala His Tyr Gly Leu Ala Leu Leu Lys Ala Ala 

165 170 175 

tgg ctt gtc atg gga aat gtg tgt ccg gaa cca gta gtg egg caa ggc 576 

Trp Leu Val Met Gly Asn Val Cys Pro Glu Pro Val Val Arg Gin Gly 

180 185 190 

get gca tta ctt ggt cca tgg aac egg acg gag tgg teg gat ttt aaa 624 

Ala Ala Leu Leu Gly Pro Trp Asn Arg Thr Glu Trp Ser Asp Phe Lys 

195 200 205 

teg gca atg gcg gca acc acg ttt tgc gga tec aga ggc gtt ctg tgg 672 

Ser Ala Met Ala Ala Thr Thr Phe Cys Gly Ser Arg Gly Val Leu Trp 

210 215 220 

tea ccg att cat gaa aaa aac etc tgt cgc ccc acc tgg aat gat gta 720 

Ser Pro He His Glu Lys Asn Leu Cys Arg Pro Thr Trp Asn Asp Val 

225 230 235 240 

att aac aca tea gtt ttt aca aat gaa tea etc tgt cca aat ata cct 768 

He Asn Thr Ser Val Phe Thr Asn Glu Ser Leu Cys Pro Asn He Pro 

245 250 255 

gtg gtg ccc gaa agt gta ata gtg ctt aat ggt gat gca tga 810 

Val Val Pro Glu Ser Val He Val Leu Asn Gly Asp Ala" 

260 265 270 



123 



'In JU..JMU! C 



%j : 



<210> 49 
<211> 269 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 49 



Met 


Ser 


He 


Pro 


Lys 


He 


Met 


Thr 


Val 


Ser 


Arg 


Asp 


Asn 


Glu 


Gly 


Thr 


1 








5 










10 










15 




Val 


Cys 


Glu 


Val 


Ala 


Val 


Asp 


Asn 


Gly 


Arg 


His 


Arg 


Ala 


Met 


He 


Tyr 








20 










25 










30 






Tyr 


Pro 


Lys 


Thr 


Thr 


Asn 


Leu 


Ala 


Asn 


Glu 


Arg 


Ala 


Asp 


Val 


Val 


Lys 






35 










40 










45 








Glu 


Ala 


Phe 


Asp 


Thr 


Glu 


Thr 


Pro 


Val 


Asp 


He 


Val 


Lys 


Gin 


He 


Val 




50 










55 










60 










Asn 


Glu 


Gly 


Leu 


Ala 


He 


Ser 


Lys 


Lys 


Asn 


Cys 


Val 


Arg 


Leu 


Ala 


Leu 


65 










70 










75 










80 


Tyr 


Leu 


Tyr 


Phe 


Tyr 


Leu 


Gin 


Tyr 


Val 


Cys 


Phe 


Ala 


Leu 


Leu 


Leu 


Thr 










85 










90 










95 




Trp 


Gin 


Leu 


Asn 


Pro 


Tyr 


Met 


Asp 


Pro 


Pro 


Gly 


Leu 


Val 


Phe 


Ala 


Val 








100 










105 










110 






Asn 


Pro 


Met 


Gly 


Pro 


Lys 


His 


Val 


Thr 


Lys 


Leu 


Pro 


His 


Pro 


Ala 


He 






115 










120 










125 








Val 


Ala 


Val 


Gly 


Cys 


Gly 


Ala 


Asp 


Ala 


He 


Cys 


Lys 


Asn 


Cys 


Ser 


Val 




130 










135 










140 










Pro 


Asp 


He 


Lys 


Thr 


Glu 


Leu 


Gly 


Met 


Val 


Tyr 


His 


Asn 


Gly 


Ser 


Ser 


145 










150 










155 










160 


Asp 


Ser 


Gly 


Gin 


Arg 


Ala 


His 


Tyr 


Gly 


Leu 


Ala 


Leu 


Leu 


Lys 


Ala 


Ala 










165 










170 










175 




Trp 


Leu 


Val 


Met 


Gly 


Asn 


Val 


Cys 


Pro 


Glu 


Pro 


Val 


Val 


Arg 


Gin 


Gly 








180 










185 










190 






Ala 


Ala 


Leu 


Leu 


Gly 


Pro 


Trp 


Asn 


Arg 


Thr 


Glu 


Trp 


Ser 


Asp 


Phe 


Lys 






195 










200 










205 








Ser 


Ala 


Met 


Ala 


Ala 


Thr 


Thr 


Phe 


Cys 


Gl Y 


Ser 


Arg 


Gly 


Val 


Leu 


Trp 




210 










215 










220 










Ser 


Pro 


He 


His 


Glu 


Lys 


Asn 


Leu 


Cys 


Arg 


Pro 


Thr 


Trp 


Asn 


Asp 


Val 


225 










230 










235 










240 


lie 


Asn 


Thr 


Ser 


Val 


Phe 


Thr 


Asn 


Glu 


Ser 


.Leu 


Cys 


Pro 


Asn 


He 


Pro 










245 










250 










255 




Val 


Val 


Pro 


Glu 


Ser 


Val 


He 


Val 


Leu 


Asn 


Gly 


Asp 


Ala 









<210> 50 
<211> 276 
<212> DMA 

<213> Macaca mulatta rhadinovirus 17577 



<220> 

<221> CDS 

<222> (1) . . (276) 

<400> 50 

atg act gca cac acg aat ggg gtt 

Met Thr Ala His Thr Asn Gly Val 
1 5 

agt cag ccg gaa teg gtt caa gtt 
Ser Gin Pro Glu Ser Val Gin Val 
20 



tta acc acg acg ggc ttt tea aca 48 
Leu Thr Thr Thr Gly Phe Ser Thr 
10 15 

tct cca ttt tat cgc gta att aca 96 
Ser Pro Phe Tyr Arg Val He Thr 
25 30 



124 
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aaa cct ccc gtt atg ggc ttg ttt ttt tgc gtg get atg tgc gtt ate 144 

Lys Pro Pro Val Met Gly Leu Phe Phe Cys Val Ala Met Cys Val lie 

35 40 45 

gcg ttg gta tgg tac gtg atg egg agg gtg tgt tgt aag ggg cgc gtt 192 

Ala Leu Val Trp Tyr Val Met Arg Arg Val Cys Cys Lys Gly Arg Val 

50 55 60 

gtt gec gat teg tgt cgc gac ccg cgt caa ccc gcg tat gag atg ttg 240 

Val Ala Asp Ser Cys Arg Asp Pro Arg Gin Pro Ala Tyr Glu Met Leu 

65 70 75 80 

aat gtt agg ttg cgt ccc cac gga acc aat cca tag 276 

Asn Val Arg Leu Arg Pro His Gly Thr Asn Pro 

85 90 



<210> 51 
<211> 91 
<2l2> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 51 

Met Thr Ala His Thr Asn Gly Val Leu Thr Thr Thr Gly Phe Ser Thr 

! 5 10 15 

Ser Gin Pro Glu Ser Val Gin Val Ser Pro Phe Tyr Arg Val lie Thr 

20 25 30 

Lys Pro Pro Val Met Gly Leu Phe Phe Cys Val Ala Met Cys Val lie 

35 40 45 

Ala Leu Val Trp Tyr Val Met Arg Arg Val Cys Cys Lys Gly Arg Val 

50 55 60 

Val Ala Asp Ser Cys Arg Asp Pro Arg Gin Pro Ala Tyr Glu Met Leu 
65 70 75 80 

Asn Val Arg Leu Arg Pro His Gly Thr Asn Pro 
85 90 



<210> 52 

<211> 1047 

<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) - - (1047) 



<400> 52 

atg ctt caa aag gac gca aaa ctg att ttt ata tec tea agt aac tea 

Met Leu Gin Lys Asp Ala Lys Leu lie Phe lie Ser Ser Ser Asn Ser 

1 5 10 15 

tct gac aaa tea act agt ttt tta ctt aat ttg aag gac gec cac gaa 

Ser Asp Lys Ser Thr Ser Phe Leu Leu Asn Leu Lys Asp Ala His Glu 

20 25 30 



48 



96 



aag atg ctt aac gtg gtg aac tac gta tgt ccg gat cat aaa gat gat 144 
Lys Met Leu Asn Val Val Asn Tyr Val Cys Pro Asp His Lys Asp Asp 
35 40 45 

ttt aac ttg caa gac act gtc gtg gcg tgc ccg tgc tac cgc ctg cat 192 

125 



iUi 
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Phe Asn Leu Gin Asp Thr Val Val Ala Cys Pro Cys Tyr Arg Leu His 
50 55 60 



att cct gcc tat att acg att gat gag acg gta egg age acg act aat 
lie Pro Ala Tyr lie Thr lie Asp Glu Thr Val Arg Ser Thr Thr Asn 
65 70 75 80 



240 



ctt ttt ttg gaa ggc gca ttc tec acg gag eta atg ggg gac get gcc 
Leu Phe Leu Glu Gly Ala Phe Ser Thr Glu Leu Met Gly Asp Ala Ala 
85 90 95 



288 



acg tec get caa age atg cat aaa ate gtt age gac tct teg ctg tea 
Thr Ser Ala Gin Ser Met His Lys lie Val Ser Asp Ser Ser Leu Ser 
100 105 110 



336 



caa ttg gat ctg tgc cgc gta aag age acg tea caa gat ata cag ggc 
Gin Leu Asp Leu Cys Arg Val Lys Ser Thr Ser Gin Asp He Gin Gly 
115 120 125 



384 



gcg atg aag ccg tgt etc cac gtt tac ata gac ccg gcg tac ace aac 
Ala Met Lys Pro Cys Leu His Val Tyr He Asp Pro Ala Tyr Thr Asn 
130 135 140 



432 



aac aca gac gcg tec ggc ace ggc ate ggt gcg gtg att gcg gta aat 
Asn Thr Asp Ala Ser Gly Thr Gly He Gly Ala Val He Ala Val Asn 
145 150 155 160 



480 



cac aag gtg att aaa tgc att tta tta ggc gtg gaa cat ttt ttt eta 
His Lys Val He Lys Cys He Leu Leu Gly Val Glu His Phe Phe Leu 
165 170 175 



528 



aga gat eta ace ggc ace gcc gcg tac cag ata gcg tea tgt gcc gcc 
Arg Asp Leu Thr Gly Thr Ala Ala Tyr Gin lie Ala Ser Cys Ala Ala 
180 185 190 



576 



gcg tta att cga gcg ate gtt ace ctt cac ccg cag ate acg cac gtt 
Ala Leu He Arg Ala He Val Thr Leu His Pro Gin He Thr His Val 
195 200 205 



624 



aac gtc gcc gtg gaa ggc aac age agt caa gat gcc gga gtg gcc ata 
Asn Val Ala Val Glu Gly Asn Ser Ser Gin Asp Ala Gly Val Ala He 
210 215 220 



672 



gca ace gtg tta aac gag att tgc teg gtc cct ctt agt ttt tta cac 
Ala Thr Val Leu Asn Glu He Cys Ser Val Pro Leu Ser Phe Leu His 
225 230 235 240 



720 



cac gtg gac aag aac act ctt ata cgt teg ccc att tac atg ttg ggg 
His Val Asp Lys Asn Thr Leu He Arg Ser Pro He Tyr Met Leu Gly 
245 250 255 



768 



cca gag aaa gcc aag gcg ttt gaa tct ttt att tac gca ttg aac teg 
Pro Glu Lys Ala Lys Ala Phe Glu Ser Phe He Tyr Ala Leu Asn Ser 
260 265 270 



816 



gga acg ttt age get age caa act gtg gtg tct cac act att aag eta 
Gly Thr Phe Ser Ala Ser Gin Thr Val Val Ser His Thr He Lys Leu 
275 280 285 

teg ttt gat ccc gta gcg tat eta ata gat caa ate aag gca ata cgt 
Ser Phe Asp Pro Val Ala Tyr Leu He Asp Gin He Lys Ala He Arg 

126 



864 



912 



290 



295 



300 



tgc att cca eta aaa gac gga ggt cac acg tac tgc gcg aaa caa aaa 
Cys lie Pro Leu Lys Asp Gly Gly His Thr Tyr Cys Ala Lys Gin Lys 
305 310 315 320 

acc atg teg gac gac gtg ctt gtc gec gec gtc atg gec cac tac atg 
Thr Met Ser Asp Asp Val Leu Val Ala Ala Val Met Ala His Tyr Met 
325 330 335 

gca acc aac gat aaa ttt gtt ttt aaa teg eta gaa taa 
Ala Thr Asn Asp Lys Phe Val Phe Lys Ser Leu Glu 
340 345 



<210> 53 
<211> 348 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 



<400> 53 



Met 


Leu 


Gin 


Lys 


Asp 


Ala 


Lys 


Leu 


He 


Phe 


He 


Ser 


Ser 


Ser 


Asn 


Ser 


1 








5 










10 










15 




Ser 


Asp 


Lys 


Ser 


Thr 


Ser 


Phe 


Leu 


Leu 


Asn 


Leu 


Lys 


Asp 


Ala 


His 


Glu 




20 










25 










30 






Lys 


Met 


Leu 


Asn 


Val 


Val 


Asn 


Tyr 


Val 


Cys 


Pro 


Asp 


His 


Lys 


Asp 


Asp 




35 










40 










45 






His 


Phe 


Asn 


Leu 


Gin 


Asp 


Thr 


Val 


Val 


Ala 


Cys 


Pro 


Cys 


Tyr 


Arg 


Leu 




50 










55 










60 










lie 


Pro 


Ala 


Tyr 


He 


Thr 


He 


Asp 


Glu 


Thr 


vdl 


Arg 


Ser 


Thr 


Thr 


Asn 


65 








70 










75 










80 


Leu 


Phe 


Leu 


Glu 


Gly 


Ala 


Phe 


Ser 


Thr 


Glu 


Leu 


Met 


Gly 


Asp 


Ala 


Ala 










85 










90 










95 




Thr 


Ser 


Ala 


Gin 


Ser 


Met 


His 


Lys 


He 


Val 


Ser 


Asp 


Ser 


Ser 


Leu 


Ser 








100 










105 










110 






Gin 


Leu 


Asp 


Leu 


Cys 


Arg 


Val 


Lys 


Ser 


Thr 


Ser 


Gin 


Asp 


He 


Gin 


Gly 






115 










120 










125 








Ala 


Met 


Lys 


Pro 


Cys 


Leu 


His 


Val 


Tyr 


He 


Asp 


Pro 


Ala 


Tyr 


Thr 


Asn 




130 








135 










140 










Asn 


Thr 


Asp 


Ala 


Ser 


Gly 


Thr 


Gly 


He 


Gly 


Ala 


Val 


He 


Ala 


Val 


Asn 


145 










150 










155 










160 


His 


Lys 


Val 


He 


Lys 


Cys 


He 


Leu 


Leu 


Gly 


Val 


Glu 


His 


Phe 


Phe 


Leu 








165 










170 










175 




Arg 


Asp 


Leu 


Thr 


Gly 


Thr 


Ala 


Ala 


Tyr 


Gin 


He 


Ala 


Ser 


Cys 


Ala 


Ala 




180 










185 










190 






Ala 


Leu 


He 


Arg 


Ala 


He 


Val 


Thr 


Leu 


His 


Pro 


Gin 


He 


Thr 


His 


Val 






195 










200 










205 








Asn 


Val 


Ala 


Val 


Glu 


Gly 


Asn 


Ser 


Ser 


Gin 


Asp 


Ala 


Gly 


Val 


Ala 


He 




210 










215 










220 










Ala 


Thr 


Val 


Leu 


Asn 


Glu 


He 


Cys 


Ser 


Val 


Pro 


Leu 


Ser 


Phe 


Leu 


His 


225 










230 










235 










240 


His 


Val 


Asp 


Lys 


Asn 


Thr 


Leu 


He 


Arg 


Ser 


Pro 


He 


Tyr 


Met 


Leu 


Gly 








245 










250 










255 




Pro 


Glu 


Lys 


Ala 


Lys 


Ala 


Phe 


Glu 


Ser 


Phe 


He 


Tyr 


Ala 


Leu 


Asn 


Ser 






260 










265 










270 






Gly 


Thr 


Phe 


Ser 


Ala 


Ser 


Gin 


Thr 


Val 


Val 


Ser 


His 


Thr 


He 


Lys 


Leu 




275 










280 










285 








Ser 


Phe 


Asp 


Pro 


Val 


Ala 


Tyr 


Leu 


He 


Asp 


Gin 


He 


Lys 


Ala 


He 


Arg 




290 










295 










300 










Cys 


He 


Pro 


Leu 


Lys 


Asp 


Gly 


Gly 


His 


Thr 


Tyr 


Cys 


Ala 


Lys 


Gin 


Lys 



310 315 



Thr Met Ser Asp Asp Val Leu Val Ala Ala Val Met Ala His Tyr Met 

325 330 335 

Ala Thr Asn Asp Lys Phe Val Phe Lys Ser Leu Glu 
340 345 



<210> 54 
<211> 231 
<212> DMA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (231) 

<400> 54 

atg gag aac gat acg cct aag gac aaa ate teg gaa get gac ttt caa 
Met Glu Asn Asp Thr Pro Lys Asp Lys lie Ser Glu Ala Asp Phe Gin 
15 10 15 

cag tgt cag gcg ttc ttt cac cgt ccc att aga gat eta att tea tct 
Gin Cys Gin Ala Phe Phe His Arg Pro lie Arg Asp Leu lie Ser Ser 
20 25 30 

gga get gac get tta aac cac ttt age eta tct gaa tea gac gga cat 
Gly Ala Asp Ala Leu Asn His Phe Ser Leu Ser Glu Ser Asp Gly His 
35 40 45 

aaa ttg gaa egg att gtt ctt ctg ctt gac ctg gtg ggg aca gaa tgt 
Lys Leu Glu Arg He Val Leu Leu Leu Asp Leu Val Gly Thr Glu Cys 
50 55 60 

etc tct tat acc acg ate get gca aag aat gtc aaa tga 
Leu Ser Tyr Thr Thr He Ala Ala Lys Asn Val Lys 
65 70 75 



<210> 55 
<211> 76 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 55 

Met Glu Asn Asp Thr Pro Lys Asp Lys He Ser Glu Ala Asp Phe Gin 

15 10 15 

Gin Cys Gin Ala Phe Phe His Arg Pro He Arg Asp Leu He Ser Ser 

20 25 30 

Gly Ala Asp Ala Leu Asn His Phe Ser Leu Ser Glu Ser Asp Gly His 

35 40 45 

Lys Leu Glu Arg He Val Leu Leu Leu Asp Leu Val Gly Thr Glu Cys 

50 55 60 

Leu Ser Tyr Thr Thr He Ala Ala Lys Asn Val Lys 
65 70 75 



<210> 56 
<211> 654 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 



128 



gta atg age gac ggc gaa ggt ggc aga gac etc gcg gcg age att tac 
Val Met Ser Asp Gly Glu Gly Gly Arg Asp Leu Ala Ala Ser He Tyr 
150 155 ' 160 



145 



48 



96 



<220> 
<221> CDS 
<222> (1) . . (654) 

<400> 56 

atg tct etc tta tac cac gat cgc tgc aaa gaa tgt caa atg acg cgc 

Met Ser Leu Leu Tyr His Asp Arg Cys Lys Glu Cys Gin Met Thr Arg 
1 5 10 15 

gtc aac age cca ata tgt cga ttt cat aac gtc tct aac tta tac cag 
Val Asn Ser Pro He Cys Arg Phe His Asn Val Ser Asn Leu Tyr Gin 
20 25 30 

tgt ttg gat tgt aag cgc tat cac gta tgc gac ggg gga cgc aac tgc 144 
Cys Leu Asp Cys Lys Arg Tyr His Val Cys Asp Gly Gly Arg Asn Cys 
35 40 45 

gtg ate gtg tac act cgc gaa aat eta gtg tgt gat tta acg gga aac 192 
Val He Val Tyr Thr Arg Glu Asn Leu Val Cys Asp Leu Thr Gly Asn 
50 55 60 

tgc gtt ttg gat aat gtg cag gac gta tgt teg tac ggt cct cca gaa 
Cys Val Leu Asp Asn Val Gin Asp Val Cys Ser Tyr Gly Pro Pro Glu 
65 70 75 80 

cgc cgc gta ccc gac gec ttc ate gat ccg etc gtg tea cac ggc acg 
Arg Arg Val Pro Asp Ala Phe He Asp Pro Leu Val Ser His Gly Thr 
85 90 95 

agg gaa tgt ctt aaa age gat ata ctg agg tac ttt gag acg gtc ggt 336 
Arg Glu Cys Leu Lys Ser Asp He Leu Arg Tyr Phe Glu Thr Val Gly 
100 105 HO 

gtg aaa tct gag gca tat tct acc gtt gtc aag aat gga caa ttg aat 384 
Val Lys Ser Glu Ala Tyr Ser Thr Val Val Lys Asn Gly Gin Leu Asn 
115 120 . 125 

ggc ate ata ggt aga tta ata gac get acg ttt aac gag tgc ctt ccg 432 
Gly He He Gly Arg Leu He Asp Ala Thr Phe Asn Glu Cys Leu Pro 
130 135 140 



240 



288 



480 



528 



576 



ate cac ata att ate tec ata tac tec act aaa acg gta tat gat aat 
He His He He He Ser He Tyr Ser Thr Lys Thr Val Tyr Asp Asn 
165 170 175 

ctt eta ttt aaa tgt acg aga aat aaa aaa tac gac cac att gta aaa 
Leu Leu Phe Lys Cys Thr Arg Asn Lys Lys Tyr Asp His He Val Lys 
180 185 190 

act ate aga gcg caa tgg atg cgc atg gtc tea acc ggc gat ccg teg 624 
Thr He Arg Ala Gin Trp Met Arg Met Val Ser Thr Gly Asp Pro Ser 
195 200 205 

egg gtc agt gcg acg ggt tgt ttc acg tga 654 
Arg Val Ser Ala Thr Gly Cys Phe Thr 
210 215 



129 



<210> 57 
<211> 217 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 



<400> 57 



Met 


Ser 


Leu 


Leu 


Tyr 


His 


Asp 


Arg 


Cys 


Lys 


Glu 


Cys 


Gin 


Met 


Thr 


Arg 


1 








5 










10 










15 




Val 


Asn 


Ser 


Pro 


He 


Cys 


Arg 


Phe 


His 


Asn 


Val 


Ser 


Asn 


Leu 


Tyr 


Gin 








20 










25 










3 0 






Cys 


Leu 


Asp 


Cys 


Lys 


Arg 


Tyr 


His 


Val 


Cys 


Asp 


Gly 


Gly 


Arg 


Asn 


Cys 




35 










40 










45 








Val 


He 


Val 


Tyr 


Thr 


Arg 


Glu 


Asn 


Leu 


Val 


Cys 


Asp 


Leu 


Thr 


Gly 


Asn 




50 










55 










60 










Cys 


Val 


Leu 


Asp 


Asn 


Val 


Gin 


Asp 


Val 


Cys 


Ser 


Tyr 


Gly 


Pro 


Pro 


Glu 


65 










70 










75 










o u 


Arg 


Arg 


Val 


Pro 


Asp 


Ala 


Phe 


He 


Asp 


Pro 


Leu 


Val 


Ser 


His 


Gly 


Thr 






85 










90 










95 




Arg 


Glu 


Cys 


Leu 


Lys 


Ser 


Asp 


He 


Leu 


Arg 


Tyr 


Phe 


Glu 


Thr 


Val 


Gly 






100 










105 










110 






Val 


Lys 


Ser 


Glu 


Ala 


Tyr 


Ser 


Thr 


Val 


Val 


Lys 


Asn 


Gly 


Gin 


Leu 


Asn 




115 










120 










125 








Gly 


He 


He 


Gly Arg 


Leu 


He 


Asp 


Ala 


Thr 


Phe 


Asn 


Glu 


Cys 


Leu 


Pro 


130 










135 










140 










Val 


Met 


Ser 


Asp 


Gly 


Glu 


Gly 


Gly 


Arg 


Asp 


Leu 


Ala 


Ala 


Ser 


He 


Tyr 


145 








150 










155 










160 


He 


His 


He 


He 


He 


Ser 


He 


Tyr 


Ser 


Thr 


Lys 


Thr 


Val 


Tyr 


Asp 


Asn 








165 










170 










175 




Leu 


Leu 


Phe 


Lys 


Cys 


Thr 


Arg 


Asn 


Lys 


Lys 


Tyr 


Asp 


His 


He 


Val 


Lys 








180 










185 










190 






Thr 


He 


Arg 


Ala 


Gin 


Trp 


Met 


Arg 


Met 


Val 


Ser 


Thr 


Gly 


Asp 


Pro 


Ser 






195 










200 










205 








Arg 


Val 


Ser 


Ala 


Thr 


Gly 


Cys 


Phe 


Thr 
















210 










215 





















<210> 58 
<211> 1395 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) - - (1395) 

<400> 58 

atg gat gcg cat ggt etc aac egg cga tec gtc gcg ggt cag tgc gac 

Met Asp Ala His Gly Leu Asn Arg Arg Ser Val Ala Gly Gin Cys Asp 
1 5 10 15 

ggg ttg ttt cac gtg ata ctt ccg cga ggg ttt ate etc gcg aac aat 
Gly Leu Phe His Val He Leu Pro Arg Gly Phe He Leu Ala Asn Asn 
20 25 30 

att acg tgc ggt gaa egg cag egg ttt ttt gca cac act tgg ttc get 
He Thr Cys Gly Glu Arg Gin Arg Phe Phe Ala His Thr Trp Phe Ala 
35 40 45 



130 
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gca tct gga cgc acg tct aag act tta tac gtg tgg gga egg gta ttt 192 
Ala Ser Gly Arg Thr Ser Lys Thr Leu Tyr Val Trp Gly Arg Val Phe 
50 55 60 



caa aac acc gac ccg ggc cgc ggg gac ggt ccg tec ggg ccg tgg tec 
Gin Asn Thr Asp Pro Gly Arg Gly Asp Gly Pro Ser Gly Pro Trp Ser 

70 75 80 



ccg ttg ccg eta gat tgg tec gat ccg gtc gcg tac ctg tta gag ggc 
Pro Leu Pro Leu Asp Trp Ser Asp Pro Val Ala Tyr Leu Leu Glu Gly 
275 280 285 

gat ttg ttt tta gga gec aag cag aat gcg ttt gtc gat tct ata gaa 

131 



240 



288 



65 

gga ctg gcg att agt ctg cct ctg ttt acc aca aat gga aaa ttt cat 

Gly Leu Ala He Ser Leu Pro Leu Phe Thr Thr Asn Gly Lys Phe His 
85 90 95 

ccg ttt gat gta gtt ata etc aag gec gat acg cct gac tct gga age 336 

Pro Phe Asp Val Val He Leu Lys Ala Asp Thr Pro Asp Ser Gly Ser 
100 105 HO 

teg tgg acc gtg aag ttc ttg tat atg tea tta att gcg get tac aga 384 

Ser Trp Thr Val Lys Phe Leu Tyr Met Ser Leu He Ala Ala Tyr Arg 
115 120 125 

aac gca atg cga ggt tta aaa gat aaa gtt teg caa tgt acc gat gec 432 

Asn Ala Met Arg Gly Leu Lys Asp Lys Val Ser Gin Cys Thr Asp Ala 
130 135 140 

gec gtt gac ggt gag gtt cat cct eta acc gtc tta aaa gaa gcg ttg 4 80 

Ala Val Asp Gly Glu Val His Pro Leu Thr Val Leu Lys Glu Ala Leu 

145 150 155 160 

gta tea ccg gac act get acg cga ccc gtg tec gcg tgc aac cct eta 528 

Val Ser Pro Asp Thr Ala Thr Arg Pro Val Ser Ala Cys Asn Pro Leu 
165 170 175 

cag atg ttg acc gga etc tta cag tct agg gta egg gac gac tac gtg 576 
Gin Met Leu Thr Gly Leu Leu Gin Ser Arg Val Arg Asp Asp Tyr Val 
180 185 190 

aca cac cac cgt gcg etc gaa cgc cca ggt aat gtg agg gga caa gta 624 

Thr His His Arg Ala Leu Glu Arg Pro Gly Asn Val Arg Gly Gin Val 
195 200 205 

ate gec ccg acg cgc acc gag atg cca aac gga teg cca agt cgt gta 672 
He Ala Pro Thr Arg Thr Glu Met Pro Asn Gly Ser Pro Ser Arg Val 
210 215 220 



720 



768 



agg ctt gga ttc cgc cct ccc aaa caa gec aac tat cca aag acg tgg 
Arg Leu Gly Phe Arg Pro Pro Lys Gin Ala Asn Tyr Pro Lys Thr Trp 
225 230 235 240 

gcg cag gcg cgt cac gtt ttc teg tct cgc gca tat tac gtg tgc gta 
Ala Gin Ala Arg His Val Phe Ser Ser Arg Ala Tyr Tyr Val Cys Val 
245 250 255 

tat gat aac gaa gaa eta gat acc aag tgg cag egg caa gat ccg egg 816 
Tyr Asp Asn Glu Glu Leu Asp Thr Lys Trp Gin Arg Gin Asp Pro Arg 
260 265 270 



864 



912 



Asp Leu Phe Leu Gly Ala Lys Gin Asn Ala Phe Val Asp Ser lie Glu 
290 295 300 



aag acg tgc agg tgt cag aac tat acc att aag caa ttt ttt ccg gtt 
Lys Thr Cys Arg Cys Gin Asn Tyr Thr lie Lys Gin Phe Phe Pro Val 
305 310 315 320 

ttg ata aat agg gac aac gaa aca gtc gac tta att aag gag cat ttt 
Leu lie Asn Arg Asp Asn Glu Thr Val Asp Leu lie Lys Glu His Phe 
325 330 335 

ata gag gcg tgc ttc gtg att aga aac cag gtg tea gag agg age get 
lie Glu Ala Cys Phe Val lie Arg Asn Gin Val Ser Glu Arg Ser Ala 
340 345 350 

tgg 9 ta aa 9 9 C 9 9 C 9 ct 9 tt:t C 9 C aac 9 at a< 3 t aac ac 9 tat fc 99 aa 9 
Trp Val Lys Ala Ala Leu Phe Arg Asn Asp Ser Asn Thr Tyr Trp Lys 
355 360 365 

gat gtt ttg gga tta tgg gag cat ggg cct cat aag ctg ggt acg get 
Asp Val Leu Gly Leu Trp Glu His Gly Pro His Lys Leu Gly Thr Ala 
370 375 380 

ata aaa eta cca aca teg gaa cct tgc aat gec gac gta aac tgg age 
lie Lys Leu Pro Thr Ser Glu Pro Cys Asn Ala Asp Val Asn Trp Ser 
385 390 395 400 

tgg etc ctg tgt gac gag gat ata act egg tea att age ggg cag tct 
Trp Leu Leu Cys Asp Glu Asp lie Thr Arg Ser He Ser Gly Gin Ser 
405 410 415 

act gtt tgc tta gtt gtc teg cct acc ctg acc gec tgg ctg gtg etc 
Thr Val Cys Leu Val Val Ser Pro Thr Leu Thr Ala Trp Leu Val Leu 
420 425 430 

ccg ggg ggc ttt gtt att aaa ggc cgc tac gac eta tea age gag gat 
Pro Gly Gly Phe Val He Lys Gly Arg Tyr Asp Leu Ser Ser Glu Asp 
435 440 445 

tta atg ttt gtg get teg aga tat ggc cac cca gcg teg tea cat tct 
Leu Met Phe Val Ala Ser Arg Tyr Gly His Pro Ala Ser Ser His Ser 
450 455 460 



taa 



465 



<210> 59 
<211> 464 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 59 

Met Asp Ala His Gly Leu Asn Arg Arg Ser Val Ala Gly Gin Cys Asp 

15 10 15 

Gly Leu Phe His Val He Leu Pro Arg Gly Phe He Leu Ala Asn Asn 

20 25 30 

He Thr Cys Gly Glu Arg Gin Arg Phe Phe Ala His Thr Trp Phe Ala 

35 40 45 

Ala Ser Gly Arg Thr Ser Lys Thr Leu Tyr Val Trp Gly Arg Val Phe 
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100 










1 n c: 
1Uj 










1 1 u 






Ser 
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Ala 
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Ala 


Met 


Arg 
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Leu 
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Lys 


Vdl 


Se 
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Asp 
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13 5 




















Ala 


va 1 


Asp 


Gly Glu 


val 


His 


Pro 


Leu 
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17 0 










17 5 




Gin 


Met 


Leu 
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Gly 


Leu 


Leu 


Lain 


Ser 


Arg 


vai 


Arg 
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Asp 


Tyr 


vai 
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Vdl 
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bin 
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195 
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Glu 
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Pro 
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Ser 


Pro 


Ser 
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Val 
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22 0 










Arg 


Leu 


Gly 


Phe 


Arg 


Pro 


Pro 


Lys 


Gin 


Ala 


Asn 


Tyr 


Pro 


Lys 


Thr 


Trp 


225 










230 










235 










24 0 


Ala 


Gin 


Ala 


Arg 


His 


Val 


Phe 


Ser 


Ser 


Arg 


Ala 


Tyr 


Tyr 


Val 


Cys 


Val 
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250 










2 55 




Tyr 


Asp 


Asn 


Glu 


Glu 


Leu 


Asp 
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Lys 


Trp 


Gin 


Arg 


Gin 


Asp 


Pro 


Arg 








260 
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2 70 
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Trp 


Glu 


ill s 
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Leu 


Pro 


Thr 


Ser 


Glu 


Pro 


Cys 


Asn 
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Asp 
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Asn 


Trp 


Ser 


385 










390 
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400 


Trp 
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Cys 


Asp 


Glu 


Asp 
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Thr 


Arg 


Ser 


He 


Ser 
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Gin 


Ser 










405 










410 










415 




Thr 
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Val 


Val 
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Leu 
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<210> 60 
<211> 1011 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 
<221> CDS 
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<222> (1) 



(1011) 



<400> 60 

atg gcc acc cag cgt cgt cac att ctt aaa teg ttt tta aac aag gaa 

Met Ala Thr Gin Arg Arg His He Leu Lys Ser Phe Leu Asn Lys Glu 

15 10 15 

tgc ata tgg ttg cga cac ccg ggt acg tec gcg ttt gtt egg gta tac 

Cys He Trp Leu Arg His Pro Gly Thr Ser Ala Phe Val Arg Val Tyr 

20 25 30 



48 



96 



acc gcg acc act gcg cat tct gcc gtt ttt gac ccg ccg gta act age 
Thr Ala Thr Thr Ala His Ser Ala Val Phe Asp Pro Pro Val Thr Ser 
35 40 45 

gaa aat gcg atg tea ctt aac ttt tta aat gtt atg ate gta ate atg 
Glu Asn Ala Met Ser Leu Asn Phe Leu Asn Val Met He Val He Met 
50 55 60 

aaa cca aaa gaa ttt ggc ccg tgc gtt acc gtg tac atg aat gga gat 
Lys Pro Lys Glu Phe Gly Pro Cys Val Thr Val Tyr Met Asn Gly Asp 
65 70 75 80 

att eta gat ttt tgt gcc acg gaa tct gtc gcc ata agg gac gtg cct 
He Leu Asp Phe Cys Ala Thr Glu Ser Val Ala He Arg Asp Val Pro 
85 90 95 

ggt agg gcg gac ctg tgt tta att cgt ttt ggt acc ctt tct aat gcg 
Gly Arg Ala Asp Leu Cys Leu He Arg Phe Gly Thr Leu Ser Asn Ala 
100 105 HO 



144 



192 



240 



288 



336 



ccg agg age gtt ccg ata ccc ggg ccg ttg aac cca cat ccg cga gaa 
Pro Arg Ser Val Pro He Pro Gly Pro Leu Asn Pro His Pro Arg Glu 
115 120 125 

acc gtg ccc ggg eta aca aaa cag gaa att ata tac act teg caa aca 
Thr Val Pro Gly Leu Thr Lys Gin Glu He He Tyr Thr Ser Gin Thr 
130 135 140 

gtg cca aga gga cag ata cca gat gcc ata aag ggg aaa gag ttc cac 
Val Pro Arg Gly Gin He Pro Asp Ala He Lys Gly Lys Glu Phe His 
14 5 150 155 160 

caa ata aat ccg ttt ttg tgg ttt gac gga ggg gcg ttt tgg caa ctg 
Gin He Asn Pro Phe Leu Trp Phe Asp Gly Gly Ala Phe Trp Gin Leu 
165 170 175 

ttc etc tct gtg gat ttt atg ctg etc tgt ccc gca etc gac aca gtt 
Phe Leu Ser Val Asp Phe Met Leu Leu Cys Pro Ala Leu Asp Thr Val 
180 185 190 



384 



432 



480 



528 



576 



ccg tec ctg gcc aga ate gtt ggg ctt ctt aca cag tgc gat aag age 
Pro Ser Leu Ala Arg He Val Gly Leu Leu Thr Gin Cys Asp Lys Ser 
195 200 205 



624 



acg tgt aaa att tgt acg ggg gcc cac gta cac gtt aac ccg tat cgc 

Thr Cys Lys He Cys Thr Gly Ala His Val His Val Asn Pro Tyr Arg 
210 215 220 

gga tac acg cca cct gac teg caa ggg acc tea ccc teg tgc ccc tgc 

Gly Tyr Thr Pro Pro Asp Ser Gin Gly Thr Ser Pro Ser Cys Pro Cys 

134 



672 



720 



225 

ctt ate teg 
Leu He Ser 



cac gtt aat 
His Val Asn 



gtg acc aag 
Val Thr Lys 
275 

gac gec atg 
Asp Ala Met 
290 

teg eta ccg 
Ser Leu Pro 
305 

atg eta tac 
Met Leu Tyr 



tga 



230 

tgc ggg gee 
Cys Gly Ala 
245 

ctt ttg ggc 
Leu Leu Gly 
260 

ctg cgt tta 
Leu Arg Leu 



teg ggc gtc 
Ser Gly Val 



tgg gee etc 
Trp Ala Leu 
310 

ggc tgc cag 
Gly Cys Gin 
325 



agg cgc gcg 
Arg Arg Ala 



ctg etc ttt 
Leu Leu Phe 
265 

aaa aga aac 
Lys Arg Asn 
280 

acg gee gaa 
Thr Ala Glu 
295 

att cgc ctg 
He Arg Leu 



aac tta aaa 
Asn Leu Lys 



235 

gcg gat gtc 
Ala Asp Val 
250 

gac ccc aaa 
Asp Pro Lys 



cca cgc ccg 
Pro Arg Pro 



ggg acc gag 
Gly Thr Glu 
300 

ccg gat tta 
Pro Asp Leu 
315 

age ate tgc 
Ser He Cys 
330 



ctg gtt acc 
Leu Val Thr 
255 

gec tec ccc 
Ala Ser Pro 
270 

gta ccg ata 
Val Pro He 
285 

gtg caa ccc 
Val Gin Pro 



gee agt cgc 
Ala Ser Arg 



tta cgt tct 
Leu Arg Ser 
335 



240 

gga 76 8 
Gly 



aaa 816 
Lys 



gag 864 
Glu 



act 912 
Thr 



gtg 960 

Val 

320 

tat 1008 
Tyr 



1011 



<210> 61 
<211> 336 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 61 

Met Ala Thr Gin Arg Arg His He Leu Lys Ser Phe Leu Asn Lys Glu 

15 10 15 

Cys He Trp Leu Arg His Pro Gly Thr Ser Ala Phe Val Arg Val Tyr 

20 25 30 

Thr Ala Thr Thr Ala His Ser Ala Val Phe Asp Pro Pro Val Thr Ser 

35' 40 45 

Glu Asn Ala Met Ser Leu Asn Phe Leu Asn Val Met He Val He Met 

50 55 60 

Lys Pro Lys Glu Phe Gly Pro Cys Val Thr Val Tyr Met Asn Gly Asp 
65 70 75 80 

He Leu Asp Phe Cys Ala Thr Glu Ser Val Ala He Arg Asp Val Pro 

85 90 95 

Gly Arg Ala Asp Leu Cys Leu He Arg Phe Gly Thr Leu Ser Asn Ala 

100 105 HO 

Pro Arg Ser Val Pro He Pro Gly Pro Leu Asn Pro His Pro Arg Glu 

115 12 0 12 5 

Thr Val Pro Gly Leu Thr Lys Gin Glu He He Tyr Thr Ser Gin Thr 

130 135 140 

Val Pro Arg Gly Gin He Pro Asp Ala He Lys Gly Lys Glu Phe His 
145 150 155 160 

Gin He Asn Pro Phe Leu Trp Phe Asp Gly Gly Ala Phe Trp Gin Leu 

165 170 175 

Phe Leu Ser Val Asp Phe Met Leu Leu Cys Pro Ala Leu Asp Thr Val 

180 185 190 

Pro Ser Leu Ala Arg He Val Gly Leu Leu Thr Gin Cys Asp Lys Ser 



135 



195 200 205 

Thr Cys Lys He Cys Thr Gly Ala His Val His Val Asn Pro Tyr Arg 

210 215 220 

Gly Tyr Thr Pro Pro Asp Ser Gin Gly Thr Ser Pro Ser Cys Pro Cys 
225 230 235 240 

Leu He Ser Cys Gly Ala Arg Arg Ala Ala Asp Val Leu Val Thr Gly 

245 250 255 

His Val Asn Leu Leu Gly Leu Leu Phe Asp Pro Lys Ala Ser Pro Lys 

260 265 270 

Val Thr Lys Leu Arg Leu Lys Arg Asn Pro Arg Pro Val Pro He Glu 

275 280 285 

Asp Ala Met Ser Gly Val Thr Ala Glu Gly Thr Glu Val Gin Pro Thr 

290 295 300 

Ser Leu Pro Trp Ala Leu He Arg Leu Pro Asp Leu Ala Ser Arg Val 
305 310 315 320 

Met Leu Tyr Gly Cys Gin Asn Leu Lys Ser lie Cys Leu Arg Ser Tyr 
325 330 335 



<210> 62 
<211> 984 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (984) 

<400> 62 

atg ttg tta acc age tat cgc gaa cgt ctt caa aat aac ttg cgc gtg 48 
Met Leu Leu Thr Ser Tyr Arg Glu Arg Leu Gin Asn Asn Leu Arg Val 
15 10 15 

gtc acg gac ggt ggt tgc gaa aac tgg ttt egg caa ccg ccc gtt att 96 
Val Thr Asp Gly Gly Cys Glu Asn Trp Phe Arg Gin Pro Pro Val He 
20 25 30 

ata teg ggc aac gac aag acc gaa cga atg gec cac cca tgc ttg gga 144 
He Ser Gly Asn Asp Lys Thr Glu Arg Met Ala His Pro Cys Leu Gly 
35 40 45 

gtt att cac gcg gtt aat gca tat agt tct gtt tta gac gat tat ctt 192 
Val He His Ala Val Asn Ala Tyr Ser Ser Val Leu Asp Asp Tyr Leu 
50 55 60 

caa acg tac cgc aga gtt caa gaa ccc atg ccg gec cct acg ttg gga 240 
Gin Thr Tyr Arg Arg Val Gin Glu Pro Met Pro Ala Pro Thr Leu Gly 
65 70 75 80 

aag ccc cga att tct age cac get acg ttg ccc egg tta acc gag gaa 288 
Lys Pro Arg He Ser Ser His Ala Thr Leu Pro Arg Leu Thr Glu Glu 
85 90 95 

etc aca aac tac ctt aaa caa aca tgt tgt egg gtc caa atg gca aac 336 
Leu Thr Asn Tyr Leu Lys Gin Thr Cys Cys Arg Val Gin Met Ala Asn 
100 105 110 



gec aag gac cag tac atg gaa tac caa teg gec caa egg acc cac gaa 
Ala Lys Asp Gin Tyr Met Glu Tyr Gin Ser Ala Gin Arg Thr His Glu 
115 120 125 



384 



136 
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get ttc eta gag tgc ccc gtt tat gca gaa ctg egg cag ttt tta gec 432 

Ala Phe Leu Glu Cys Pro Val Tyr Ala Glu Leu Arg Gin Phe Leu Ala 
130 135 140 

aac ctg teg tea ttt tta aat ggg agt tac gtg ccc ggg gtt tgt tgc 

Asn Leu Ser Ser Phe Leu Asn Gly Ser Tyr Val Pro Gly Val Cys Cys 



145 



150 155 160 



210 215 220 

aca tgg ate gtg gtc gee att ate age gtt ctt etc teg teg gtt gaa 

Thr Trp He Val Val Ala He He Ser Val Leu Leu Ser Ser Val Glu 

225 230 235 240 



gta age aga tgc ttt tta agt tct ggc age cgt ata gca tea cgc gac 
Val Ser Arg Cys Phe Leu Ser Ser Gly Ser Arg He Ala Ser Arg Asp 
310 315 320 



305 



tgg eta aat ccg gca ggc gaa tga 
Trp Leu Asn Pro Ala Gly Glu 
325 



480 



528 



ctt gag ccc ttt caa caa caa tta ate atg cac acg ttt tat ttt ate 

Leu Glu Pro Phe Gin Gin Gin Leu He Met His Thr Phe Tyr Phe He 

165 170 175 

gcg tct ate aaa gca ccc gaa aag aca cac cag tta ttt gee acg ttt 

Ala Ser He Lys Ala Pro Glu Lys Thr His Gin Leu Phe Ala Thr Phe 
180 185 190 

aag caa cac ttt ggt tta ttt gaa acc acg gac gac gtg tta cag acg 

Lys Gin His Phe Gly Leu Phe Glu Thr Thr Asp Asp Val Leu Gin Thr 
195 200 205 

ttt aag cag aaa gee age gtt ttt gtt ate ccg cgt cgt cac ggg aaa 672 

Phe Lys Gin Lys Ala Ser Val Phe Val He Pro Arg Arg His Gly Lys 



576 



624 



720 



768 



aac gtt cac gtg ggt tac gtg gee cac caa aaa cac gtc gee aat gee 
Asn Val His Val Gly Tyr Val Ala His Gin Lys His Val Ala Asn Ala 
245 250 255 

gtt ttc tec gag gtc ate gec acg eta tec agg tgg ttt ccg gcg aag 
Val Phe Ser Glu Val He Ala Thr Leu Ser Arg Trp Phe Pro Ala Lys 
260 265 270 

aac ctg aac ata aag aag gaa aac gga acc ate gtg tac gcg age ccc 
Asn Leu Asn He Lys Lys Glu Asn Gly Thr He Val Tyr Ala Ser Pro 
275 280 285 

gga egg egg ccg agt teg ctg atg tgt gcg acg tgc ttc aat aag aac 912 
Gly Arg Arg Pro Ser Ser Leu Met Cys Ala Thr Cys Phe Asn Lys Asn 
290 295 300 



816 



864 



960 



984 



<210> 63 

<211> 327 

<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 




5 10 15 



val 


Thr 


Asp 


Gly Gly 


Cys 


Glu 


Asn 


Trp 


Phe 


Arg 


Gin 


Pro 


Pro 


Val 


He 








20 










25 










30 






lie 


Ser 


Gly 
35 


Asn 


Asp 


Lys 


Thr 


Glu 
40 


Arg 


Met 


Ala 


His 


Pro 
45 


Cys 


Leu 


Gly 


Val 


lie 
50 


His 


Ala 


Val 


Asn 


Ala 
55 


Tyr 


Ser 


Ser 


Val 


Leu 
60 


Asp 


Asp 


Tyr 


Leu 


Gin 


Thr 


Tyr 


Arg 


Arg 


Val 


Gin 


Glu 


Pro 


Met 


Pro 


Ala 


Pro 


Thr 


Leu 


Gly 


65 










70 










75 










80 


Lys 


Pro 


Arg 


He 


Ser 


Ser 


His 


Ala 


Thr 


Leu 


Pro 


Arg 


Leu 


Thr 


Glu 


Glu 








85 










90 










95 




Leu 


Thr 


Asn 


Tyr 
100 


Leu 


Lys 


Gin 


Thr 


Cys 
105 


Cys 


Arg 


Val 


Gin 


Met 
110 


Ala 


Asn 


Ala 


Lys 


Asp 


Gin 


Tyr 


Met 


Glu 


Tyr 


Gin 


Ser 


Ala 


Gin 


Arg 


Thr 


His 


Glu 




115 










120 










125 








Ala 


Phe 
130 


Leu 


Glu 


Cys 


Pro 


Val 
135 


Tyr 


Ala 


Glu 


Leu 


Arg 
140 


Gin 


Phe 


Leu 


Ala 


Asn 


Leu 


Ser 


Ser 


Phe 


Leu 


Asn 


Gly 


Ser 


Tyr 


Val 


Pro 


Gly 


Val 


Cys 


Cys 


145 










150 










155 










160 


Leu 


Glu 


Pro 


Phe 


Gin 
165 


Gin 


Gin 


Leu 


He 


Met 
170 


His 


Thr 


Phe 


Tyr 


Phe 
175 


He 


Ala 


Ser 


lie 


Lys 
180 


Ala 


Pro 


Glu 


Lys 


Thr 
185 


His 


Gin 


Leu 


Phe 


Ala 
190 


Thr 


Phe 


Lys 


Gin 


His 


Phe 


Gly 


Leu 


Phe 


Glu 


Thr 


Thr 


Asp 


Asp 


Val 


Leu 


Gin 


Thr 




195 










200 










205 








Phe 


Lys 


Gin 


Lys 


Ala 


Ser 


Val 


Phe 


Val 


He 


Pro 


Arg 


Arg 


His 


Gly 


Lys 




210 










215 










220 








Glu 


Thr 


Trp 


He 


Val 


Val 


Ala 


lie 


He 


Ser 


Val 


Leu 


Leu 


Ser 


Ser 


Val 


225 








230 










235 










240 


Asn 


Val 


His 


Val 


Gly 

245 


Tyr 


Val 


Ala 


His 


Gin 
250 


Lys 


His 


Val 


Ala 


Asn 
255 


Ala 


Val 


Phe 


Ser 


Glu 
260 


Val 


He 


Ala 


Thr 


Leu 
265 


Ser 


Arg 


Trp 


Phe 


Pro 
270 


Ala 


Lys 


Asn 


Leu 


Asn 
275 


He 


Lys 


Lys 


Glu 


Asn 
280 


Gly 


Thr 


He 


Val 


Tyr 
285 


Ala 


Ser 


Pro 


Gly 


Arg 
290 


Arg 


Pro 


Ser 


Ser 


Leu 
295 


Met 


Cys 


Ala 


Thr 


Cys 
300 


Phe 


Asn 


Lys 


Asn 


Val 


Ser 


Arg 


Cys 


Phe 


Leu 


Ser 


Ser 


Gly 


Ser 


Arg 


He 


Ala 


Ser 


Arg 


Asp 


305 










310 










315 










320 


Trp 


Leu 


Asn 


Pro 


Ala 
325 


Gly 


Glu 





















<210> 64 
<211> 984 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . - (984) 

<400> 64 

atg ttc cct tea agt ttt ttg aat aac ggg cac ccc gaa acg gaa cgc 

Met Phe Pro Ser Ser Phe Leu Asn Asn Gly His Pro Glu Thr Glu Arg 
1 5 10 15 

cgc ttc gtt aaa ggc gtt cag tta get tta gac ctg tgt gac aac act 
Arg Phe Val Lys Gly Val Gin Leu Ala Leu Asp Leu Cys Asp Asn Thr 
20 25 30 



138 



ccg gga cag ttt aaa eta gtt gaa aca cct ctt aat agt ttt ctt ctg 144 
Pro Gly Gin Phe Lys Leu Val Glu Thr Pro Leu Asn Ser Phe Leu Leu 
35 40 45 

gta tec aac gtt ctg ccg gaa teg cgc ccg gtt aga gac tgt ccg cag 192 
Val Ser Asn Val Leu Pro Glu Ser Arg Pro Val Arg Asp Cys Pro Gin 
50 55 60 

ccg gaa ggg ttt gac ttt gaa cac att cac etc cca aaa eta aca cgc 240 
Pro Glu Gly Phe Asp Phe Glu His lie His Leu Pro Lys Leu Thr Arg 
65 70 75 80 

atg cag cgt gtc ctg ggg cga tac tgc gac cat gtt aac aac gac gac 288 
Met Gin Arg Val Leu Gly Arg Tyr Cys Asp His Val Asn Asn Asp Asp 
85 90 95 

acg tgc gtt aac gta aag gca agt tec teg aat tea cag ggt gec ttg 336 
Thr Cys Val Asn Val Lys Ala Ser Ser Ser Asn Ser Gin Gly Ala Leu 
100 105 110 

ttt tat ctg ccg tat gga cag gac gaa tgg aat tgg gcg etc acg tta 384 
Phe Tyr Leu Pro Tyr Gly Gin Asp Glu Trp Asn Trp Ala Leu Thr Leu 
115 120 125 

agg aaa gac aag ttg gtt aaa atg get gta gag ggc ttg tea aat ccc 432 
Arg Lys Asp Lys Leu Val Lys Met Ala Val Glu Gly Leu Ser Asn Pro 
130 135 140 

acg ace tgg aaa ggt tta gag ccc gtg gat cct tta ccg etc ata tgg 480 
Thr Thr Trp Lys Gly Leu Glu Pro Val Asp Pro Leu Pro Leu lie Trp 
145 150 155 160 

ctt ctg ttt tac ggt tec egg teg ttc tgt egg gaa cca gag tgc eta 528 
Leu Leu Phe Tyr Gly Ser Arg Ser Phe Cys Arg Glu Pro Glu Cys Leu 
165 170 175 

tat gaa cgc aat ttt ggt atg aag gga ccc ata etc tta ccg cca cat 576 
Tyr Glu Arg Asn Phe Gly Met Lys Gly Pro lie Leu Leu Pro Pro His 
180 185 190 

atg tat gec ccc caa aag gac gta atg act ttt gtc cat cat gta att 624 
Met Tyr Ala Pro Gin Lys Asp Val Met Thr Phe Val His His Val lie 
195 200 205 

aag tac gtt aaa ttt tta tac gtg aac gee ggt ggg ggt ctt gaa act 672 
Lys Tyr Val Lys Phe Leu Tyr Val Asn Ala Gly Gly Gly Leu Glu Thr 
210 215 220 

gaa ccg tec ccg ccg ttc gag gec teg egg ttg cgc gca gec ate get 72 0 
Glu Pro Ser Pro Pro Phe Glu Ala Ser Arg Leu Arg Ala Ala lie Ala 
225 230 235 240 

cgt etc ggg gac gtg gaa gcg gat gac gca tac ctg tec gca aag tgc 768 
Arg Leu Gly Asp Val Glu Ala Asp Asp Ala Tyr Leu Ser Ala Lys Cys 
245 250 255 

atg ttg tgt cac ctg tac aag caa aac gat acg att teg att cat gaa 816 
Met Leu Cys His Leu Tyr Lys Gin Asn Asp Thr lie Ser lie His Glu 
260 265 270 



139 



aca cac gtg ggc gga gtc ate gca tta ggc gga gac ggt gcg aga tat 
Thr His Val Gly Gly Val He Ala Leu Gly Gly Asp Gly Ala Arg Tyr 
275 280 285 



ata acg tct agt gtt egg get caa egg tgc acg agt egg gga gat ttt 

He Thr Ser Ser Val Arg Ala Gin Arg Cys Thr Ser Arg Gly Asp Phe 

290 295 300 

gtt tta ate ccc ctg tac aac att gaa ggg etc gta age atg ata agg 

Val Leu He Pro Leu Tyr Asn He Glu Gly Leu Val Ser Met He Arg 

305 310 315 320 

gaa cat ggc etc ggc age age taa 

Glu His Gly Leu Gly Ser Ser 
325 



<210> 65 
<211> 327 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 65 

Met Phe Pro Ser Ser Phe Leu Asn Asn Gly His Pro Glu Thr Glu Arg 

1 5 10 15 

Arg Phe Val Lys Gly Val Gin Leu Ala Leu Asp Leu Cys Asp Asn Thr 

20 25 30 

Pro Gly Gin Phe Lys Leu Val Glu Thr Pro Leu Asn Ser Phe Leu Leu 

35 40 45 

Val Ser Asn Val Leu Pro Glu Ser Arg Pro Val Arg Asp Cys Pro Gin 

50 55 60 

Pro Glu Gly Phe Asp Phe Glu His He His Leu Pro Lys Leu Thr Arg 
65 70 75 80 

Met Gin Arg Val Leu Gly Arg Tyr Cys Asp His Val Asn Asn Asp Asp 

85 90 95 

Thr Cys Val Asn Val Lys Ala Ser Ser Ser Asn Ser Gin Gly Ala Leu 

100 105 HO 

Phe Tyr Leu Pro Tyr Gly Gin Asp Glu Trp Asn Trp Ala Leu Thr Leu 

115 120 125 

Arg Lys Asp Lys Leu Val Lys Met Ala Val Glu Gly Leu Ser Asn Pro 

130 135 140 

Thr Thr Trp Lys Gly Leu Glu Pro Val Asp Pro Leu Pro Leu He Trp 
145 150 155 160 

Leu Leu Phe Tyr Gly Ser Arg Ser Phe Cys Arg Glu Pro Glu Cys Leu 

165 170 175 

Tyr Glu Arg Asn Phe Gly Met Lys Gly Pro He Leu Leu Pro Pro His 

180 185 190 

Met Tyr Ala Pro Gin Lys Asp Val Met Thr Phe Val His His Val He 

195 200 205 

Lys Tyr Val Lys Phe Leu Tyr Val Asn Ala Gly Gly Gly Leu Glu Thr 

210 215 220 

Glu Pro Ser Pro Pro Phe Glu Ala Ser Arg Leu Arg Ala Ala He Ala 
225 230 235 240 

Arg Leu Gly Asp Val Glu Ala Asp Asp Ala Tyr Leu Ser Ala Lys Cys 

245 250 255 

Met Leu Cys His Leu Tyr Lys Gin Asn Asp Thr He Ser He His Glu 

260 265 270 

Thr His Val Gly Gly Val lie Ala Leu Gly Gly Asp Gly Ala Arg Tyr 

275 280 285 

He Thr Ser Ser Val Arg Ala Gin Arg Cys Thr Ser Arg Gly Asp Phe 
290 295 300 



140 
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Val Leu He Pro Leu Tyr Asn He Glu Gly Leu Val Ser Met He Arg 

305 310 315 320 

Glu His Gly Leu Gly Ser Ser 
325 



tgc ccc aaa age cct tga 
Cys Pro Lys Ser Pro 
145 150 



<210> 67 
<211> 149 
<212> PRT 



48 



96 



144 



<210> 66 
<211> 450 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 
<221> CDS 
<222> (1) - - (450) 

<400> 66 

atg gec teg gca gca get aaa aaa atg tta att aag teg gag etc gag 
Met Ala Ser Ala Ala Ala Lys Lys Met Leu He Lys Ser Glu Leu Glu 
15 10 15 

teg gaa ate aac aaa aaa ctg tec ate tec gta ttt gac agg ttt ggg 
Ser Glu He Asn Lys Lys Leu Ser He Ser Val Phe Asp Arg Phe Gly 
20 25 30 

gee gac agt gec gtg ttt aac gcg cag tat aag gga ace agg gaa teg 
Ala Asp Ser Ala Val Phe Asn Ala Gin Tyr Lys Gly Thr Arg Glu Ser 
35 40 45 

ctg egg teg tac aac age eta aaa aag aag gac gat ctg gcg acc gtt 192 
Leu Arg Ser Tyr Asn Ser Leu Lys Lys Lys Asp Asp Leu Ala Thr Val 
50 55 60 

gtc gga acg eta gaa acg teg ctg cgt gaa aaa caa age gaa ttg gga 240 
Val Gly Thr Leu Glu Thr Ser Leu Arg Glu Lys Gin Ser Glu Leu Gly 
65 70 75 80 

tta eta aag ggg ttt aac agg aaa aaa att gaa gag ttt gac get gtg 288 
Leu Leu Lys Gly Phe Asn Arg Lys Lys He Glu Glu Phe Asp Ala Val 
85 90 95 

gcg gac gcg gtt cgc gac etc aag gac gag ctg tac gga gaa ctg gag 3 36 
Ala Asp Ala Val Arg Asp Leu Lys Asp Glu Leu Tyr Gly Glu Leu Glu 
100 105 110 

att eta ggt acg ctt gac aat gaa tct gtt ccc gtg gaa gaa gag tec 384 
lie Leu Gly Thr Leu Asp Asn Glu Ser Val Pro Val Glu Glu Glu Ser 
115 120 125 

ccc aag gac gac att att agg tgg aaa ttg gag cgt ctg ccc aga gtg 432 
Pro Lys Asp Asp He lie Arg Trp Lys Leu Glu Arg Leu Pro Arg Val 
130 135 140 



450 



141 



<213> Macaca mulatta rhadinovirus 17577 



<400> 67 

Met Ala Ser Ala Ala Ala Lys Lys Met Leu lie Lys Ser Glu Leu Glu 

15 10 15 

Ser Glu He Asn Lys Lys Leu Ser He Ser Val Phe Asp Arg Phe Gly 

20 25 30 

Ala Asp Ser Ala Val Phe Asn Ala Gin Tyr Lys Gly Thr Arg Glu Ser 

35 40 45 

Leu Arg Ser Tyr Asn Ser Leu Lys Lys Lys Asp Asp Leu Ala Thr Val 

50 55 60 

Val Gly Thr Leu Glu Thr Ser Leu Arg Glu Lys Gin Ser Glu Leu Gly 
65 70 75 80 

Leu Leu Lys Gly Phe Asn Arg Lys Lys He Glu Glu Phe Asp Ala Val 

85 90 95 

Ala Asp Ala Val Arg Asp Leu Lys Asp Glu Leu Tyr Gly Glu Leu Glu 

100 105 110 

He Leu Gly Thr Leu Asp Asn Glu Ser Val Pro Val Glu Glu Glu Ser 

115 120 125 

Pro Lys Asp Asp He He Arg Trp Lys Leu Glu Arg Leu Pro Arg Val 

130 135 140 

Cys Pro Lys Ser Pro 
145 



<210> 68 
<211> 1308 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) - . (1308) 

<400> 68 

atg aat ctg ttc ccg tgg aag aag agt ccc cca agg acg aca tta tta 

Met Asn Leu Phe Pro Trp Lys Lys Ser Pro Pro Arg Thr Thr Leu Leu 

1 5 10 15 

ggt gga aat tgg age gtc tgc cca gag tgt gec cca aaa gec ctt gat 
Gly Gly Asn Trp Ser Val Cys Pro Glu Cys Ala Pro Lys Ala Leu Asp 
20 25 30 

ccc att ccc aag gtt cag act gac gtc gac aga aca gca teg tec cat 
Pro He Pro Lys Val Gin Thr Asp Val Asp Arg Thr Ala Ser Ser His 
35 40 45 

ata acc gtc att aaa aca cgt aag acg ate gec caa ctg aag ata cct 
He Thr Val lie Lys Thr Arg Lys Thr He Ala Gin Leu Lys He Pro 
50 55 60 

aac aac tgg ggc cag tgt agt cac cag gcg acg gac tgg acc gec gtg 
Asn Asn Trp Gly Gin Cys Ser His Gin Ala Thr Asp Trp Thr Ala Val 
65 70 75 80 

etc gga cgc ggc teg tat ggt gtg gtg agg tec atg tct etc ggc cgc 
Leu Gly Arg Gly Ser Tyr Gly Val Val Arg Ser Met Ser Leu Gly Arg 
85 90 95 



tg c 9 tt: aa 9 cat tt:t 99 c a 9 c c 99 c 9 t 9 a 9 tt:t: ttt tac gag tgc att 



Cys Val Lys His Phe Gly Ser Arg Arg Glu Phe Phe Tyr Glu Cys lie 
100. 105 110 



ttt aac gat ata gta cgc gcc tgc egg gag aaa cat ccc ctg aac cgc 
Phe Asn Asp lie Val Arg Ala Cys Arg Glu Lys His Pro Leu Asn Arg 
115 120 125 

999 99ft 9 ac cgt ata eta tgt ttc eta gag ccg tgc gta cca tgt cgc 
Gly Gly Asp Arg lie Leu Cys Phe Leu Glu Pro Cys Val Pro Cys Arg 
130 135 140 

gcc ctg ata ttc ccg cag tta aca ggg aat ctg eta aac gcg gat ctt 
Ala Leu lie Phe Pro Gin Leu Thr Gly Asn Leu Leu Asn Ala Asp Leu 
145 150 155 160 

aaa cac gtg aac cct gaa egg ctg gcc gtt gaa ttc tct gag ctg agg 
Lys His Val Asn Pro Glu Arg Leu Ala Val Glu Phe Ser Glu Leu Arg 
165 170 175 

gaa ggc gtt agt ttt eta aat aat ata tgt ggc ate gtt cac tgt gac 
Glu Gly Val Ser Phe Leu Asn Asn lie Cys Gly lie Val His Cys Asp 
180 185 190 

ate agt cca gaa aat ata ctg ata aag ggg gaa ctg aca act gcg tac 
lie Ser Pro Glu Asn lie Leu lie Lys Gly Glu Leu Thr Thr Ala Tyr 
195 200 205 

ggg aga ctt atg ate gga gat eta ggg tec gcc tct tta cac acg gga 
Gly Arg Leu Met lie Gly Asp Leu Gly Ser Ala Ser Leu His Thr Gly 
210 215 220 

acc cct tgg acc gga gtg atg gtg acc tec aaa etc ggg ttc gtg cag 
Thr Pro Trp Thr Gly Val Met Val Thr Ser Lys Leu Gly Phe Val Gin 
225 230 235 240 

cac acg tac cat ttt aag gca ccg gcc aga ttt ate tgt aag cac att 
His Thr Tyr His Phe Lys Ala Pro Ala Arg Phe lie Cys Lys His lie 
245 250 255 

tac egg ccg teg tgt etc etc tac egg tgt ctg ctg teg tgc gcc ggg 
Tyr Arg Pro Ser Cys Leu Leu Tyr Arg Cys Leu Leu Ser Cys Ala Gly 
260 265 270 

ggc ccg cag gcg cat atg eta aat cag ccg ttc caa ate act cca caa 
Gly Pro Gin Ala His Met Leu Asn Gin Pro Phe Gin lie Thr Pro Gin 
275 280 285 

etc ggt etc aca att gac ata teg tec ctg ggt tat agt ttg eta gca 
Leu Gly Leu Thr lie Asp lie Ser Ser Leu Gly Tyr Ser Leu Leu Ala 
290 295 300 

tgc eta gag aaa tat ctt cag cca get gac cca ttt ccc cag cag gga 
Cys Leu Glu Lys Tyr Leu Gin Pro Ala Asp Pro Phe Pro Gin Gin Gly 
305 310 315 320 

gcg ttg gcg gac get tec tec gaa tec gcc cac cca ttg ttc tat ttg 
Ala Leu Ala Asp Ala Ser Ser Glu Ser Ala His Pro Leu Phe Tyr Leu 
325 330 335 



cgt tgc atg gtg cca aga gta gtc ate gcc gag att ttt tct gtt gcc 
Arg Cys Met Val Pro Arg Val Val lie Ala Glu lie Phe Ser Val Ala 



G 



is? lui Si Q 



340 345 350 

tgg 9 ac gtt cca etc gat tta ggc att gac tea tct ggc cac gcg cca 1104 

Trp Asp Val Pro Leu Asp Leu Gly lie Asp Ser Ser Gly His Ala Pro 

355 360 365 

get att ccc ctg aga gaa gcg tac agg egg ttt ttt gee aac cag tgt 1152 

Ala He Pro Leu Arg Glu Ala Tyr Arg Arg Phe Phe Ala Asn Gin Cys 

370 375 380 

agt tta tat agg gcg caa tac aaa gag gat gcg tta gaa aac gca tec 1200 

Ser Leu Tyr Arg Ala Gin Tyr Lys Glu Asp Ala Leu Glu Asn Ala Ser 

385 390 395 400 

teg egg ctg tgt aac tea aaa ctt aaa eta gtt etc cag aag ctg ttg 1248 

Ser Arg Leu Cys Asn Ser Lys Leu Lys Leu Val Leu Gin Lys Leu Leu 

405 410 415 

gtc agg gac tac ttt agt cat tgc gga aac tgc gga gat cat gga ttt 12 96 

Val Arg Asp Tyr Phe Ser His Cys Gly Asn Cys Gly Asp His Gly Phe 

42.0 42 5 43 0 

ttt etc aga tga 1308 
Phe Leu Arg 
435 



<210> 69 
<211> 435 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 



<400> 69 



Met 


Asn 


Leu 


Phe 


Pro 


Trp 


Lys 


Lys 


Ser 


Pro 


Pro 


Arg 


Thr 


Thr 


Leu 


Leu 


1 








5 










10 










15 




Gly 


Gly 


Asn 


Trp 


Ser 


Val 


Cys 


Pro 


Glu 


Cys 


Ala 


Pro 


Lys 


Ala 


Leu 


Asp 








20 










25 










30 






Pro 


He 


Pro 


Lys 


Val 


Gin 


Thr 


Asp 


Val 


Asp 


Arg 


Thr 


Ala 


Ser 


Ser 


His 






35 










40 










45 








He 


Thr 


Val 


He 


Lys 


Thr 


Arg 


Lys 


Thr 


He 


Ala 


Gin 


Leu 


Lys 


lie 


Pro 




50 










55 










60 










Asn 


Asn 


Trp 


Gly 


Gin 


Cys 


Ser 


His 


Gin 


Ala 


Thr 


Asp 


Trp 


Thr 


Ala 


Val 


65 










70 










75 










80 


Leu 


Gly 


Arg 


Gly 


Ser 


Tyr 


Gly 


Val 


Val 


Arg 


Ser 


Met 


Ser 


Leu 


Gly 


Arg 










85 










90 










95 




Cys 


Val 


Lys 


His 


Phe 


Gly 


Ser 


Arg 


Arg 


Glu 


Phe 


Phe 


Tyr 


Glu 


Cys 


He 








100 










105 










110 






Phe 


Asn 


Asp 


He 


Val 


Arg 


Ala 


Cys 


Arg 


Glu 


Lys 


His 


Pro 


Leu 


Asn 


Arg 






115 










120 










125 








Gly 


Gly 


Asp 


Arg 


He 


Leu 


Cys 


Phe 


Leu 


Glu 


Pro 


Cys 


Val 


Pro 


Cys 


Arg 




130 










135 










140 










Ala 


Leu 


He 


Phe 


Pro 


Gin 


Leu 


Thr 


Gly 


Asn 


Leu 


Leu 


Asn 


Ala 


Asp 


Leu 


145 










150 










155 










160 


Lys 


His 


Val 


Asn 


Pro 


Glu 


Arg 


Leu 


Ala 


Val 


Glu 


Phe 


Ser 


Glu 


Leu 


Arg 










165 










170 










175 




Glu 


Gly 


Val 


Ser 


Phe 


Leu 


Asn 


Asn 


He 


Cys 


Gly 


He 


Val 


His 


Cys 


Asp 








180 










185 










190 






He 


Ser 


Pro 


Glu 


Asn 


He 


Leu 


He 


Lys 


Gly 


Glu 


Leu 


Thr 


Thr 


Ala 


Tyr 






195 










200 










205 








Gly 


Arg 


Leu 


Met 


He 


Gly 


Asp 


Leu 


Gly 


Ser 


Ala 


Ser 


Leu 


His 


Thr 


Gly 




210 










215 










220 











144 
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Thr 


Pro 


Trp 


Thr 


Gly 


Val 


Met 


Val 


Thr 


Ser 


Lys 


Leu 


Gly 


Phe 


Val 


Gin 


22 5 










230 










235 










240 


His 


Thr 


Tyr 


His 


Phe 


Lys 


Ala 


Pro 


Ala 


Arg 


Phe 


He 


Cys 


Lys 


His 


He 










245 










250 










255 




Tyr 


A TCI 


Pro 


Ser 


Cvs 


Leu 


Leu 


Tyr 


Arg 


Cys 


Leu 


Leu 


Ser 


Cys 


Ala 


Gly 




260 










265 










270 






vj jl y 


Pro 


Gin 


Ala 


His 


Met 


Leu 


Asn 


Gin 


Pro 


Phe 


Gin 


He 


Thr 


Pro 


Gin 




275 










280 










285 










Gly 


Leu 


Thr 


lie 


Asp 


He 


Ser 


Ser 


Leu 


Gly 


Tyr 


Ser 


Leu 


Leu 


Ala 




2 90 










295 










300 












Leu 


Glu 


Lys 


Tyr 


Leu 


Gin 


Pro 


Ala 


Asp 


Pro 


Phe 


Pro 


Gin 


Gin 


Gly 


3 05 










310 










315 










320 


A "1 

M.JL ct 


Leu 


Ala 


Asp 


Ala 


Ser 


Ser 


Glu 


Ser 


Ala 


His 


Pro 


Leu 


Phe 


Tyr 


Leu 










325 










330 










335 




Arg 


Cys 


Met 


Val 


Pro 


Arq 


Val 


Val 


He 


Ala 


Glu 


He 


Phe 


Ser 


Val 


Ala 




340 










345 










350 






Trp 


Asp 


Va 1 


Pro 


Leu 


Asp 


Leu 


Gly 


He 


Asp 


Ser 


Ser 


Gly 


His 


Ala 


Pro 


355 










360 










365 








Ala 


lie 


Pro 


Leu 


Arg 


Glu 


Ala 


Tyr 


Arg 


Arg 


Phe 


Phe 


Ala 


Asn 


Gin 


Cys 




370 










375 










380 










Ser 


Leu 


Tyr 


Arg 


Ala 


Gin 


Tyr 


Lys 


Glu 


Asp 


Ala 


Leu 


Glu 


Asn 


Ala 


Ser 


385 










390 










395 










400 


Ser 


Arg 


Leu 


Cys 


Asn 


Ser 


Lys 


Leu 


Lys 


Leu 


Val 


Leu 


Gin 


Lys 


Leu 


Leu 








405 










410 










415 




Val 


Arg 


Asp 


Tyr 


Phe 


Ser 


His 


Cys 


Gly 


Asn 


Cys 


Gly 


Asp 


His 


Gly 


Phe 






420 










425 










430 







Phe Leu Arg 
435 



<210> 70 
<211> 1443 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) - . (1443) 

<400> 70 

atg gat ttt ttc tea gat gag ccg atg gtt cag gag atg gcg ctt etc 48 

Met Asp Phe Phe Ser Asp Glu Pro Met Val Gin Glu Met Ala Leu Leu 
i 5 10 15 



gac ate gat gag cag cag egg etc etc teg aaa atg age ctg gec aac 96 
Asp He Asp Glu Gin Gin Arg Leu Leu Ser Lys Met Ser Leu Ala Asn 
20 25 30 



ttt tta aaa cat gag cga gta agg gcg ttt ttt age gat aac aaa aag 

Phe Leu Lys His Glu Arg Val Arg Ala Phe Phe Ser Asp Asn Lys Lys 

35 40 45 

gaa ata age atg ccg get ata egg ttc gtg tat aat ttt tat eta ttc 

Glu He Ser Met Pro Ala He Arg Phe Val Tyr Asn Phe Tyr Leu Phe 

50 55 60 



gec aag gtg gga gat ttt ate ggc aac acc gac gtg tac. gat ttt ta. 
Ala Lys Val Gly Asp Phe He Gly Asn Thr Asp Val Tyr Asp Phe Ty 
65 70 75 80 



144 



192 



ac 240 
r 



145 



gtc act tgc gtg ttc agg ggg agg cgc ctg acg cgc ctg teg gaa gtg 
Val Thr Cys Val Phe Arg Gly Arg Arg Leu Thr Arg Leu Ser Glu Val 
85 90 95 



tac gac gcg tgc eta aac atg cac ccg cac gat cga cac cac gtg tgt 
Tyr Asp Ala Cys Leu Asn Met His Pro His Asp Arg His His Val Cys 
100 105 110 

gca ttg ata gaa cag gtc acg cgc ggc caa aac ate aat cct ctg tgg 
Ala Leu lie Glu Gin Val Thr Arg Gly Gin Asn lie Asn Pro Leu Trp 
115 120 125 

gac get ctg agg gac ggc ata att teg teg tea aaa ttt cac tgg gee 
Asp Ala Leu Arg Asp Gly lie lie Ser Ser Ser Lys Phe His Trp Ala 
130 135 140 

ata aaa caa cag aat teg tec aaa aaa att ttt aac ccg tgg cct ata 
lie Lys Gin Gin Asn Ser Ser Lys Lys He Phe Asn Pro Trp Pro He 
145 150 155 160 

gtc aac aat cac ttt gta gcg ggc ccg etc gcg ttt gga ctg cgt tgc 
Val Asn Asn His Phe Val Ala Gly Pro Leu Ala Phe Gly Leu Arg Cys 
165 170 175 

gag gaa gtg gtt aaa aag ata ctg gcg acg ttg ctg cat cca ggc gag 
Glu Glu Val Val Lys Lys He Leu Ala Thr Leu Leu His Pro Gly Glu 
180 185 190 

gcg cac tgt gaa aac tac gga ttc atg cag agt cct etc aac ggg gtt 
Ala His Cys Glu Asn Tyr Gly Phe Met Gin Ser Pro Leu Asn Gly Val 
195 200 205 

ttt ggc gtc tec ttg gat ttt gga att aac gtc agg tct gac cca aaa 
Phe Gly Val Ser Leu Asp Phe Gly He Asn Val Arg Ser Asp Pro Lys 
210 215 220 

gac ggt ttg gag ttt cac cca gac tgc aaa ate tat gaa ata aaa tgc 
Asp Gly Leu Glu Phe His Pro Asp Cys Lys He Tyr Glu He Lys Cys 
225 - 230 235 240 

egg ttt aag tac act ttt tec aag atg gag tgt gac ccg att tac get 
Arg Phe Lys Tyr Thr Phe Ser Lys Met Glu Cys Asp Pro He Tyr Ala 
245 250 255 

gcg tat get aaa ctt tat cag aag ccc age atg cag acg ctt aag ggg 
Ala Tyr Ala Lys Leu Tyr Gin Lys Pro Ser Met Gin Thr Leu Lys Gly 
260 265 270 

ttt ttg tac tec ata tct aaa ccg gcg ate gag ttt gtc gga gag gac 
Phe Leu Tyr Ser He Ser Lys Pro Ala He Glu Phe Val Gly Glu Asp 
275 280 285 

agg etc ccc agt gaa teg gac tat ctt gtg gca tat gac aaa gaa tgg 
Arg Leu Pro Ser Glu Ser Asp Tyr Leu Val Ala Tyr Asp Lys Glu Trp 
290 295 300 

gag gtg tgt ccg egg aaa aag aga cgc tta act gca gta cac cat eta 
Glu Val Cys Pro Arg Lys Lys Arg Arg Leu Thr Ala Val His His Leu 
305 310 315 320 

gtt aaa aag tgc atg att cac aac tct acg gcg cct tct gat gtg tat 



146 



Val Lys Lys Cys Met lie His Asn Ser Thr Ala Pro Ser Asp Val Tyr 
325 330 335 



ata ttg tea gat ccg cag gaa acc gga ggc caa att aat att aaa get 
lie Leu Ser Asp Pro Gin Glu Thr Gly Gly Gin lie Asn He Lys Ala 
340 345 350 

cat ctg age gec aac eta ttt ata aac gtc agg cat ccg tat tat tat 
His Leu Ser Ala Asn Leu Phe He Asn Val Arg His Pro Tyr Tyr Tyr 
355 360 365 

caa gtg ttg etc cag tct etc gtt gta cag gag tac ate agt etc tec 
Gin Val Leu Leu Gin Ser Leu Val Val Gin Glu Tyr He Ser Leu Ser 
370 375 380 

aag gga act aaa aat ttg gga acc cag aaa aac ttt ata gcg act ggc 
Lys Gly Thr Lys Asn Leu Gly Thr Gin Lys Asn Phe He Ala Thr Gly 
385 390 395 400 

ttt ttt aga aaa egg cag ttt caa gac ccg age tgc tgc acg ate ggt 
Phe Phe Arg Lys Arg Gin Phe Gin Asp Pro Ser Cys Cys Thr He Gly 
405 410 415 

gaa ttt gec ccg ttg gat cca cac gta gag ata ccg acc ctt tta ate 
Glu Phe Ala Pro Leu Asp Pro His Val Glu He Pro Thr Leu Leu He 
420 425 430 

gtg aca ccg gtg tat ttt ccc age gtg gee aaa cac caa ctg gtg aag 
Val Thr Pro Val Tyr Phe Pro Ser Val Ala Lys His Gin Leu Val Lys 
435 440 445 

cag gcg acc gaa ttc tgg gcg get agt get cgt gag gca ttt ccc gag 
Gin Ala Thr Glu Phe Trp Ala Ala Ser Ala Arg Glu Ala Phe Pro Glu 
450 455 460 

eta cca tgg gat tta tec tct ctg tgt gca aac gec cca cca aca ccg 
Leu Pro Trp Asp Leu Ser Ser Leu Cys Ala Asn Ala Pro Pro Thr Pro 
465 470 475 480 



tag 



<210> 71 
<211> 480 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 71 

Met Asp Phe Phe Ser Asp Glu Pro Met Val Gin Glu Met Ala Leu Leu 

15 10 15 

Asp He Asp Glu Gin Gin Arg Leu Leu Ser Lys Met Ser Leu Ala Asn 

20 25 30 

Phe Leu Lys His Glu Arg Val Arg Ala Phe Phe Ser Asp Asn Lys Lys 

35 40 45 

Glu He Ser Met Pro Ala He Arg Phe Val Tyr Asn Phe Tyr Leu Phe 

50 55 60 

Ala Lys Val Gly Asp Phe He Gly Asn Thr Asp Val Tyr Asp Phe Tyr 
65 70 75 80 

Val Thr Cys Val Phe Arg Gly Arg Arg Leu Thr Arg Leu Ser Glu Val 
85 90 95 
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Tyr 


Asp 


Ala 


Cys 


Leu 


Asn 


Met 


His 


Pro 


His 


Asp 


Arg 


His 


His 


Val 


Cys 




100 










105 










110 






Ala 


Leu 


lie 


Glu 


Gin 


Val 


Thr 


Arg 


Gly 


Gin 


Asn 


He 


Asn 


Pro 


Leu 


Trp 






115 










120 










125 








Asp 


Ala 


Leu 


Arg 


Asp 


Gly 


He 


He 


Ser 


Ser 


Ser 


Lys 


Phe 


His 


Trp 


Ala 




130 










135 










140 










lie 


Lys 


Gin 


Gin 


Asn 


Ser 


Ser 


Lys 


Lys 


He 


Phe 


Asn 


Pro 


Trp 


Pro 


He 


145 








150 










155 










160 


Val 


Asn 


Asn 


His 


Phe 


Val 


Ala 


Gly 


Pro 


Leu 


Ala 


Phe 


Gly 


Leu 


Arg 


Cys 










165 










170 










175 




Glu 


Glu 


Val 


Val 


Lys 


Lys 


He 


Leu 


Ala 


Thr 


Leu 


Leu 


His 


Pro 


Gly 


Glu 








180 










185 










190 






Ala 


His 


Cys 


Glu 


Asn 


Tyr 


Gly 


Phe 


Met 


Gin 


Ser 


Pro 


Leu 


Asn 


Gly 


Val 






195 










200 










205 








Phe 


Gly 


Val 


Ser 


Leu 


Asp 


Phe 


Gly 


lie 


Asn 


Val 


Arg 


Ser 


Asp 


Pro 


Lys 




210 










215 










220 










Asp 


Gly 


Leu 


Glu 


Phe 


His 


Pro 


Asp 


Cys 


Lys 


He 


Tyr 


Glu 


He 


Lys 


Cys 


225 








230 










235 










240 


Arg 


Phe 


Lys 


Tyr 


Thr 


Phe 


Ser 


Lys 


Met 


Glu 


Cys 


Asp 


Pro 


He 


Tyr 


Ala 








245 










250 










255 




Ala 


Tvr 


Ala 


Lys 


Leu 


Tyr 


Gin 


Lys 


Pro 


Ser 


Met 


Gin 


Thr 


Leu 


Lys 


Gly 








260 










265 










270 






Phe 


Leu 


Tyr 


Ser 


lie 


Ser 


Lys 


Pro 


Ala 


He 


Glu 


Phe 


Val 


Gly 


Glu 


Asp 






275 










280 










285 








Arg 


Leu 


Pro 


Ser 


Glu 


Ser 


Asp 


Tyr 


Leu 


Val 


Ala 


Tyr 


Asp 


Lys 


Glu 


Trp 


290 










295 










300 










Glu 


Val 


Cys 


Pro 


Arg 


Lys 


Lys 


Arg 


Arg 


Leu 


Thr 


Ala 


Val 


His 


His 


Leu 


305 










310 










315 










320 


Val 


Lys 


Lys 


Cys 


Met 


He 


His 


Asn 


Ser 


Thr 


Ala 


Pro 


Ser 


Asp 


Val 


Tyr 






325 










330 










335 




lie 


Leu 


Ser 


Asp 


Pro 


Gin 


Glu 


Thr 


Gly 


Gly 


Gin 


He 


Asn 


He 


Lys 


Ala 








340 










345 










350 






His 


Leu 


Ser 


Ala 


Asn 


Leu 


Phe 


He 


Asn 


Val 


Arg 


His 


Pro 


Tyr 


Tyr 


Tyr 






355 










360 










365 








Gin 


Val 


Leu 


Leu 


Gin 


Ser 


Leu 


Val 


Val 


Gin 


Glu 


Tyr 


He 


Ser 


Leu 


Ser 




370 










375 










380 










Lys 


Gly 


Thr 


Lys 


Asn 


Leu 


Gly 


Thr 


Gin 


Lys 


Asn 


Phe 


He 


Ala 


Thr 


Gly 


385 








390 










395 










400 


Phe 


Phe 


Arg 


Lys 


Arg 


Gin 


Phe 


Gin 


Asp 


Pro 


Ser 


Cys 


Cys 


Thr 


He 


Gly 








405 










410 










415 




Glu 


Phe 


Ala 


Pro 


Leu 


Asp 


Pro 


His 


Val 


Glu 


He 


Pro 


Thr 


Leu 


Leu 


He 








420 










425 










430 






Val 


Thr 


Pro 


Val 


Tyr 


Phe 


Pro 


Ser 


Val 


Ala 


Lys 


His 


Gin 


Leu 


Val 


Lys 






435 










440 










445 








Gin 


Ala 


Thr 


Glu 


Phe 


Trp 


Ala 


Ala 


Ser 


Ala 


Arg 


Glu 


Ala 


Phe 


Pro 


Glu 




450 










455 










460 










Leu 


Pro 


Trp 


Asp 


Leu 


Ser 


Ser 


Leu 


Cys 


Ala 


Asn 


Ala 


Pro 


Pro 


Thr 


Pro 


465 










470 










475 










480 



<210> 72 

<211> 210 

<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (!) . . (210) 

<400> 72 



148 



gcg ctgtac aaa ccc aaa acc aag cca tec aaa cat aaa aac aat aaa 
Ala Leu Tyr Lys Pro Lys Thr Lys Pro Ser Lys His Lys Asn Asn Lys 
50 55 60 

ttg tea gat ttt gtt taa 
Leu Ser Asp Phe Val 
65 70 



48 



96 



atg gga ttt ate etc tct gtg tgc aaa cgc ccc acc aac acc gta gat 
Met Gly Phe lie Leu Ser Val Cys Lys Arg Pro Thr Asn Thr Val Asp 
1.5 10 15 

gtg aag ggg gag ccc ata gat gta tec aaa gaa ttc gat cct att ata 
Val Lys Gly Glu Pro lie Asp Val Ser Lys Glu Phe Asp Pro lie lie 
20 25 30 

gga gaa gaa age att gtc ttg tta acg gca gat ggg act gee ccc gcg 144 
Gly Glu Glu Ser He Val Leu Leu Thr Ala Asp Gly Thr Ala Pro Ala 
35 40 45 



192 



210 



<210> 73 
<211> 69 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 73 

Met Gly Phe He Leu Ser Val Cys Lys Arg Pro Thr Asn Thr Val Asp 

15 10 15 

Val Lys Gly Glu Pro He Asp Val Ser Lys Glu Phe Asp Pro He He 

20 25 30 

Gly Glu Glu Ser He Val Leu Leu Thr Ala Asp Gly Thr Ala Pro Ala 

35 40 45 

Ala Leu Tyr Lys Pro Lys Thr Lys Pro Ser Lys His Lys Asn Asn Lys 

50 55 60 

Leu Ser Asp Phe Val 
65 



<210> 74 
<211> 1137 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) - . (1137) 

<400> 74 

atg aaa att tea egg age gac teg ttt ate tta teg tea tgg gta aaa 48 
Met Lys He Ser Arg Ser Asp Ser Phe He Leu Ser Ser Trp Val Lys 
1 5 10 15 

eta ctg gtt att ctt gga ctt atg ttt ata atg tea gcg gta gtg cca 96 
Leu Leu Val He Leu Gly Leu Met Phe He Met Ser Ala Val Val Pro 
20 25 30 

ctg acc gec aca ttc ccg gga ctt gga ttt ccg tgc tac ttt aac acg 144 
Leu Thr Ala Thr Phe Pro Gly Leu Gly Phe Pro Cys Tyr Phe Asn Thr 
35 40 45 



149 




240 



288 



ttg gtc aac tac age gcg tta aac eta acg gtc aga agt tct get aaa 192 
Leu Val Asn Tyr Ser Ala Leu Asn Leu Thr Val Arg Ser Ser Ala Lys 
50 55 60 

cat ctg acg ccg act ttg ttt ttg gaa gca cca gaa atg ttt gtc tac 
His Leu Thr Pro Thr Leu Phe Leu Glu Ala Pro Glu Met Phe Val Tyr 
65 70 75 80 

ate teg tgg gcg ttt ttg gtg gac gga tat ctg ctg tgt tac tat gcg 
He Ser Trp Ala Phe Leu Val Asp Gly Tyr Leu Leu Cys Tyr Tyr Ala 
85 90 95 

tgg gec ate ctg gee ata ttc aag gec aag cgc gtg cac gcg aca aca 336 
Trp Ala. He Leu Ala He Phe Lys Ala Lys Arg Val His Ala Thr Thr 
100 105 HO 

atg ace age etc cag aca tgg ate gtg etc ata ggc tec cac agt gta 384 
Met Thr Ser Leu Gin Thr Trp He Val Leu He Gly Ser His Ser Val 
115 120 125 

gtt ttt atg tct att ttg agg ctg tgg acc ate cag tta ttt att cac 432 
Val Phe Met Ser He Leu Arg Leu Trp Thr He Gin Leu Phe He His 
130 " 135 140 

gtt ctg teg tac aaa cat ata ctt tta gee teg ttt gta tac tgt ata 480 
Val Leu Ser Tyr Lys His He Leu Leu Ala Ser Phe Val Tyr Cys He 
145 150 155 160 

cat ttt tgc eta teg ttt acg cac gtc cag get atg ata teg tgc aat 
His Phe Cys Leu Ser Phe Thr His Val Gin Ala Met He Ser Cys Asn 
165 170 175 

teg gec acc tgg tct eta cgc gtc tta gag cag cag att cct gag aat 
Ser Ala Thr Trp Ser Leu Arg Val Leu Glu Gin Gin He Pro Glu Asn 
180 185 190 

age ttg eta gac acc ctg ttg cgt tac gga aaa ccg att ggc gec aat 
Ser Leu Leu Asp Thr Leu Leu Arg Tyr Gly Lys Pro lie Gly Ala Asn 
195 200 205 

ctt tat ctg tec tta ata gec atg gag atg tta gta ttc tec etc gga 672 
Leu Tyr Leu Ser Leu He Ala Met Glu Met Leu Val Phe Ser Leu Gly 
210 215 220 

acg atg atg get att gga aac agt ttc tat atg etc gtt tec gat att 
Thr Met Met Ala He Gly Asn Ser Phe Tyr Met Leu Val Ser Asp He 
225 230 235 240 

gtc ttt ggt tea ata aac ttg ttt ttt gtc ctt aca ata get tgg tat 
Val Phe Gly Ser He Asn Leu Phe Phe Val Leu Thr lie Ala Trp Tyr 
245 250 255 

ate aac aca gaa eta ttt eta gta aag tac eta aag cac cag ate gga 
He Asn Thr Glu Leu Phe Leu Val Lys Tyr Leu Lys His Gin lie Gly 
260 265 270 

ttc tac gtt ggc gtt ttt gtc agt tac ctg att ctg ctt ctt ccc gtc 
Phe Tyr Val Gly Val Phe Val Ser Tyr Leu lie Leu Leu Leu Pro Val 
275 280 285 



528 



576 



624 



720 



768 



816 



864 



150 



gtt cga tac gac aag gta ttc ata tct gcc age ctg cac aaa gtc ata 912 
Val Arg Tyr Asp Lys Val Phe He Ser Ala Ser Leu His Lys Val He 
290 295 300 



gcc gtg aac ate tct atg att ccg ate act tgc ate eta gcc ate ate 
Ala Val Asn He Ser Met He Pro He Thr Cys He Leu Ala He lie 
305 310 315 320 



gat tac gag gat gca tat cca aaa tac ata tga 
Asp Tyr Glu Asp Ala Tyr Pro Lys Tyr He 
370 375 



960 



1006 



tta aga att ate aga aac gat tgg aaa tgg tgc gca aag teg ccc gaa 

Leu Arg He He Arg Asn Asp Trp Lys Trp Cys Ala Lys Ser Pro Glu 
325 330 335 

tac gca ccc ctt cct caa ggc cct aaa gag aag acg aca aaa gtt aag 1056 

Tyr Ala Pro Leu Pro Gin Gly Pro Lys Glu Lys Thr Thr Lys Val Lys 
340 345 350 

tac tec ccg gaa ctg aac gcg tta tat gaa aca gaa gaa gac gtg age 1104 

Tyr Ser Pro Glu Leu Asn Ala Leu Tyr Glu Thr Glu Glu Asp Val Ser 

355 360 365 



1137 



<210> 75 
<211> 378 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 75 

Met Lys He Ser Arg Ser Asp Ser Phe He Leu Ser Ser Trp Val Lys 

1 5 10 15 

Leu Leu Val He Leu Gly Leu Met Phe He Met Ser Ala Val Val Pro 

20 25 30 

Leu Thr Ala Thr Phe Pro Gly Leu Gly Phe Pro Cys Tyr Phe Asn Thr 

35 40 45 

Leu Val Asn Tyr Ser Ala Leu Asn Leu Thr Val Arg Ser Ser Ala Lys 

50 55 60 

His Leu Thr Pro Thr Leu Phe Leu Glu Ala Pro Glu Met Phe Val Tyr 
65 70 75 80 

lie Ser Trp Ala Phe Leu Val Asp Gly Tyr Leu Leu Cys Tyr Tyr Ala 

85 90 95 

Trp Ala He Leu Ala lie Phe Lys Ala Lys Arg Val His Ala Thr Thr 

100 105 HO 

Met Thr Ser Leu Gin Thr Trp He Val Leu He Gly Ser His Ser Val 

115 120 125 

Val Phe Met Ser He Leu Arg Leu Trp Thr He Gin Leu Phe lie His 

130 135 140 

Val Leu Ser Tyr Lys His lie Leu Leu Ala Ser Phe Val Tyr Cys He 
145 150 155 160 

His Phe Cys Leu Ser Phe Thr His Val Gin Ala Met He Ser Cys Asn 

165 170 175 

Ser Ala Thr Trp Ser Leu Arg Val Leu Glu Gin Gin lie Pro Glu Asn 

180 185 190 

Ser Leu Leu Asp Thr Leu Leu Arg Tyr Gly Lys Pro He Gly Ala Asn 

195 200 205 

Leu Tyr Leu Ser Leu He Ala Met Glu Met Leu Val Phe Ser Leu Gly 

210 215 220 

Thr Met Met Ala He Gly Asn Ser Phe Tyr Met Leu Val Ser Asp He 
225 230 235 240 
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Val 


Phe 


Gly 


Ser 


lie 


Asn 


Leu 


Phe 


Phe 


Val 


Leu 


Thr 




Aid 


Trp 


Tyr 










245 










2 5 0 














He 


Asn 


Thr 


Glu 


Leu 


Phe 


Leu 


Val 


Lys 


Tyr 


Leu 


Lys 


His 


Gin 


lie 


Gly 








2 60 










2 6 5 
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Phe 


Tyr 


Val 


Gly 


Val 


Phe 


Val 


Ser 


Tyr 


Leu 


He 


Leu 


Leu 


Leu 


Pro 


Val 






275 










280 










2 85 








Val 


Arg 


Tyr 


Asp 


Lys 


Val 


Phe 


He 


Ser 


Ala 


Ser 


Leu 


His 


Lys 


Vdl 


lie 




2 90 










2 95 










300 










Ala 


Val 


Asn 


He 


Ser 


Met 


He 


Pro 


He 


Thr 


Cys 


He 


Leu 


Ala 


He 


He 


305 










310 










315 










320 


Leu 


Arg 


He 


He 


Arg 


Asn 


Asp 


Trp 


Lys 


Trp 


Cys 


Ala 


Lys 


Ser 


Pro 


Glu 










325 










330 










335 




Tyr 


Ala 


Pro 


Leu 


Pro 


Gin 


Gly 


Pro 


Lys 


Glu 


Lys 


Thr 


Thr 


Lys 


Val 


Lys 






340 










345 










350 






Tyr 


Ser 


Pro 


Glu 


Leu 


Asn 


Ala 


Leu 


Tyr 


Glu 


Thr 


Glu 


Glu 


Asp 


Val 


Ser 




355 










360 










365 








Asp 


Tyr 


Glu 


Asp 


Ala 


Tyr 


Pro 


Lys 


Tyr 


He 















370 375 



<210> 76 
<211> 1407 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (1407) 



<400> 76 

atg aac gec egg gag gtg gca etc acg gga cat gtt ttg cac ata teg 
Met Asn Ala Arg Glu Val Ala Leu Thr Gly His Val Leu His He Ser 
15 10 15 



48 



ctg cat age acg cac gag cgc gag aaa tta ata ate tgg cag gtt cat 
Leu His Ser Thr His Glu Arg Glu Lys Leu He He Trp Gin Val His 
20 25 30 



96 



tta ctt gta tgt caa caa tgc gga att cag gga gat gec gca tat eta 
Leu Leu Val Cys Gin Gin Cys Gly He Gin Gly Asp Ala Ala Tyr Leu 
35 40 45 



144 



ttt gtc ace gaa aca tta age aat act gac tgg gga aat ata ccg gcg 
Phe Val Thr Glu Thr Leu Ser Asn Thr Asp Trp Gly Asn He Pro Ala 
50 55 60 



192 



ata aac cgt cac gca ccg tec ata aat gag cac ggc cgt aat tat atg 
He Asn Arg His Ala Pro Ser He Asn Glu His Gly Arg Asn Tyr Met 
65 70 75 80 



240 



cag tgg gaa etc cgt act cgt tta egg aat ccc ate att caa ttg tta 
Gin Trp Glu Leu Arg Thr Arg Leu Arg Asn Pro He He Gin Leu Leu 
85 90 95 



288 



agt cgc cag ccc ggt gcg gtt aac gta agg gtc age gag ccg aat atg 
Ser Arg Gin Pro Gly Ala Val Asn Val Arg Val Ser Glu Pro Asn Met 
100 105 HO 



336 



gta ata gtg ggc tgc gaa cga gcg ttg gat cac teg tgt teg gtg cgc 
Val He Val Gly Cys Glu Arg Ala Leu Asp His Ser Cys Ser Val Arg 
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gaa ggc gta gcg cca acg tta acg tac gtg tgt ccg ggg gcc act ccg 
Glu Gly Val Ala Pro Thr Leu Thr Tyr Val Cys Pro Gly Ala Thr Pro 
260 265 270 

gag gga acc ccc ttc ccc ccc tac ttt tec gcg ttt cca cac etc ccg 
Glu Gly Thr Pro Phe Pro Pro Tyr Phe Ser Ala Phe Pro His Leu Pro 
275 280 285 

etc gtg ttt gga aga ccg cga agg etc gac gta acc gcg gtc caa gaa 
Leu Val Phe Gly Arg Pro Arg Arg Leu Asp Val Thr Ala Val Gin Glu 
290 295 ■ 300 

etc cca aaa gca caa att gcg gta cac tgg ccc ccg ttt aaa gat tea 
Leu Pro Lys Ala Gin He Ala Val His Trp Pro Pro Phe Lys Asp Ser 
305 310 315 320 



432 



480 



528 



115 120 125 

gtg act gga gcc tat ctt cat tgc gat acc act atg gac ttt agt ttg 
Val Thr Gly Ala Tyr Leu His Cys Asp Thr Thr Met Asp Phe Ser Leu 
130 135 140 

gat tct gtt gtg tec cca acc cgc gaa ttt tgg ttc tea gag atg ttt 
Asp Ser Val Val Ser Pro Thr Arg Glu Phe Trp Phe Ser Glu Met Phe 
145 150 155 160 

tct cac tgt tta gtt tec aac att gaa gtt tac ctt aaa aca acg ggc 
Ser His Cys Leu Val Ser Asn He Glu Val Tyr Leu Lys Thr Thr Gly 
165 170 175 

ggg tta tac tat agg gca teg agt gcc acg caa tgc cga aaa agg gcg 576 
Gly Leu Tyr Tyr Arg Ala Ser Ser Ala Thr Gin Cys Arg Lys Arg Ala 
180 185 190 

aaa gat ggc gca ttg ggt att ctt gat ate ttt aat tgc gaa tct cgt 624 
Lys Asp Gly Ala Leu Gly He Leu Asp He Phe Asn Cys Glu Ser Arg 
195 200 205 

gaa ata caa gtt gcc ggg cag aag tac acc ttg agt ate gcc acc gca 672 
Glu He. Gin Val Ala Gly Gin Lys Tyr Thr Leu Ser He Ala Thr Ala 
210 215 220 

aca ttt cac gtt etc tgg gtg gac gag gcg tgt atg tgg aac ggg gcc 
Thr Phe His Val Leu Trp Val Asp Glu Ala Cys Met Trp Asn Gly Ala 
225 230 235 240 

ctg gcc gaa ttt ttt agg gcg ctg cac aat aag ttg ttc ggc gac egg 768 
Leu Ala Glu Phe Phe Arg Ala Leu His Asn Lys Leu Phe Gly Asp Arg 
245 250 255 



720 



816 



864 



912 



960 



1008 



ate tta ggg gat cag ctt etc ata cct ggc att tea cct aaa aag cca 

He Leu Gly Asp Gin Leu Leu lie Pro Gly He Ser Pro Lys Lys Pro 

325 330 335 

ggt acc gta ccc gtt cgt tgg ccg ctt tgg gtg gag gat gtt aac ttg 1056 

Gly Thr Val Pro Val Arg Trp Pro Leu Trp Val Glu Asp Val Asn Leu 

340 345 350 



agt etc tgc gag acg aca gaa age gtt gcc cgc ata gtc gac cca cat 
Ser Leu Cys Glu Thr Thr Glu Ser Val Ala Arg He Val Asp Pro His 
355 360 365 



1104 



153 



tct ata gta ate ata aaa ttt tea tea ctg ttg tgc cag cac eta aaa 
Ser He Val He He Lys Phe Ser Ser Leu Leu Cys Gin His Leu Lys 
370 375 380 

tgc cac cgt gcg ttt gtt aaa aat gag tta gaa tac ata gca acc ate 
Cys His Arg Ala Phe Val Lys Asn Glu Leu Glu Tyr He Ala Thr He 
385 390 395 400 

tgt tec age gac ctt cgc etc ttc ate caa gag gaa tac aac egg tta 
Cys Ser Ser Asp Leu Arg Leu Phe He Gin Glu Glu Tyr Asn Arg Leu 
405 410 415 

ctt gee acc att ttt acg tgg gec gcg gcg age ggg tat acc tgg gcg 
Leu Ala Thr He Phe Thr Trp Ala Ala Ala Ser Gly Tyr Thr Trp Ala 
420 425 430 

gee att gat aaa aca aca gta ttc ate aag get ccc cag etc age gca 
Ala He Asp Lys Thr Thr Val Phe He Lys Ala Pro Gin Leu Ser Ala 
435 440 445 

get gta agt ggt ttc tgc cca tea eta aat agt tgc cgt agg aaa caa 
Ala Val Ser Gly Phe Cys Pro Ser Leu Asn Ser Cys Arg Arg Lys Gin 
450 455 460 

tgt tac gaa ggt taa 
Cys Tyr Glu Gly 
465 



1152 



1200 



1248 



1296 



1344 



1392 



1407 



<210> 77 
<211> 468 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 77 



Met 


Asn 


Ala 


Arg 


Glu 


Val 


Ala 


Leu 


Thr 


Gly 


His 


Val 


Leu 


His 


He 


Ser 


1 






5 










10 










15 




Leu 


His 


Ser 


Thr 


His 


Glu 


Arg 


Glu 


Lys 


Leu 


He 


He 


Trp 


Gin 


Val 


His 








20 










25 










30 






Leu 


Leu 


Val 


Cys 


Gin 


Gin 


Cys 


Gly 


He 


Gin 


Gly 


Asp 


Ala 


Ala 


Tyr 


Leu 






35 










40 










45 






Ala 


Phe 


Val 


Thr 


Glu 


Thr 


Leu 


Ser 


Asn 


Thr 


Asp 


Trp 


Gly 


Asn 


He 


Pro 




50 










55 










60 










He 


Asn 


Arg 


His 


Ala 


Pro 


Ser 


He 


Asn 


Glu 


His 


Gly 


Arg 


Asn 


Tyr 


Met 


65 








70 










75 










80 


Gin 


Trp 


Glu 


Leu 


Arg 


Thr 


Arg 


Leu 


Arg 


Asn 


Pro 


He 


He 


Gin 


Leu 


Leu 








85 










90 










95 




Ser 


Arg 


Gin 


Pro 


Gly 


Ala 


Val 


Asn 


Val 


Arg 


Val 


Ser 


Glu 


Pro 


Asn 


Met 






100 










105 










110 






Val 


He 


Val 


Gly 


Cys 


Glu 


Arg 


Ala 


Leu 


Asp 


His 


Ser 


Cys 


Ser 


Val 


Arg 






115 










120 










125 








Val 


Thr 


Gly 


Ala 


Tyr 


Leu 


His 


Cys 


Asp 


Thr 


Thr 


Met 


Asp 


Phe 


Ser 


Leu 




130 








135 










140 








Phe 


Asp 


Ser 


Val 


Val 


Ser 


Pro 


Thr 


Arg 


Glu 


Phe 


Trp 


Phe 


Ser 


Glu 


Met 


145 










150 










155 










160 


Ser 


His 


Cys 


Leu 


Val 


Ser 


Asn 


He 


Glu 


Val 


Tyr 


Leu 


Lys 


Thr 


Thr 


Gly 








165 










170 










175 




Gly 


Leu 


Tyr 


Tyr 


Arg 


Ala 


Ser 


Ser 


Ala 


Thr 


Gin 


Cys 


Arg 


Lys 


Arg 


Ala 






180 










185 










190 






Lys 


Asp 


Gly 


Ala 


Leu 


Gly 


He 


Leu 


Asp 


He 


Phe 


Asn 


Cys 


Glu 


Ser 


Arg 
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X95 200 205 

Glu lie Gin Val Ala Gly Gin Lys Tyr Thr Leu Ser He Ala Thr Ala 

2io 215 220 

Thr Phe His Val Leu Trp Val Asp Glu Ala Cys Met Trp Asn Gly Ala 
225 230 235 240 

Leu Ala Glu Phe Phe Arg Ala Leu His Asn Lys Leu Phe Gly Asp Arg 

245 250 255 

Glu Gly Val Ala Pro Thr Leu Thr Tyr Val Cys Pro Gly Ala Thr Pro 

260 265 270 

Glu Gly Thr Pro Phe Pro Pro Tyr Phe Ser Ala Phe Pro His Leu Pro 

275 280 285 

Leu Val Phe Gly Arg Pro Arg Arg Leu Asp Val Thr Ala Val Gin Glu 

290 295 300 

Leu Pro Lys Ala Gin He Ala Val His Trp Pro Pro Phe Lys Asp Ser 
305 310 315 320 

He Leu Gly Asp Gin Leu Leu He Pro Gly He Ser Pro Lys Lys Pro 

325 330 335 

Gly Thr Val Pro Val Arg Trp Pro Leu Trp Val Glu Asp Val Asn Leu 

340 345 350 

Ser Leu Cys Glu Thr Thr Glu Ser Val Ala Arg He Val Asp Pro His 

355 360 365 

Ser He Val He He Lys Phe Ser Ser Leu Leu Cys Gin His Leu Lys 

370 375 380 

Cys His Arg Ala Phe Val Lys Asn Glu Leu Glu Tyr He Ala Thr He 
385 390 395 400 

Cys Ser Ser Asp Leu Arg Leu Phe He Gin Glu Glu Tyr Asn Arg Leu 

405 410 415 

Leu Ala Thr He Phe Thr Trp Ala Ala Ala Ser Gly Tyr Thr Trp Ala 

420 425 430 

Ala He Asp Lys Thr Thr Val Phe He Lys Ala Pro Gin Leu Ser Ala 

435 440 445 

Ala Val Ser Gly Phe Cys Pro Ser Leu Asn Ser Cys Arg Arg Lys Gin 

450 455 460 

Cys Tyr Glu Gly 
465 



<210> 78 

<211> 612 

<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) - - (612) 

<400> 78 

atg tta cga agg tta aaa ata aca gtt cat ttc ctt tea cag gaa cag 

Met Leu Arg Arg Leu Lys He Thr Val His Phe Leu Ser Gin Glu Gin 
1 5 10 15 

caa aag gtc gtg acc cgt ctt gag gcg cat ttg gga ctt ccc gta cag 
Gin Lys Val Val Thr Arg Leu Glu Ala His Leu Gly Leu Pro Val Gin 
20 25 30 

gaa act tec cac ccg cct gac tgg etc aag tgt gag gtc tgc tec gcg 
Glu Thr Ser His Pro Pro Asp Trp Leu Lys Cys Glu Val- Cys Ser Ala 
35 40 45 

tec gtg ttt tta aaa ata cca gec ggg gtt ttg tat gec gga etc gca 

155 



48 



96 



144 



192 



lUi a . 



Ser Val Phe Leu Lys lie Pro Ala Gly Val Leu Tyr Ala Gly Leu Ala 

50 55 60 

aga gac ccc acc agg gaa gca aaa egg gac teg tgg ctg gac tgt eta 

Arg Asp Pro Thr Arg Glu Ala Lys Arg Asp Ser Trp Leu Asp Cys Leu 

65 70 75 80 

gta gaa ggc gcg acg ttg ttg ctt aac aac tea gtg tta ccg att ggg 

Val Glu Gly Ala Thr Leu Leu Leu Asn Asn Ser Val Leu Pro lie Gly 

85 90 95 



aat ttt tgg ctg ctg cca cgc gcg tgg gta aaa teg gcg ccc ata tgc 

Asn Phe Trp Leu Leu Pro Arg Ala Trp Val Lys Ser Ala Pro lie Cys 
115 120 125 

cct ccc eta ccg att gac tgt gtt acg cct cca cag ttt gtc gtg aca 

Pro Pro Leu Pro lie Asp Cys Val Thr Pro Pro Gin Phe Val Val Thr 

130 135 140 

aag cgt gga cca ate tgc tgg tac aag gaa tgg ccg tta ccg gtt gac 

Lys Arg Gly Pro lie Cys Trp Tyr Lys Glu Trp Pro Leu Pro Val Asp 
145 150 155 160 

gtt gat ttt atg tac tac eta cag gag gca eta tgt gtt ttt agt gtt 

Val Asp Phe Met Tyr Tyr Leu Gin Glu Ala Leu Cys Val Phe Ser Val 
165 170 175 

gtg tec aac ggg gag ggt acg gag agt cac gcg gac aat ata cga caa 

Val Ser Asn Gly Glu Gly Thr Glu Ser His Ala Asp Asn lie Arg Gin 
180 185 190 

tta gag aag ttt gaa aag gta eta tgt tta ttt taa 

Leu Glu Lys Phe Glu Lys Val Leu Cys Leu Phe 
195 200 



<210> 79 
<211> 203 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 79 



Met 


Leu 


Arg 


Arg 


Leu 


Lys 


He 


Thr 


Val 


His 


Phe 


Leu 


Ser 


Gin 


Glu 


Gin 


1 
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10 










15 




Gin 


Lys 


Val 


Val 


Thr 


Arg 


Leu 


Glu 


Ala 


His 


Leu 


Gly 


Leu 


Pro 


Val 


Gin 






20 










25 










30 






Glu 


Thr 


Ser 


His 


Pro 


Pro 


Asp 


Trp 


Leu 


Lys 


Cys 


Glu 


Val 


Cys 


Ser 


Ala 






35 










40 










45 








Ser 


Val 


Phe 


Leu 


Lys 


He 


Pro 


Ala 


Gly 


Val 


Leu 


Tyr 


Ala 


Gly 


Leu 


Ala 




50 










55 










60 










Arg 


Asp 


Pro 


Thr 


Arg 


Glu 


Ala 


Lys 


Arg 


Asp 


Ser 


Trp 


Leu 


Asp 


Cys 


Leu 


65 








70 










75 










80 


Val 


Glu 


Gly 


Ala 


Thr 


Leu 


Leu 


Leu 


Asn 


Asn 


Ser 


Val 


Leu 


Pro 


He 


Gly 








85 










90 










95 




Ala 


Leu 


Ala 


Gly 


He 


Leu 


Pro 


Thr 


Leu 


Phe 


Ala 


Asn 


Arg 


Arg 


Cys 


Val 








100 










105 










110 






Asn 


Phe 


Trp 


Leu 


Leu 


Pro 


Arg 


Ala 


Trp 


Val 


Lys 


Ser 


Ala 


Pro 


He 


Cys 



240 



288 



gcg ctg gcg ggt ate tta ccc acc ctt ttt gee aac agg egg tgt gtt 336 
Ala Leu Ala Gly He Leu Pro Thr Leu Phe Ala Asn Arg Arg Cys Val 
100 105 HO 



384 



432 



480 



528 



576 



612 



156 




120 



Pro 


Pro 


Leu 


Pro 


lie 


Asp 


Cys 


Val 


Thr 


Pro 


Pro 


Gin 


Phe 


Val 


Val 


Thr 




130 










135 










140 










Lys 


Arg 


Gly 


Pro 


He 


Cys 


Trp 


Tyr 


Lys 


Glu 


Trp 


Pro 


Leu 


Pro 


Val 


Asp 


145 










150 










155 










160 


Val 


Asp 


Phe 


Met 


Tyr 


Tyr 


Leu 


Gin 


Glu 


Ala 


Leu 


Cys 


Val 


Phe 


Ser 


Val 








165 










170 










175 




Val 


Ser 


Asn 


Gly 


Glu 


Gly 


Thr 


Glu 


Ser 


His 


Ala 


Asp 


Asn 


He 


Arg 


Gin 








180 










185 










190 






Leu 


Glu 


Lys 


Phe 


Glu 


Lys 


Val 


Leu 


Cys 


Leu 


Phe 













195 200 



<210> 80 
<211> 819 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) - - (819) 

<400> 80 

atg gac caa ata ctg aag cgt eta atg ggg gag cag cac cga tct gag 

Met Asp Gin He Leu Lys Arg Leu Met Gly Glu Gin His Arg Ser Glu 

1 5 10 15 

gcg gta atg cct gaa aca gaa tgc teg tec aga ggg ccc tat aat tat 
Ala Val Met Pro Glu Thr Glu Cys Ser Ser Arg Gly Pro Tyr Asn Tyr 
20 25 30 

cca gtt ttt ccg cga ctg atg ttg gaa gta cat aaa aaa aat age att 
Pro Val Phe Pro Arg Leu Met Leu Glu Val His Lys Lys Asn Ser He 
35 40 45 

tgc atg get tec aat acg cct aag ctg tgt gtt cga gga cga eta aac 
Cys Met Ala Ser Asn Thr Pro Lys Leu Cys Val Arg Gly Arg Leu Asn 
50 55 60 

gtt cct gat tta ggc gtg cac gtg cgc acc agg ctt cag teg gca acg 
Val Pro Asp Leu Gly Val His Val Arg Thr Arg Leu Gin Ser Ala Thr 



65 70 



75 80 



ttt act ggg ttt gtc ttt gcg tgc gtg gtc gaa cac gag gac atg ate 
Phe Thr Gly Phe Val Phe Ala Cys Val Val Glu His Glu Asp Met He 



85 



90 95 



100 



105 HO 



ttt aag ccg gcg age gcg age gta aca gaa etc tgc tgc att ctt tct 

Phe Lys Pro Ala Ser Ala Ser Val Thr Glu Leu Cys Cys He Leu Ser 

115 120 125 

atg ctg gaa aac tac gat aag ccc ccc eta tea ttt ate ctg tec gcg 

Met Leu Glu Asn Tyr Asp Lys Pro Pro Leu Ser Phe He Leu Ser Ala 

130 135 140 

ctt gac egg gec agg tac eta cac gag agg tat acg tgt aat gat tec 

157 



48 



96 



144 



192 



240 



288 



gac gca ctg gac ata tat ccg cac gtt ttt tct gac cgc gtg cag ctg 336 
Asp Ala Leu Asp He Tyr Pro His Val Phe Ser Asp Arg Val Gin Leu 



384 



432 



480 
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Leu Asp Arg Ala Arg Tyr Leu His Glu Arg Tyr Thr Cys Asn Asp Ser 

145 150 " 155 160 

gcg ttt gta ttg tat ggg ate gaa gtg ata gcg teg acc ttg gcg gca 528 

Ala Phe Val Leu Tyr Gly He Glu Val He Ala Ser Thr Leu Ala Ala 

165 170 175 

tac cac gag tta aac cca ccc caa ggc att ttg cgc gtt ccg ccc eta 576 

Tyr His Glu Leu Asn Pro Pro Gin Gly He Leu Arg Val Pro Pro Leu 

180 185 190 



gtg egg ttc aaa etc cat aag ctg ttg gac gaa aac gca gac gac atg 

Val Arg Phe Lys Leu His Lys Leu Leu Asp Glu Asn Ala Asp Asp Met 

195 200 205 

aaa ggc tta ctg aaa ccc att tat ttg gaa tec ttt cgt eta aca gag 

Lys Gly Leu Leu Lys Pro He Tyr Leu Glu Ser Phe Arg Leu Thr Glu 

210 215 220 

aac gtt gga gaa gaa gag ggc cac gcg gag act ttt aac ata ttt tat 

Asn Val Gly Glu Glu Glu Gly His Ala Glu Thr Phe Asn He Phe Tyr 

225 230 235 240 

tgc ggc aca att ttt acc aga cac ctt cat aac gca teg gta tta aaa 

Cys Gly Thr He Phe Thr Arg His Leu His Asn Ala Ser Val Leu Lys 

245 250 255 

tat ttt caa ata acc age eta cac age ate ccc aga cag act ctg ttt 
Tyr Phe Gin He Thr Ser Leu His Ser He Pro Arg Gin Thr Leu Phe 

260 265 270 



624 



672 



720 



768 



816 



819 



<210> 81 
<211> 272 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 



<400> 81 
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Thr 


Glu 


Cys 


Ser 


Ser 


Arg 


Gly 


Pro 


Tyr 


Asn 


Tyr 
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Pro 


Val 
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Pro 
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Leu 


Met 


Leu 


Glu 


Val 


His 


Lys 


Lys 


Asn 


Ser 


He 






35 










40 










45 








Cys 


Met 


Ala 


Ser 


Asn 


Thr 


Pro 


Lys 


Leu 


Cys 


Val 


Arg 


Gly 


Arg 


Leu 


Asn 


50 










55 










60 








Thr 


Val 


Pro 


Asp 


Leu 


Gly 


Val 


His 


Val 


Arg 


Thr 


Arg 


Leu 


Gin 


Ser 


Ala 


65 










70 










75 










80 


Phe 


Thr 


Gly 


Phe 


Val 


Phe 


Ala 


Cys 


Val 


Val 


Glu 


His 


Glu 


Asp 


Met 


He 








85 










90 










95 




Asp 


Ala 


Leu 


Asp 


He 


Tyr 


Pro 


His 


Val 


Phe 


Ser 


Asp 


Arg 


Val 


Gin 


Leu 






100 










105 










110 






Phe 


Lys 


Pro 


Ala 


Ser 


Ala 


Ser 


Val 


Thr 


Glu 


Leu 


Cys 


Cys 


He 


Leu 


Ser 




115 










120 










125 






Ala 


Met 


Leu 


Glu 


Asn 


Tyr 


Asp 


Lys 


Pro 


Pro 


Leu 


Ser 


Phe 


He 


Leu 


Ser 




130 










135 










140 










Leu 


Asp 


Arg 


Ala 


Arg 


Tyr 


Leu 


His 


Glu 


Arg 


Tyr 


Thr 


Cys 


Asn 


Asp 


Ser 


145 










150 










155 










160 



158 
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Ala Phe Val Leu Tyr Gly lie Glu Val He Ala Ser Thr Leu Ala Ala 

165 170 175 

Tyr His Glu Leu Asn Pro Pro Gin Gly He Leu Arg Val Pro Pro Leu 

180 185 190 

Val Arg Phe Lys Leu His Lys Leu Leu Asp Glu Asn Ala Asp Asp Met 

195 200 205 

Lys Gly Leu Leu Lys Pro He Tyr Leu Glu Ser Phe Arg Leu Thr Glu 

210 215 220 

Asn Val Gly Glu Glu Glu Gly His Ala Glu Thr Phe Asn He Phe Tyr 
225 230 235 240 

Cys Gly Thr He Phe Thr Arg His Leu His Asn Ala Ser Val Leu Lys 

245 250 255 

Tyr Phe Gin lie Thr Ser Leu His Ser He Pro Arg Gin Thr Leu Phe 
260 265 270 



48 



96 



144 



<210> 82 
<211> 1731 
<212> DMA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 
<221> CDS 
<222> (1) - . (1731) 

<400> 82 

atg ttc aag atg aac ccc gga ttt ggt teg acg tgt ttg gtc cat ccg 

Met Phe Lys Met Asn Pro Gly Phe Gly Ser Thr Cys Leu Val His Pro 

1 5 10 15 

acg gaa etc tec ate teg ctt ttc gag ate eta cag ggc aag tac gcg 
Thr Glu Leu Ser He Ser Leu Phe Glu He Leu Gin Gly Lys Tyr Ala 
20 25 30 

tac gtt cgc ggt caa acg eta cac tec agt ctt cga aat cca ggg ate 
Tyr Val Arg Gly Gin Thr Leu His Ser Ser Leu Arg Asn Pro Gly He 
35 40 45 

ttt gga aga cag ctg ttc ata cac ctg tac aag acg gca ttg ggg age 192 
Phe Gly Arg Gin Leu Phe He His Leu Tyr Lys Thr Ala Leu Gly Ser 
50 55 60 

tgc aca tac gac aac gtc tta aag gac tgg acc aac ttc gag acc acc 240 
Cys Thr Tyr Asp Asn Val Leu Lys Asp Trp Thr Asn Phe Glu Thr Thr 
65 70 75 80 

ctg aag acc cgt tgg cgc ggt gtc gag cac ctg acg ccg gaa ttc aaa 
Leu Lys Thr Arg Trp Arg Gly Val Glu His Leu Thr Pro Glu Phe Lys 
85 90 95 

egg tct acg ttt gaa teg tgg gca cgt acc gtt cgt eta aca gtc gat 
Arg Ser Thr Phe Glu Ser Trp Ala Arg Thr Val Arg Leu Thr Val Asp 
100 105 HO 

caa ctg tta ctg aac act att aat cag gta eta cac acc agg act gtt 
Gin Leu Leu Leu Asn Thr He Asn Gin Val Leu His Thr Arg Thr Val 
115 120 125 

tta teg tac gaa cga tac gtc gac tgg gtg gtc gec ctt ggg ttg gtg 
Leu Ser Tyr Glu Arg Tyr Val Asp Trp Val Val Ala Leu Gly Leu Val 



288 



336 



384 



432 



159 



130 



135 140 



cct att gtc agg cgc acc ccc gac ggc gac acg ate gec agg ata caa 
Pro He Val Arg Arg Thr Pro Asp Gly Asp Thr He Ala Arg He Gin 
145 150 155 160 

gca cac tgt caa cag atg agg aaa acg tac get tec gga gac gtt acg 
Ala His Cys Gin Gin Met Arg Lys Thr Tyr Ala Ser Gly Asp Val Thr 
165 170 175 

ata teg egg att gtc gac aaa ctg gca cag gag att acc teg ata atg 
He Ser Arg He Val Asp Lys Leu Ala Gin Glu He Thr Ser He Met 
180 185 190 

acc gat gtg acg tct att tat att cca gac tac gec gag gtg tct gtg 
Thr Asp Val Thr Ser He Tyr He Pro Asp Tyr Ala Glu Val Ser Val 
195 200 205 

gag ttt aac gga gat aag get get tat etc ggc acc tac aga caa aaa 
Glu Phe Asn Gly Asp Lys Ala Ala Tyr Leu Gly Thr Tyr Arg Gin Lys 
210 215 220 

gat ata aca gtt gaa gtt gtt tea egg cct ata att tac aat gga cgc 
Asr> He Thr Val Glu Val Val Ser Arg Pro He He Tyr Asn Gly Arg 
* 235 240 



225 



230 



gtc gee ttt gat age cct ctg tac egg tta ttt acc gcg att atg aca 
Val Ala Phe Asp Ser Pro Leu Tyr Arg Leu Phe Thr Ala He Met Thr 



245 



250 255 



tgt cac agg acc gcg gag cac gca aag tta tgt cag tta eta aat acc 
Cys His Arg Thr Ala Glu His Ala Lys Leu Cys Gin Leu Leu Asn Thr 
260 265 270 

gcg cct eta aaa gcg ctg gtc ggt age acg tgc aac gac atg tat aaa 
Ala Pro Leu Lys Ala Leu Val Gly Ser Thr Cys Asn Asp Met Tyr Lys 
275 280 285 

qac ata ttg get cgc tta gag caa teg tct caa aaa acc gac ccc aag 
Asp He Leu Ala Arg Leu Glu Gin Ser Ser Gin Lys Thr Asp Pro Lys 
290 295 300 

agg gag etc eta aac ttg ctg ata aag etc gcg gag aac aaa aca gtg 
Ara Glu Leu Leu Asn Leu Leu He Lys Leu Ala Glu Asn Lys Thr Val 
305 310 315 320 



acg aac cag ate aac gag cag ttc gat aca att tct aac ctt gag aaa 
Thr Asn Gin He Asn Glu Gin Phe Asp Thr He Ser Asn Leu Glu Lys 
370 375 380 



480 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



age ggc ata aca gac gtt gtt gag gat ttc gtc aca gac gtc tct caa 
Ser Gly He Thr Asp Val Val Glu Asp Phe Val Thr Asp Val Ser Gin 
325 330 335 

aac att gtg gac aaa aac aag etc ttt ggg acc ggt acg gaa agt acg 1056 
Asn He Val Asp Lys Asn Lys Leu Phe Gly Thr Gly Thr Glu Ser Thr 
340 345 350 

acc cag ggt etc egg aaa cag gtt tea aat acc gtg ttc aaa tgc ctg 1104 
Thr Gin Gly Leu Arg Lys Gin Val Ser Asn Thr Val Phe Lys Cys Leu 
355 360 365 



1152 



160 
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gaa cga gac gac tac gta aag aag att caa tgc att gaa acg cag ttg 1200 
Glu Arg Asp Asp Tyr Val Lys Lys He Gin Cys He Glu Thr Gin Leu 
385 390 395 400 

ctt cag agt tta cca gag gga ggc agg ccc aga cac gat att aat ata 1248 
Leu Gin Ser Leu Pro Glu Gly Gly Arg Pro Arg His Asp He Asn He 
405 410 415 

etc act cag aat acg ttg cag get tta tec ggt ctg cgc gac cca acc 12 96 
Leu Thr Gin Asn Thr Leu Gin Ala Leu Ser Gly Leu Arg Asp Pro Thr 
420 425 430 

att aac eta tec gaa tgt cac ate cct aaa ggc age tec gta gta aac 1344 
He Asn Leu Ser Glu Cys His He Pro Lys Gly Ser Ser Val Val Asn 
435 440 445 

age ttt ttt tea caa tac gtc cct ccg ttt atg gag atg etc aaa gag 1392 
Ser Phe Phe Ser Gin Tyr Val Pro Pro Phe Met Glu Met Leu Lys Glu 
450 455 460 

eta act age ctg tgg gaa ggg gaa atg ttt caa acg tac aac etc aca 144 0 
Leu Thr Ser Leu Trp Glu Gly Glu Met Phe Gin Thr Tyr Asn Leu Thr 



465 



470 475 480 



ccc gtg gtc gac aat cag ggg cag cga acg age ate gec tac teg cag 
Pro Val Val Asp Asn Gin Gly Gin Arg Thr Ser He Ala Tyr Ser Gin 
485 490 495 



ctg act cac atg gac etc ata aat cac tct etc att tec tta agt tta 
Leu Thr His Met Asp Leu He Asn His Ser Leu He Ser Leu Ser Leu 



cac gat att gca gac cag ctg tat gtg gac age agg ctg tct gtg tat 
His Asp He Ala Asp Gin Leu Tyr Val Asp Ser Arg Leu Ser Val Tyr 

r- t r- C/in 



ata aat gac ata ggc cac aaa tat tgc gaa caa att age cag cct gga 
He Asn Asp He Gly His Lys Tyr Cys Glu Gin He Ser Gin Pro Gly 



545 



550 555 560 



1488 



gac acg gtt tec att etc tta ggc ccg ttt acg tac ata ate gee aaa 1536 
Asp Thr Val Ser He Leu Leu Gly Pro Phe Thr Tyr He He Ala Lys 
500 505 510 



1584 



515 520 525 

c age agg ctg tct gtg tat 1632 

p Ser Arg 

530 535 540 



1680 



acc gat gga cca aat act gaa gcg tct aat ggg gga gca gca ccg ate 1728 
Thr Asp Gly Pro Asn Thr Glu Ala Ser Asn Gly Gly Ala Ala Pro He 
565 570 575 



tga 



1731 



<210> 83 
<211> 576 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 83 

Met Phe Lys Met Asn Pro Gly Phe Gly Ser Thr Cys Leu Val His Pro 
15 10 15 



161 



Thr Glu Leu Ser lie Ser Leu Phe Glu lie Leu Gin Gly Lys Tyr Ala 

20 25 30 

Tyr Val Arg Gly Gin Thr Leu His Ser Ser Leu Arg Asn Pro Gly lie 

35 40 v 45 

Phe Gly Arg Gin Leu Phe He His Leu Tyr Lys Thr Ala Leu Gly Ser 

50 55 60 

Cys Thr Tyr Asp Asn Val Leu Lys Asp Trp Thr Asn Phe Glu Thr Thr 
65 70 75 80 

Leu Lys Thr Arg Trp Arg Gly Val Glu His Leu Thr Pro Glu Phe Lys 

85 90 95 

Arg Ser Thr Phe Glu Ser Trp Ala Arg Thr Val Arg Leu Thr Val Asp 

100 105 110 

Gin Leu Leu Leu Asn Thr He Asn Gin Val Leu His Thr Arg Thr Val 

115 120 125 

Leu Ser Tyr Glu Arg Tyr Val Asp Trp Val Val Ala Leu Gly Leu Val 

130 135 140 

Pro He Val Arg Arg Thr Pro Asp Gly Asp Thr He Ala Arg He Gin 
145 150 155 160 

Ala His Cys Gin Gin Met Arg Lys Thr Tyr Ala Ser Gly Asp Val Thr 

165 170 175 

He Ser Arg He Val Asp Lys Leu Ala Gin Glu He Thr Ser He Met 

180 185 190 

Thr Asp Val Thr Ser He Tyr He Pro Asp Tyr Ala Glu Val Ser Val 

195 200 205 

Glu Phe Asn Gly Asp Lys Ala Ala Tyr Leu Gly Thr Tyr Arg Gin Lys 

210 215 220 

Asp He Thr Val Glu Val Val Ser Arg Pro He lie Tyr Asn Gly Arg 
225 230 235 240 

Val Ala Phe Asp Ser Pro Leu Tyr Arg Leu Phe Thr Ala He Met Thr 

245 250 255 

Cys His Arg Thr Ala Glu His Ala Lys Leu Cys Gin Leu Leu Asn Thr 

260 265 270 

Ala Pro Leu Lys Ala Leu Val Gly Ser Thr Cys Asn Asp Met Tyr Lys 

275 280 285 

Asp He Leu Ala Arg Leu Glu Gin Ser Ser Gin Lys Thr Asp Pro Lys 

290 295 300 

Arg Glu Leu Leu Asn Leu Leu He Lys Leu Ala Glu Asn Lys Thr Val 
305 310 315 320 

Ser Gly He Thr Asp Val Val Glu Asp Phe Val Thr Asp Val Ser Gin 

325 330 335 

Asn He Val Asp Lys Asn Lys Leu Phe Gly Thr Gly Thr Glu Ser Thr 

340 345 350 

Thr Gin Gly Leu Arg Lys Gin Val Ser Asn Thr Val Phe Lys Cys Leu 

355 360 365 

Thr Asn Gin He Asn Glu Gin Phe Asp Thr He Ser Asn Leu Glu Lys 

370 375 380 

Glu Arg Asp Asp Tyr Val Lys Lys He Gin Cys He Glu Thr Gin Leu 
385 390 395 ' 400 

Leu Gin Ser Leu Pro Glu Gly Gly Arg Pro Arg His Asp lie Asn lie 

405 410 415 

Leu Thr Gin Asn Thr Leu Gin Ala Leu Ser Gly Leu Arg Asp Pro Thr 

420 425 430 

He Asn Leu Ser Glu Cys His He Pro Lys Gly Ser Ser Val Val Asn 

435 440 445 

Ser Phe Phe Ser Gin Tyr Val Pro Pro Phe Met Glu Met Leu Lys Glu 

450 455 460 

Leu Thr Ser Leu Trp Glu Gly Glu Met Phe Gin Thr Tyr Asn Leu Thr 
465 470 475 480 

Pro Val Val Asp Asn Gin Gly Gin Arg Thr Ser He Ala Tyr Ser Gin 

485 490 495 

Asp Thr Val Ser He Leu Leu Gly Pro Phe Thr Tyr He lie Ala Lys 



162 



U 1 









500 










505 










510 






Leu 


Thr 


His 


Met 


Asp 


Leu 


He 


Asn 


His 


Ser 


Leu 


He 


Ser 


Leu 


Ser 


Leu 






515 










520 










525 








His 


Asp 


lie 


Ala 


Asp 


Gin 


Leu 


Tyr 


Val 


Asp 


Ser 


Arg 


Leu 


Ser 


Val 


Tyr 




530 










535 










540 










lie 


Asn 


Asp 


lie 


Gly 


His 


Lys 


Tyr 


Cys 


Glu 


Gin 


He 


Ser 


Gin 


Pro 


Gly 


545 










550 










555 










560 


Thr 


Asp 


Gly 


Pro 


Asn 


Thr 


Glu 


Ala 


Ser 


Asn 


Gly 


Gly 


Ala 


Ala 


Pro 


He 






565 










570 










575 





<210> 84 
<211> 2373 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 
<221> CDS 
<222> (1) . - (2373) 

<400> 84 

atg gag agt tec gtc gga tgg acc aaa cac gtc gaa cca aat ccg ggg 

Met Glu Ser Ser Val Gly Trp Thr Lys His Val Glu Pro Asn Pro Gly 
15 10 15 

ttc ate ttg aac atg acg tec gat gec aaa gtc agg ggt gtc gtg gat 
Phe He Leu Asn Met Thr Ser Asp Ala Lys Val Arg Gly Val Val Asp 
20 25 30 

cac gtc agt cgc ctg tea aat ata act acc age cca ccg gaa atg ggt 144 
His Val Ser Arg Leu Ser Asn He Thr Thr Ser Pro Pro Glu Met Gly 
35 40 45 

tgg tac gac ctg gec ttc gat ccg get gaa gac tec ggg ccg ttc ttg 192 
Trp Tyr Asp Leu Ala Phe Asp Pro Ala Glu Asp Ser Gly Pro Phe Leu 
50 55 60 



ccg ttt acc gtt tat eta att acg gga act get ggt get ggg aaa agt 
Pro Phe Thr Val Tyr Leu He Thr Gly Thr Ala Gly Ala Gly Lys Ser 

70 75 80 



65 



att cag caa cag gag eta teg aag tat tgg ccg gtt ata gtg gac att 
He Gin Gin Gin Glu Leu Ser Lys Tyr Trp Pro Val He Val Asp He 
145 150 155 160 



48 



96 



240 



288 



acc age ata teg gec ctg tac caa aat tta aac tgc ctg ate acg ggc 
Thr Ser He Ser Ala Leu Tyr Gin Asn Leu Asn Cys Leu He Thr Gly 
85 90 95 

gcg acc acc ata gec gca cag aac eta teg cgt cgc eta aag acg ttc 336 
Ala Thr Thr He Ala Ala Gin Asn Leu Ser Arg Arg Leu Lys Thr Phe 
100 105 HO 

tgt ccc acg ate ttc age get ttt ggc ttt aag age cga cac ate aat 384 
Cys Pro Thr He Phe Ser Ala Phe Gly Phe Lys Ser Arg His He Asn 
115 120 125 

ata gec gtc aga aaa get cat cag acc gga gec gta tec ata gag caa 432 
He Ala Val Arg Lys Ala His Gin Thr Gly Ala Val Ser He Glu Gin 
130 135 140 



480 



163 
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atg aaa gag gtt atg gcg aaa aaa ccc aat ggc atg tac ggg act ata 528 
Met Lys Glu Val Met Ala Lys Lys Pro Asn Gly Met Tyr Gly Thr lie 
165 170 175 

tec aac gcg aat ttt gaa acc etc teg aga atg acc gga ccg tgt tta 576 
Ser Asn Ala Asn Phe Glu Thr Leu Ser Arg Met Thr Gly Pro Cys Leu 
180 185 190 



tgg act tec aat att att gta ate gac gag gec gga acc ctg tec tct 
Trp Thr Ser Asn lie He Val He Asp Glu Ala Gly Thr Leu Ser Ser 
195 200 205 

tac ata etc acc acc gtc gtg ttc ttt tac tgg ttc eta aac age tgg 
Tyr He Leu Thr Thr Val Val Phe Phe Tyr Trp Phe Leu Asn Ser Trp 
210 215 220 

eta aat acc cct ctt tac cgc cag ggg gcg gtt ccg tgc ata gta tgc 
Leu Asn Thr Pro Leu Tyr Arg Gin Gly Ala Val Pro Cys He Val Cys 
225 230 235 240 

gtc ggt teg cca acg cag aca aac gcg ttc cag tea act tac aac cac 
Val Gly Ser Pro Thr Gin Thr Asn Ala Phe Gin Ser Thr Tyr Asn His 
245 250 255 

ggg acg caa aag acg gag ata teg teg tgc gag aac ate eta aca ttc 
Gly Thr Gin Lys Thr Glu He Ser Ser Cys Glu Asn He Leu Thr Phe 
260 265 270 

atg ata ggg aag aag gtc gta tct gag tac gta cac ttg gag aga aac 
Met He Gly Lys Lys Val Val Ser Glu Tyr Val His Leu Glu Arg Asn 
275 280 285 

tgg gcg ctg ttt ata aac aac aag cgc tgc acc gat ctg cag ttc gga 
Trp Ala Leu Phe He Asn Asn Lys Arg Cys Thr Asp Leu Gin Phe Gly 
290 295 300 

cac ctt eta aaa att tta gag tat aat ctt ccc att cct gac gaa gtc 
His Leu Leu Lys He Leu Glu Tyr Asn Leu Pro He Pro Asp Glu Val 
305 310 315 320 

atg agt tac gta gac aga ttt gtc gtt cct aaa agt aag att atg gat 
Met Ser Tyr Val Asp Arg Phe Val Val Pro Lys Ser Lys He Met Asp 
325 330 335 

cct tta gaa tac att ggc tgg acc cga etc ttt ttg tea cat age gag 
Pro Leu Glu Tyr He Gly Trp Thr Arg Leu Phe Leu Ser His Ser Glu 
340 345 350 

gta aag gcg tat eta aca aac eta cac aca tgt eta acg eta ggg ggc 
Val Lys Ala Tyr Leu Thr Asn Leu His Thr Cys Leu Thr Leu Gly Gly 
355 360 365 

gat acc agg gac acg aag etc ttt acc tgt ccc gtg gtg tgc gag gtg 1152 
Asp Thr Arg Asp Thr Lys Leu Phe Thr Cys Pro Val Val Cys Glu Val 
370 375 380 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



ttt gtg aag ccg ttt gag gaa tac aag egg gec gtc aac etc acc aac 
Phe Val Lys Pro Phe Glu Glu Tyr Lys Arg Ala Val Asn Leu Thr Asn 
385 390 395 , 400 



1200 



164 



etc acc gtg acc gag tgg gtg aca aaa aac etc ttt aag tta agt aat 1248 
Leu Thr Val Thr Glu Trp Val Thr Lys Asn Leu Phe Lys Leu Ser Asn 
405 410 415 

tat teg cag ttt gtg gac cag gac atg tec ata gtc gec acg gaa tec 1296 
Tyr Ser Gin Phe Val Asp Gin Asp Met Ser lie Val Ala Thr Glu Ser 
420 425 430 

act gaa cgt teg acc cag gtt acc ttt ate acc aag ttt gtt aaa aac 1344 
Thr Glu Arg Ser Thr Gin Val Thr Phe He Thr Lys Phe Val Lys Asn 
435 440 445 

age cac gtg tct tta aat gga aaa aca aaa aaa tgc ata tgt ggg ttt 1392 
Ser His Val Ser Leu Asn Gly Lys Thr Lys Lys Cys He Cys Gly Phe 
450 455 460 



cag ggt acg tac ttt gag ttc aaa aga ate eta gac age gaa etc ttc 
Gin Gly Thr Tyr Phe Glu Phe Lys Arg He Leu Asp Ser Glu Leu Phe 
470 475 480 



aac ggg ctg ttg gcg ttt gca teg acc gtt gaa teg tat aaa att aag 

Asn Gly Leu Leu Ala Phe Ala Ser Thr Val Glu Ser Tyr Lys He Lys 

625 630 635 640 

ggg tac acg ttt tta ccg gta gcg ttc ggt cgc tgt cag ggt ttg cca 

165 



1440 



1488 



465 

gtg gag acg cat teg caa gac cgt ccg gaa tac gta tac ggt ttc tta 

Val Glu Thr His Ser Gin Asp Arg Pro Glu Tyr Val Tyr Gly Phe Leu 
485 490 495 

aac acg eta ctg tac aac gec atg tac teg ttt cac gcg tac ggc gtg 1536 
Asn Thr Leu Leu Tyr Asn Ala Met Tyr Ser Phe His Ala Tyr Gly Val 
500 505 510 

act agg tea cac gag aaa tat ctg caa gac eta aaa ttt gca ccc etc 1584 
Thr Arg Ser His Glu Lys Tyr Leu Gin Asp Leu Lys Phe Ala Pro Leu 
515 520 525 

ccg gec get ctg gca acc ggg cgt gta gac ctt caa acg gtt cgt gaa 1632 
Pro Ala Ala Leu Ala Thr Gly Arg Val Asp Leu Gin Thr Val Arg Glu 
530 535 540 

gag tta aac ctg gaa gac gac ate ttc tac cac gtg tgc agt ccc ccg 1680 
Glu Leu Asn Leu Glu Asp Asp He Phe Tyr His Val Cys Ser Pro Pro 
545 550 555 560 

cca ccc gcg ggt ate acc tec etc cag gtt ttg gtc gac acg tac tgc 1728 
Pro Pro Ala Gly He Thr Ser Leu Gin Val Leu Val Asp Thr Tyr Cys 
565 570 575 

gec eta aag gac gtg ttc gec tec aga ata aag gtg gcg tgt cgc tgg 1776 
Ala Leu Lys Asp Val Phe Ala Ser Arg He Lys Val Ala Cys Arg Trp 
580 585 590 

ttt ggc ggg gag ttt gag aag gaa acg ttt tec gcg ttt acg gtt aac 1824 
Phe Gly Gly Glu Phe Glu Lys Glu Thr Phe Ser Ala Phe Thr Val Asn 
595 600 605 

atg gtc gta agg gac gga gtt gac ttt gtc tec cct tea gaa cgt etc 1872 
Met Val Val Arg Asp Gly Val Asp Phe Val Ser Pro Ser Glu Arg Leu 
610 615 620 



1920 



1968 



Gly Tyr Thr Phe Leu Pro Val Ala Phe Gly Arg Cys Gin Gly Leu Pro 
645 650 655 

etc agt gat gac etc agg aag aag atg ccc tec ctg gtc gtg cag gac 2016 

Leu Ser Asp Asp Leu Arg Lys Lys Met Pro Ser Leu Val Val Gin Asp 
660 665 670 

tct age ggt ttt ate gcg tgc eta gag aat aac ata acc aaa ttg ace 2064 

Ser Ser Gly Phe lie Ala Cys Leu Glu Asn Asn lie Thr Lys Leu Thr 
675 680 685 

gaa acc atg gag gac ggg age gtt ttc caa gtg tgc tgt gcg ggg gac 2112 

Glu Thr Met Glu Asp Gly Ser Val Phe Gin Val Cys Cys Ala Gly Asp 
690 695 700 



2160 



2208 



tat ggg gtc age tea aat tta gec atg acc ate gta aag gca cag gga 
Tyr Gly Val Ser Ser Asn Leu Ala Met Thr lie Val Lys Ala Gin Gly 
705 710 715 720 

atg teg ttg gag cga gta gee gta gta ttt ggg tec cac aag aac gtc 
Met Ser Leu Glu Arg Val Ala Val Val Phe Gly Ser His Lys Asn Val 
725 730 735 

cag aca age cac gtg tat gta gca ata tea agg get gtt aac tea aat 2256 
Gin Thr Ser His Val Tyr Val Ala lie Ser Arg Ala Val Asn Ser Asn 
740 745 750 

tat ttg gtc atg gac age aac ccc ctt aaa acc etc etc aga gaa cca 2304 
Tyr Leu Val Met Asp Ser Asn Pro Leu Lys Thr Leu Leu Arg Glu Pro 
755 760 765 

gtc gat aac acc tec gec aag cat ata gtc cgc gee etc cac aac cca 2352 
Val Asp Asn Thr Ser Ala Lys His He Val Arg Ala Leu His Asn Pro 
770 775 780 

aac aca acc etc ate tac taa 2373 
Asn Thr Thr Leu He Tyr 
785 790 



<210> 85 
<211> 790 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 



<400> 85 

Met Glu Ser Ser Val Gly Trp Thr Lys His Val Glu Pro Asn Pro Gly 

15 10 15 

Phe He Leu Asn Met Thr Ser Asp Ala Lys Val Arg Gly Val Val Asp 

20 25 30 

His Val Ser Arg Leu Ser Asn lie Thr Thr Ser Pro Pro Glu Met Gly 

35 40 45 

Trp Tyr Asp Leu Ala Phe Asp Pro Ala Glu Asp Ser Gly Pro Phe Leu 

50 55 60 

Pro Phe Thr Val Tyr Leu He Thr Gly Thr Ala Gly Ala Gly Lys Ser 
65 70 75 80 

Thr Ser He Ser Ala Leu Tyr Gin Asn Leu Asn Cys Leu He Thr Gly 

85 90 " 95 

Ala Thr Thr He Ala Ala Gin Asn Leu Ser Arg Arg Leu Lys Thr Phe 

100 105 110 

Cys Pro Thr He Phe Ser Ala Phe Gly Phe Lys Ser Arg His He Asn 

166 
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115 120 125 

lie Ala Val Arg Lys Ala His Gin Thr Gly Ala Val Ser lie Glu Gin 

130 135 140 

He Gin Gin Gin Glu Leu Ser Lys Tyr Trp Pro Val He Val Asp He 
145 150 155 160 

Met Lys Glu Val Met Ala Lys Lys Pro Asn Gly Met Tyr Gly Thr He 

165 170 175 

Ser Asn Ala Asn Phe Glu Thr Leu Ser Arg Met Thr Gly Pro Cys Leu 

180 185 190 

Trp Thr Ser Asn He He Val He Asp Glu Ala Gly Thr Leu Ser Ser 

195 200 205 

Tyr He Leu Thr Thr Val Val Phe Phe Tyr Trp Phe Leu Asn Ser Trp 

210 215 220 

Leu Asn Thr Pro Leu Tyr Arg Gin Gly Ala Val Pro Cys He Val Cys 
225 230 235 240 

Val Gly Ser Pro Thr Gin Thr Asn Ala Phe Gin Ser Thr Tyr Asn His 

245 250 255 

Gly Thr Gin Lys Thr Glu He Ser Ser Cys Glu Asn He Leu Thr Phe 

260 265 270 

Met He Gly Lys Lys Val Val Ser Glu Tyr Val His Leu Glu Arg Asn 

275 280 285 

Trp Ala Leu Phe He Asn Asn Lys Arg Cys Thr Asp Leu Gin Phe Gly 

290 295 300 

His Leu Leu Lys He Leu Glu Tyr Asn Leu Pro He Pro Asp Glu Val 
305 310 315 320 

Met Ser Tyr Val Asp Arg Phe Val Val Pro Lys Ser Lys He Met Asp 

325 330 335 

Pro Leu Glu Tyr He Gly Trp Thr Arg Leu Phe Leu Ser His Ser Glu 

340 345 350 

Val Lys Ala Tyr Leu Thr Asn Leu His Thr Cys Leu Thr Leu Gly Gly 

355 360 365 

Asp Thr Arg Asp Thr Lys Leu Phe Thr Cys Pro Val Val Cys Glu Val 

370 375 380 

Phe Val Lys Pro Phe Glu Glu Tyr Lys Arg Ala Val Asn Leu Thr Asn 
385 390 395 400 

Leu Thr Val Thr Glu Trp Val Thr Lys Asn Leu Phe Lys Leu Ser Asn 

405 410 415 

Tyr Ser Gin Phe Val Asp Gin Asp Met Ser He Val Ala Thr Glu Ser 

420 425 430 

Thr Glu Arg Ser Thr Gin Val Thr Phe He Thr Lys Phe Val Lys Asn 

435 440 445 

Ser His Val Ser Leu Asn Gly Lys Thr Lys Lys Cys He Cys Gly Phe 

450 455 460 

Gin Gly Thr Tyr Phe Glu Phe Lys Arg He Leu Asp Ser Glu Leu Phe 
465 470 475 480 

Val Glu Thr His Ser Gin Asp Arg Pro Glu Tyr Val Tyr Gly Phe Leu 

485 490 495 

Asn Thr Leu Leu Tyr Asn Ala Met Tyr Ser Phe His Ala Tyr Gly Val 

500 505 510 

Thr Arg Ser His Glu Lys Tyr Leu Gin Asp Leu Lys Phe Ala Pro Leu 

515 520 525 

Pro Ala Ala Leu Ala Thr Gly Arg Val Asp Leu Gin Thr Val Arg Glu 

530 535 540 

Glu Leu Asn Leu Glu Asp Asp He Phe Tyr His Val Cys Ser Pro Pro 
545 550 555 560 

Pro Pro Ala Gly He Thr Ser Leu Gin Val Leu Val Asp Thr Tyr Cys 

565 570 575 

Ala Leu Lys Asp Val Phe Ala Ser Arg He Lys Val Ala Cys Arg Trp 

580 585 590 

Phe Gly Gly Glu Phe Glu Lys Glu Thr Phe Ser Ala Phe Thr Val Asn 
595 600 605 



167 
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Met 


Val 


Val 


Arg 


Asp 


uiy 


v a x 


Asp 


Phe 


Val 


Ser 


Pro 


Ser 


Glu 


Arg 


Leu 




610 










615 










620 










Asn 


Gly 


Leu 


Leu 


Ala 

Ai a 


Phe 


a J. a 


Ser 


Thr 


Val 


Glu 


Ser 


Tvr 


Lys 


He 


Lys 


62 5 








D J U 










6 3 5 










640 


Gly 


Tyr 


Thr 


trie 


Leu 


Pro 


Val 


Ala 


Phe 


Gly 


Arg 


Cys 


Gin 


Glv 


Leu 


Pro 






D 4t Z> 










6 5 0 










655 




Leu 


Ser 


Asp 


Asp 


Leu 


Arg 


Ly s 


ys 


Met 


Pro 


Ser 


Leu 


Val 


Val 


Gin 


Asp 








66 0 










665 










670 






Ser 


Ser 


Gly 


Phe 


lie 


Ala 


Cys 


Leu 


pin 


Asn 


Asn 


He 


Thr 


Lys 


Leu 


Thr 






675 










6 8 0 










6 85 








Glu 


Thr 


Met 


Glu 


Asp 


Gly 


Ser 


XTzz 1 
Vdl 




Gin 


Val 


Cys 


Cvs 


Ala 


Glv 


Asp 




690 










O D 










700 










Tyr 


Gly 


va jl 


Ser 


DCI 


Asn 




Ala 


Met 


Thr 


He 


Val 


Lys 


Ala 


Gin 


Gly 


705 








710 










715 










720 


Met 


Ser 


Leu 


Glu 


Arg 


Val 


Ala 


Val 


Val 


Phe 


Gly 


Ser 


His 


Lys 


Asn 


Val 










725 










730 










735 




Gin 


Thr 


Ser 


His 


Val 


Tyr 


Val 


Ala 


He 


Ser 


Arg 


Ala 


Val 


Asn 


Ser 


Asn 








740 










745 










750 






Tyr 


Leu 


Val 


Met 


Asp 


Ser 


Asn 


Pro 


Leu 


Lys 


Thr 


Leu 


Leu 


Arg 


Glu 


Pro 




755 










760 










765 








Val 


Asp 


Asn 


Thr 


Ser 


Ala 


Lys 


His 


He 


Val 


Arg 


Ala 


Leu 


His 


Asn 


Pro 




770 










775 










780 










Asn 


Thr 


Thr 


Leu 


He 


Tyr 























cat teg gag aac age gac tac agt gat gtg gat gaa gaa gac gaa ccg 
His Ser Glu Asn Ser Asp Tyr Ser Asp Val Asp Glu Glu Asp Glu Pro 
85 90 95 



48 



96 



<210> 86 
<211> 1059 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 
<221> CDS 
<222> (1) . . (1059) 

<400> 86 

atg get atg ttc tta tec gac ccg tea agg ace ccg ccg gec act ccc 

Met Ala Met Phe Leu Ser Asp Pro Ser Arg Thr Pro Pro Ala Thr Pro 
1 5 10 15 

agg atg ctg ccg att ccc ggg gca cct agg aaa aag aga aca aga cgt 
Arg Met Leu Pro He Pro Gly Ala Pro Arg Lys Lys Arg Thr Arg Arg 
20 25 30 

ttt ttg ttc gee ggg tec cgt acg ggg ttg ccg gta ccg ccc ggg tac 144 
Phe Leu Phe Ala Gly Ser Arg Thr Gly Leu Pro Val Pro Pro Gly Tyr 
35 40 45 

gga ggc ccc cca gtg att gac atg act gca cct gag gat gtt ttt gac 192 
Gly Gly Pro Pro Val He Asp Met Thr Ala Pro Glu Asp Val Phe Asp 
50 55 60 

caa gat tec cct ccc act act cca aaa acg cca gac gaa acc gac tct 240 
Gin Asp Ser Pro Pro Thr Thr Pro Lys Thr Pro Asp Glu Thr Asp Ser 
65 70 75 80 



288 



168 



ccg gtc tec tec cca cct aga ata gat ccg cac gca egg gac ggt gag 3 3 6 

Pro Val Ser Ser Pro Pro Arg lie Asp Pro His Ala Arg Asp Gly Glu 

100 105 110 

tec ttt aac cag age ggc egg eta cct acc gtc att act teg aca ggt 3 84 

Ser Phe Asn Gin Ser Gly Arg Leu Pro Thr Val lie Thr Ser Thr Gly 

115 120 125 

gec acc aca cca ccg age gcg ccc gcg cca ctg acc gcg ttt ggt ggt 432 

Ala Thr Thr Pro Pro Ser Ala Pro Ala Pro Leu Thr Ala Phe Gly Gly 

130 135 140 

ccg egg ccc gtt gcg gtg gtt act ggc caa cac cgc gca ccc cag tea 4 80 

Pro Arg Pro Val Ala Val Val Thr Gly Gin His Arg Ala Pro Gin Ser 

145 150 155 160 

tct gaa tea gac age gaa gac gat ttt ttt ata gat gac tac gag gac 52 8 

Ser Glu Ser Asp Ser Glu Asp Asp Phe Phe lie Asp Asp Tyr Glu Asp 

165 170 175 

act gat gaa agt ggc gga gaa get gac ggc ttt teg cca egg gec teg 576 

Thr Asp Glu Ser Gly Gly Glu Ala Asp Gly Phe Ser Pro Arg Ala Ser 

180 185 190 

ccc gec tgg tea ggt gat ata tct aga tea ccg gca gag ggc gga tgg 62 4 

Pro Ala Trp Ser Gly Asp lie Ser Arg Ser Pro Ala Glu Gly Gly Trp 

195 200 205 

tct tec gat gaa gaa gaa cct gtt gtt get gga tea act gca gga caa 672 

Ser Ser Asp Glu Glu Glu Pro Val Val Ala Gly Ser Thr Ala Gly Gin 

210 215 220 

gag acc att att att tea gat gac gac gaa gtg gac gac cga ggc tct 72 0 

Glu Thr He lie He Ser Asp Asp Asp Glu Val Asp Asp Arg Gly Ser 

225 230 235 240 

gtt gag aca tgg gac gaa tea gac gec gat gag ggc acg ggc gec act 768 

Val Glu Thr Trp Asp Glu Ser Asp Ala Asp Glu Gly Thr Gly Ala Thr 

245 250 255 



816 



gat gtg att gac ctg tgt tea tct tea gac age gac gac gat gca gat 

Asp Val He Asp Leu Cys Ser Ser Ser Asp Ser Asp Asp Asp Ala Asp 

260 265 270 

cac gtt aca age ggc ggt gtt cgt gee gee tgc aag agg cgc gcg tea 8 64 

His Val Thr Ser Gly Gly Val Arg Ala Ala Cys Lys Arg Arg Ala Ser 

275 280 285 

cgc agg gac tgc aac ggt gat gac gac gtg att tat gtt ggt aca acc 912 

Arg Arg Asp Cys Asn Gly Asp Asp Asp Val He Tyr Val Gly Thr Thr 
290 295 300 

cag ggc cct aag aga cgt atg acc teg acc acg ggg ggc ggg get act 960 

Gin Gly Pro Lys Arg Arg Met Thr Ser Thr Thr Gly Gly Gly Ala Thr 
305 310 315 320 

tct aac cca gag ggc ccg ggc gtt tct ggt cgt caa act atg gcg gec 1008 

Ser Asn Pro Glu Gly Pro Gly Val Ser Gly Arg Gin Thr Met Ala Ala 
325 330 335 

acc ccc ccg gtg tgt gga aat gac aac tat ccg tgg ccg tgg ttg gac 1056 



169 



Thr Pro Pro Val Cys Gly Asn Asp Asn Tyr Pro Trp Pro Trp Leu Asp 
340 345 350 



tga 



<210> 87 
<211> 352 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 



<400> 87 



Met 


Ala 


Met 


Phe 


Leu 


Ser 


Asp 


Pro 


Ser 


Arg 


Thr 


Pro 


Pro 


Ala 


Thr 


Pro 


1 








5 










10 










15 




Arcr 


Met 


Leu 


Pro 


He 


Pro 


Gly 


Ala 


Pro 


Arg 


Lys 


Lys 


Arg 


Thr 


Arg 


Arg 






20 










25 










30 






Phe 


Leu 


Phe 


Ala 


Gly 


Ser 


Arg 


Thr 


Giy 


Leu 


Pro 


Val 


Pro 


Pro 


Gly 


Tyr 






35 










40 










45 








Glv 


Gly 


Pro 


Pro 


Val 


He 


Asp 


Met 


Thr 


Ala 


Pro 


Glu 


Asp 


Val 


Phe 


Asp 


50 










55 










60 










Gin 


Asp 


Ser 


Pro 


Pro 


Thr 


Thr 


Pro 


Lys 


Thr 


Pro 


Asp 


Glu 


Thr 


Asp 


Ser 


65 








70 










75 










80 


His 


Ser 


Glu 


Asn 


Ser 


Asp 


Tyr 


Ser 


Asp 


Val 


Asp 


Glu 


Glu 


Asp 


Glu 


Pro 










85 










90 










95 




Pro 


Val 


Ser 


Ser 


Pro 


Pro 


Arg 


He 


Asp 


Pro 


His 


Ala 


Arg 


Asp 


Gly 


Glu 








100 










105 










110 






Ser 


Phe 


Asn 


Gin 


Ser 


Gly 


Arg 


Leu 


Pro 


Thr 


Val 


He 


Thr 


Ser 


Thr 


Gly 






115 










120 










125 








Ala 


Thr 


Thr 


Pro 


Pro 


Ser 


Ala 


Pro 


Ala 


Pro 


Leu 


Thr 


Ala 


Phe 


Gly 


Gly 




130 










135 










140 










Pro 


Arg 


Pro 


Val 


Ala 


Val 


Val 


Thr 


Gly 


Gin 


His 


Arg 


Ala 


Pro 


Gin 


Ser 


145 










150 










155 










160 


Ser 


Glu 


Ser 


Asp 


Ser 


Glu 


Asp 


Asp 


Phe 


Phe 


He 


Asp 


Asp 


Tyr 


Glu 


Asp 










165 










170 










175 




Thr 


Asp 


Glu 


Ser 


Gly 


Gly 


Glu 


Ala 


Asp 


Gly 


Phe 


Ser 


Pro 


Arg 


Ala 


Ser 








180 










185 










190 






Pro 


Ala 


Trp 


Ser 


Gly 


Asp 


He 


Ser 


Arg 


Ser 


Pro 


Ala 


Glu 


Gly 


Gly 


Trp 






195 










200 










205 








Ser 


Ser 


As P 


Glu 


Glu 


Glu 


Pro 


Val 


Val 


Ala 


Gly 


Ser 


Thr 


Ala 


Gly 


Gin 




210 










215 










220 










Glu 


Thr 


He 


He 


He 


Ser 


Asp 


Asp 


Asp 


Glu 


Val 


Asp 


Asp 


Arg 


Gly 


Ser 


225 










230 










235 










240 


Val 


Glu 


Thr 


Trp 


Asp 


Glu 


Ser 


Asp 


Ala 


Asp 


Glu 


Gly 


Thr 


Gly 


Ala 


Thr 










245 










250 










255 




Asp 


Val 


He 


Asp 


Leu 


Cys 


Ser 


Ser 


Ser 


Asp 


Ser 


Asp 


Asp 


Asp 


Ala 


Asp 








260 










265 










270 






His 


Val 


Thr 


Ser 


Gly 


Gly 


Val 


Arg 


Ala 


Ala 


Cys 


Lys 


Arg 


Arg 


Ala 


Ser 






275 










280 










285 








Arg 


Arg 


Asp 


Cys 


Asn 


Gly 


Asp 


Asp 


Asp 


Val 


He 


Tyr 


Val 


Gly 


Thr 


Thr 




290 










295 










300 










Gin 


Gly 


Pro 


Lys 


Arg 


Arg 


Met 


Thr 


Ser 


Thr 


Thr 


Gly 


Gly 


Gly 


Ala 


Thr 


305 










310 










315 










320 


Ser 


Asn 


Pro 


Glu 


Gly 


Pro 


Gly 


Val 


Ser 


Gly 


Arg 


Gin 


Thr 


Met 


Ala 


Ala 










325 










330 










335 




Thr 


Pro 


Pro 


Val 


Cys 


Gly 


Asn 


Asp 


Asn 


Tyr 


Pro 


Trp 


Pro 


Trp 


Leu 


Asp 



340 345 350 



<210> 88 



170 



W 
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48 



96 



144 



<211> 768 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 
<221> CDS 
<222> (1) . - (768) 

<400> 88 

atg gag ggt tgg ctt aaa aca ata gtt tgg agt aaa atg tea cca gaa 

Met Glu Gly Trp Leu Lys Thr lie Val Trp Ser Lys Met Ser Pro Glu 

1 5 10 15 

gtg tta gaa gaa ccc tec aca caa acg ttg eta ctt age gac tea tgg 
Val Leu Glu Glu Pro Ser Thr Gin Thr Leu Leu Leu Ser Asp Ser Trp 
20 25 30 

ctt gaa ttt tta aat ctt tct cca ttt tta aaa cag aag etc gcg get 
Leu Glu Phe Leu Asn Leu Ser Pro Phe Leu Lys Gin Lys Leu Ala Ala 
35 40 45 

ctg ctg aag cgc gtt atg gat atg age aac gta ace gtg att tat cca 192 
Leu Leu Lys Arg Val Met Asp Met Ser Asn Val Thr Val lie Tyr Pro 
50 55 60 

ccg ata gat aga att atg tgg tgg teg tat tgt tgc gaa ccg gag gat 240 
Pro lie Asp Arg lie Met Trp Trp Ser Tyr Cys Cys Glu Pro Glu Asp 
65 70 75 80 

att aaa gtc gtg ate ctt ggc cag gat cct tac cat cgc ggt caa gee 288 
lie Lys Val Val lie Leu Gly Gin Asp Pro Tyr His Arg Gly Gin Ala 
85 90 95 

ace gga eta gee ttt age gtt get ccg gac tac agt ata cct cca age 336 
Thr Gly Leu Ala Phe Ser Val Ala Pro Asp Tyr Ser He Pro Pro Ser 
100 105 HO 

etc aaa aat att ttt aaa gag ata gee aat act gta cct ggg ttc acc 384 
Leu Lys Asn He Phe Lys Glu He Ala Asn Thr Val Pro Gly Phe Thr 
115 120 125 

get cct tct cac ggg tgc tta gac tgt tgg gca aaa egg gga gtt ctg 432 
Ala Pro Ser His Gly Cys Leu Asp Cys Trp Ala Lys Arg Gly Val Leu 
130 135 140 

ctt tta aac acc att ctg acg gtg gaa aga ggg aag gcg ggg tea cac 480 
Leu Leu Asn Thr He Leu Thr Val Glu Arg Gly Lys Ala Gly Ser His 
145 150 155 160 

gec aac ctt ggc tgg gat tgg ttc acc age tac ata ata age tgc ctt 528 
Ala Asn Leu Gly Trp Asp Trp Phe Thr Ser Tyr He He Ser Cys Leu 
165 170 175 

tct gec aag ctt cag agg tgc gtt ttt atg ctg tgg gga aga aag get 576 
Ser Ala Lys Leu Gin Arg Cys Val Phe Met Leu Trp Gly Arg Lys Ala 
180 185 190 



ata gac aag gcg acg ctg ata aac gga cag aga cat etc gtc etc aag 
He Asp Lys Ala Thr Leu He Asn Gly Gin Arg His Leu Val Leu Lys 
195 200 205 



624 



171 



gcg cgc cat ccg tct ccc ctg get acc gec cac gcg gca acc ggt tct 

Ala Arg His Pro Ser Pro Leu Ala Thr Ala His Ala Ala Thr Gly Ser 
210 215 220 

ccc tgg ccg cag ttt eta ggc tgc aac cat ttt aaa ctt gcg aat gac 

Pro Trp Pro Gin Phe Leu Gly Cys Asn His Phe Lys Leu Ala Asn Asp 
225 230 235 240 

tat eta gtt cag aat egg agg ggt gec gtt gac tgg aat att aac tga 

Tyr Leu Val Gin Asn Arg Arg Gly Ala Val Asp Trp Asn lie Asn 
245 250 255 



<210> 89 
<211> 255 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 89 

Met Glu Gly Trp Leu Lys Thr lie Val Trp Ser Lys Met Ser Pro Glu 

15 10 15 

Val Leu Glu Glu Pro Ser Thr Gin Thr Leu Leu Leu Ser Asp Ser Trp 

20 25 30 

Leu Glu Phe Leu Asn Leu Ser Pro Phe Leu Lys Gin Lys Leu Ala Ala 

35 40 45 

Leu Leu Lys Arg Val Met Asp Met Ser Asn Val Thr Val He Tyr Pro 

50 55 60 

Pro He Asp Arg He Met Trp Trp Ser Tyr Cys Cys Glu Pro Glu Asp 
65 70 75 80 

He Lys Val Val He Leu Gly Gin Asp Pro Tyr His Arg Gly Gin Ala 

85 90 95 

Thr Gly Leu Ala Phe Ser Val Ala Pro Asp Tyr Ser He Pro Pro Ser 

100 105 HO 

Leu Lys Asn He Phe Lys Glu He Ala Asn Thr Val Pro Gly Phe Thr 

115 120 125 

Ala Pro Ser His Gly Cys Leu Asp Cys Trp Ala Lys Arg Gly Val Leu 

130 135 140 

Leu Leu Asn Thr He Leu Thr Val Glu Arg Gly Lys Ala Gly Ser His 
145 150 155 160 

Ala Asn Leu Gly Trp Asp Trp Phe Thr Ser Tyr He He Ser Cys Leu 

165 170 175 

Ser Ala Lys Leu Gin Arg Cys Val Phe Met Leu Trp Gly Arg Lys Ala 

180 185 190 

He Asp Lys Ala Thr Leu He Asn Gly Gin Arg His Leu Val Leu Lys 

195 200 205 

Ala Arg His Pro Ser Pro Leu Ala Thr Ala His Ala Ala Thr Gly Ser 

210 215 220 

Pro Trp Pro Gin Phe Leu Gly Cys Asn His Phe Lys Leu Ala Asn Asp 
225 230 235 240 

Tyr Leu Val Gin Asn Arg Arg Gly Ala Val Asp Trp Asn He Asn 
245 250 255 



<210> 90 

<211> 510 

<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 



<220> 
<221> CDS 



<222> (1) - - (510) 



<400> 90 

atg aga agt atg tat acc tta agt tta ttt att aca tgt gga ttt ttt 

Met Arg Ser Met Tyr Thr Leu Ser Leu Phe He Thr Cys Gly Phe Phe 

5 10 15 



1 



ttg cga tat aaa get gat cgc tct acc aca gaa ata tgt gta aat gga 
Leu Arg Tyr Lys Ala Asp Arg Ser Thr Thr Glu He Cys Val Asn Gly 



65 



70 75 80 



48 



96 



tta att aca tgt tgt acc ggg ctt gta gta aat cca tgt tgt aaa ata 

Leu He Thr Cys Cys Thr Gly Leu Val Val Asn Pro Cys Cys Lys He 

20 25 30 

ata cca ctt agt gac ttt ata ttt ccg gag ccg ttt gag att get tct 144 

He Pro Leu Ser Asp Phe He Phe Pro Glu Pro Phe Glu He Ala Ser 

35 40 45 

ttt cat tta aca aat tta get tta tgt cca ggg ctg tgt aca gca aca 192 

Phe His Leu Thr Asn Leu Ala Leu Cys Pro Gly Leu Cys Thr Ala Thr 

50 55 60 



240 



288 



ttt cac ctt agg gcg ttt ttt ata agg att ttg tac aag tta aat tac 

Phe His Leu Arg Ala Phe Phe He Arg He Leu Tyr Lys Leu Asn Tyr 
85 90 95 

agt gtc ccc aga gaa gag ctt caa ttg ttg aat tat atg caa tat agt 336 

Ser Val Pro Arg Glu Glu Leu Gin Leu Leu Asn Tyr Met Gin Tyr Ser 
100 105 HO 

ctt gat gag ttt tta gec gag ttt gaa gac ttt cac ata aac ggt agt 384 

Leu Asp Glu Phe Leu Ala Glu Phe Glu Asp Phe His He Asn Gly Ser 
115 120 125 

gaa age gga act get tat acg egg cca cct ctg ttg gat ttc tea gac 432 

Glu Ser Gly Thr Ala Tyr Thr Arg Pro Pro Leu Leu Asp Phe Ser Asp 

130 135 140 

aga agt aca aaa gtt tea cgt ata cgt aaa gta att acc aga cgc ggg 480 

Arg Ser Thr Lys Val Ser Arg He Arg Lys Val He Thr Arg Arg Gly 
145 * 150 155 160 

gat tta tgg agg gtt ggc tta aaa caa tag 510 

Asp Leu Trp Arg Val Gly Leu Lys Gin 

165 170 



<210> 91 
<211> 169 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 91 

Met Arg Ser Met Tyr Thr Leu Ser Leu Phe He Thr Cys Gly Phe Phe 

1 5 10 15 

Leu He Thr Cys Cys Thr Gly Leu Val Val Asn Pro Cys Cys Lys He 

20 25 30 

He Pro Leu Ser Asp Phe He Phe Pro Glu Pro Phe Glu He Ala Ser 

35 40 45 

Phe His Leu Thr Asn Leu Ala Leu Cys Pro Gly Leu Cys Thr Ala Thr 
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50 

Leu Arg Tyr Lys Ala 
65 

Phe His Leu Arg Ala 
85 

Ser Val Pro Arg Glu 
100 

Leu Asp Glu Phe Leu 
115 

Glu Ser Gly Thr Ala 
130 

Arg Ser Thr Lys Val 
145 

Asp Leu Trp Arg Val 
165 



55 

Asp Arg Ser Thr Thr Glu 
70 75 
Phe Phe lie Arg lie Leu 
90 

Glu Leu Gin Leu Leu Asn 
105 

Ala Glu Phe Glu Asp Phe 
120 

Tyr Thr Arg Pro Pro Leu 
135 

Ser Arg lie Arg Lys Val 
150 155 
Gly Leu Lys Gin 



60 

lie Cys Val Asn Gly 
80 

Tyr Lys Leu Asn Tyr 
95 

Tyr Met Gin Tyr Ser 
110 

His lie Asn Gly Ser 
125 

Leu Asp Phe Ser Asp 
140 

He Thr Arg Arg Gly 
160 



<210> 92 
<211> 1170 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) - - (1170) 

<400> 92 

atg get gtt tec att cca gtt aag ggt gtt aac agg gag aca gag tct 

Met Ala Val Ser He Pro Val Lys Gly Val Asn Arg Glu Thr Glu Ser 
1 5 10 15 

aac tgg aga tec ate gtg act ace ttt gaa cga cat gga aac gcg gat 
Asn Trp Arg Ser He Val Thr Thr Phe Glu Arg His Gly Asn Ala Asp 
20 25 30 

aga gec ate egg teg ctt ttg cga ttt ttt aaa ggt gtc gac cat cct 
Arg Ala He Arg Ser Leu Leu Arg Phe Phe Lys Gly Val Asp His Pro 
35 40 45 



ggg 



ttt tta gcg tct ttg gtt ata ctt aaa gat gta act ata gat tea 

Gly Phe Leu Ala Ser Leu Val He Leu Lys Asp Val Thr He Asp Ser 

50 55 60 

gag aaa acg ata gaa aga acg gac ctt att ccc ctt tta cag gga gtg 

Glu Lys Thr He Glu Arg Thr Asp Leu He Pro Leu Leu Gin Gly Val 



65 70 



75 80 



etc get ctg gaa ctg gcg tgc ggc tgt caa agg tgc gcg age gee gee 
Leu Ala Leu Glu Leu Ala Cys Gly Cys Gin Arg Cys Ala Ser Ala Ala 
115 120 125 

agg cag eta egg gec tgt cag caa gec tgc agg cca cct aag ctg aat 

174 



48 



96 



144 



192 



240 



288 



agg ttt gtt ace caa cag ata tat atg cac eta aaa gat cac gcg age 

Arg Phe Val Thr Gin Gin He Tyr Met His Leu Lys Asp His Ala Ser 
85 90 95 

gaa tct cct atg gee gag ata tgg cgc gac tgc aaa gag cgc ttc tgc 336 

Glu Ser Pro Met Ala Glu He Trp Arg Asp Cys Lys Glu Arg Phe Cys 
100 105 HO 



384 



432 
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528 



Arg Gin Leu Arg Ala Cys Gin Gin Ala Cys Arg Pro Pro Lys Leu Asn 

130 135 140 

cct cac aag cag cag tgt gtg gcc get cgt etc tta acc gca gtt tat 480 

Pro His Lys Gin Gin Cys Val Ala Ala Arg Leu Leu Thr Ala Val Tyr 

145 150 155 160 

aac cag atg gtg etc aga act agg gtt teg gtt teg gaa ttt tgt ctg 

Asn Gin Met Val Leu Arg Thr Arg Val Ser Val Ser Glu Phe Cys Leu 
165 170 175 

aac get ttg atg tgc gtt cct aga gag ttc ggg ttt gtg age ggg gat 576 

Asn Ala Leu Met Cys Val Pro Arg Glu Phe Gly Phe Val Ser Gly Asp 
180 185 190 

gtc egg gtg gaa acc age cgc gtt gcc tec tgt ttg aac ttg teg tgg 624 

Val Arg Val Glu Thr Ser Arg Val Ala Ser Cys Leu Asn Leu Ser Trp 

195 200 205 

tta tac ttg att tta gac teg tat gtt cga aca gat tta aca aat ctg 672 

Leu Tyr Leu lie Leu Asp Ser Tyr Val Arg Thr Asp Leu Thr Asn Leu 

210 215 220 

gaa atg gca atg age cgt gcc tgc cgc att cac ggc ctt age gcc ggg 720 

Glu Met Ala Met Ser Arg Ala Cys Arg lie His Gly Leu Ser Ala Gly 

225 230 235 240 

gac ccg ttt tat tec gcc etc gtg tgg tta aaa aat agt tac gca tgt 768 

Asp Pro Phe Tyr Ser Ala Leu Val Trp Leu Lys Asn Ser Tyr Ala Cys 
245 250 255 

gac acg aat aca ttt ttt ttc acc gtc aat tea acc agt gtc acg act 816 

Asp Thr Asn Thr Phe Phe Phe Thr Val Asn Ser Thr Ser Val Thr Thr 
260 265 270 

cca ate tta atg gac ata tgt gcg tea ttg acg ggt ccg gtt cct gat 864 

Pro lie Leu Met Asp lie Cys Ala Ser Leu Thr Gly Pro Val Pro Asp 

275 280 285 

gtt ata aaa ata aat atg ctg ccc etc gtg aac gat caa atg cat cct 912 

Val He Lys He Asn Met Leu Pro Leu Val Asn Asp Gin Met His Pro 

290 295 300 

tec gtt tgt gtg gag cgt gcc aat ttt acg ggg tct tgt ccc aaa gtg 960 

Ser Val Cys Val Glu Arg Ala Asn Phe Thr Gly Ser Cys Pro Lys Val 

305 310 315 320 

tea ccc act cac cat ttg gat ggg eta aaa tta gaa aca aca tec ttg 1008 

Ser Pro Thr His His Leu Asp Gly Leu Lys Leu Glu Thr Thr Ser Leu 
325 330 335 

aca tta gca get gat tea ctg gat gat att tta caa gcc ctg gaa tta 1056 

Thr Leu Ala Ala Asp Ser Leu Asp Asp He Leu Gin Ala Leu Glu Leu 
340 345 350 

att tgt ggc gat gac gag ggt ate ctg gac tea tat att tec gac att 1104 

He Cys Gly Asp Asp Glu Gly He Leu Asp Ser Tyr He Ser Asp He 

355 360 365 

aac acg gaa acc gag gtt gat gag tea tea att gaa gag gaa att gtc 1152 

Asn Thr Glu Thr Glu Val Asp Glu Ser Ser He Glu Glu Glu He Val 



175 
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370 



375 



380 



ttt gag gag ctt age taa 
Phe Glu Glu Leu Ser 
385 390 



<210> 93 
<211> 389 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 93 

Met Ala Val Ser lie Pro Val Lys Gly Val Asn Arg Glu Thr Glu Ser 

.15 10 15 

Asn Trp Arg Ser lie Val Thr Thr Phe Glu Arg His Gly Asn Ala Asp 

20 25 30 

Arg Ala He Arg Ser Leu Leu Arg Phe Phe Lys Gly Val Asp His Pro 

35 40 45 

Gly Phe Leu Ala Ser Leu Val He Leu Lys Asp Val Thr He Asp Ser 

50 55 60 

Glu Lys Thr He Glu Arg Thr Asp Leu He Pro Leu Leu Gin Gly Val 
65 70 75 80 

Arg Phe Val Thr Gin Gin He Tyr Met His Leu Lys Asp His Ala Ser 

85 90 95 

Glu Ser Pro Met Ala Glu He Trp Arg Asp Cys Lys Glu Arg Phe Cys 

100 105 110 

Leu Ala Leu Glu Leu Ala Cys Gly Cys Gin Arg Cys Ala Ser Ala Ala 

115 120 125 

Arg Gin Leu Arg Ala Cys Gin Gin Ala Cys Arg Pro Pro Lys Leu Asn 

130 135 140 

Pro His Lys Gin Gin Cys Val Ala Ala Arg Leu Leu Thr Ala Val Tyr 
145 150 155 160 

Asn Gin Met Val Leu Arg Thr Arg Val Ser Val Ser Glu Phe Cys Leu 

165 170 175 

Asn Ala Leu Met Cys Val Pro Arg Glu Phe Gly Phe Val Ser Gly Asp 

180 185 190 

Val Arg Val Glu Thr Ser Arg Val Ala Ser Cys Leu Asn Leu Ser Trp 

195 200 205 

Leu Tyr Leu He Leu Asp Ser Tyr Val Arg Thr Asp Leu Thr Asn Leu 

210 215 220 

Glu Met Ala Met Ser Arg Ala Cys Arg He His Gly Leu Ser Ala Gly 
225 230 235 240 

Asp Pro Phe Tyr Ser Ala Leu Val Trp Leu Lys Asn Ser Tyr Ala Cys 

245 250 255 

Asp Thr Asn Thr Phe Phe Phe Thr Val Asn Ser Thr Ser Val Thr Thr 

260 265 270 

Pro He Leu Met Asp He Cys Ala Ser Leu Thr Gly Pro Val Pro Asp 

275 280 285 

Val He Lys He Asn Met Leu Pro Leu Val Asn Asp Gin Met His Pro 

290 295 300 

Ser Val Cys Val Glu Arg Ala Asn Phe Thr Gly Ser Cys Pro Lys Val 
305 310 315 320 

Ser Pro Thr His His Leu Asp Gly Leu Lys Leu Glu Thr Thr Ser Leu 

325 330 335 

Thr Leu Ala Ala Asp Ser Leu Asp Asp He Leu Gin Ala Leu Glu Leu 

340 345 350 

He Cys Gly Asp Asp Glu Gly He Leu Asp Ser Tyr He Ser Asp He 

355 360 365 

Asn Thr Glu Thr Glu Val Asp Glu Ser Ser He Glu Glu Glu He Val 
370 375 380 



176 
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Phe Glu Glu Leu Ser 
385 



48 



<210> 94 
<211> 906 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 
<221> CDS 
<222> (1) - . (906) 

<400> 94 

atg agt cgc cat tac ggg aaa gac cat ctt tta aat cat atg tac aag 

Met Ser Arg His Tyr Gly Lys Asp His Leu Leu Asn His Met Tyr Lys 

1 5 10 15 

ttc cac tat ccg cca etc ggc atg ata gtc ggt gaa atg aac aca ctg 96 
Phe His Tyr Pro Pro Leu Gly Met He Val Gly Glu Met Asn Thr Leu 
20 25 30 

acc gtc aat gec cgc aat ccg etc tac caa gcg gca acc ctt aga gtg 144 
Thr Val Asn Ala Arg Asn Pro Leu Tyr Gin Ala Ala Thr Leu Arg Val 
35 40 45 

gaa egg gec eta tac ctg tct aag ata etc cag gtg etc atg caa cac 192 
Glu Arg Ala Leu Tyr Leu Ser Lys He Leu Gin Val Leu Met Gin His 
50 55 60 

aga caa gga gag egg ttc ate gta ccg cag tgt egg age aac atg gtt 
Arg Gin Gly Glu Arg Phe He Val Pro Gin Cys Arg Ser Asn Met Val 
65 70 75 80 

tac tgt eta aag gag etc cac aaa att acg aac gat aga att egg gga 
Tyr Cys Leu Lys Glu Leu His Lys He Thr Asn Asp Arg He Arg Gly 
85 90 95 

ctt att aac tea gta ctg ccc ctg gtg gat gcg ggc tgc gta gga ttt 336 
Leu He Asn Ser Val Leu Pro Leu Val Asp Ala Gly Cys Val Gly Phe 
100 105 HO 



240 



288 



384 



432 



480 



gac gaa gaa etc gta cgc gtc ctg ccg gag ate eta aaa ctg gag tac 

Asp Glu Glu Leu Val Arg Val Leu Pro Glu He Leu Lys Leu Glu Tyr 
115 120 125 

ccc cac gec cac gaa ttg tta ccc cct cac gat ccg acc tec ccc ctg 

Pro His Ala His Glu Leu Leu Pro Pro His Asp Pro Thr Ser Pro Leu 

130 135 140 

age tgg tgc eta tct cac atg gtc ggt gta acc aaa acc ttc aaa ggg 

Ser Trp Cys Leu Ser His Met Val Gly Val Thr Lys Thr Phe Lys Gly 

145 150 155 160 

gag gtg aag gag atg ata gac act ttt cac gac ctg tec gta cca teg 528 

Glu Val Lys Glu Met He Asp Thr Phe His Asp Leu Ser Val Pro Ser 
165 170 ' 175 

ttt cag tac ctg gcg agt ttg gtt aaa aaa ttt ttt ctt gtt gag gag 576 

Phe Gin Tyr Leu Ala Ser Leu Val Lys Lys Phe Phe Leu Val Glu Glu 



177 



180 



185 



190 



gtt ata tac gag gac tac cag gac acg cag ttt aac gtg ttt tta aat 

Val He Tyr Glu Asp Tyr Gin Asp Thr Gin Phe Asn Val Phe Leu Asn 

195 200 205 

ctt tgt ttt ttt tgg acc act gtc ata aag atg tac cag agt tgc att 

Leu Cys Phe Phe Trp Thr Thr Val He Lys Met Tyr Gin Ser Cys He 

210 215 220 

ttt aaa gac aag eta ttg gac acg att aaa get tgc ata gag ctt eta 

Phe L y S Asp Lys Leu Leu Asp Thr He Lys Ala Cys He Glu Leu Leu 

225 230 235 240 

aaa ggc gag gec agg cag ttt ttt ggt tgg tac gac eta aac acg cca 

Lys Gly Glu Ala Arg Gin Phe Phe Gly Trp Tyr Asp Leu Asn Thr Pro 
245 250 255 

aat tta ggt tea teg gca eta gta aag tac aca gag cac ctg ate cga 

Asn Leu Gly Ser Ser Ala Leu Val Lys Tyr Thr Glu His Leu He Arg 

260 265 270 

gca eta agt gtg gat tea tea gec att ccc att ggc gag ata tgc tec 

Ala Leu Ser Val Asp Ser Ser Ala He Pro He Gly Glu He Cys Ser 

275 280 285 

cac eta cac cac tgt aaa cac gee etc ctg aat ctt gaa taa 

His Leu His His Cys Lys His Ala Leu Leu Asn Leu Glu 

290 295 300 



<210> 95 
<211> 301 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 95 

Met Ser Arg His Tyr Gly Lys Asp His Leu Leu Asn His Met Tyr Lys 

15 10 15 

Phe His Tyr Pro Pro Leu Gly Met He Val Gly Glu Met Asn Thr Leu 

20 25 30 

Thr Val Asn Ala Arg Asn Pro Leu Tyr Gin Ala Ala Thr Leu Arg Val 

35 40 45 

Glu Arg Ala Leu Tyr Leu Ser Lys He Leu Gin Val Leu Met Gin His 

50 55 60 

Arg Gin Gly Glu Arg Phe He Val Pro Gin Cys Arg Ser Asn Met Val 
65 70 75 80 

Tyr Cys Leu Lys Glu Leu His Lys He Thr Asn Asp Arg He Arg Gly 

85 90 95 

Leu He Asn Ser Val Leu Pro Leu Val Asp Ala Gly Cys Val Gly Phe 

100 105 HO 

Asp Glu Glu Leu Val Arg Val Leu Pro Glu He Leu Lys Leu Glu Tyr 

115 120 125 

Pro His Ala His Glu Leu Leu Pro Pro His Asp Pro Thr Ser Pro Leu 

130 135 140 

Ser Trp Cys Leu Ser His Met Val Gly Val Thr Lys Thr Phe Lys Gly 
145 150 155 160 

Glu Val Lys Glu Met lie Asp Thr Phe His Asp Leu Ser Val Pro Ser 

165 170 175 

Phe Gin Tyr Leu Ala Ser Leu Val Lys Lys Phe Phe Leu Val Glu Glu 
180 185 190 

178 



Val lie Tyr Glu Asp Tyr Gin Asp 
195 200 
Leu Cys Phe Phe Trp Thr Thr Val 

210 215 
Phe Lys Asp Lys Leu Leu Asp Thr 
225 230 
Lys Gly Glu Ala Arg Gin Phe Phe 
245 

Asn Leu Gly Ser Ser Ala Leu Val 
260 

Ala Leu Ser Val Asp Ser Ser Ala 
275 280 
His Leu His His Cys Lys His Ala 
290 295 



Thr Gin Phe Asn Val Phe Leu Asn 
205 

lie Lys Met Tyr Gin Ser Cys lie 
220 

lie Lys Ala Cys lie Glu Leu Leu 
235 240 
Gly Trp Tyr Asp Leu Asn Thr Pro 

250 255 
Lys Tyr Thr Glu His Leu lie Arg 
265 270 
lie Pro lie Gly Glu lie Cys Ser 
285 

Leu Leu Asn Leu Glu 
300 



48 



<210> 96 
<211> 1545 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 
<221> CDS 
<222> (1) . . (1545) 

<400> 96 

atg gaa tgt gcc tec ctg ggt ccc ate tec ggg etc ate gcg gat ctg 

Met Glu Cys Ala Ser Leu Gly Pro lie Ser Gly Leu He Ala Asp Leu 
15 10 15 

aat etc ttt aac ctg ttc tgc etc tac egg ggt tec egg gtt aaa acc 96 
Asn Leu Phe Asn Leu Phe Cys Leu Tyr Arg Gly Ser Arg Val Lys Thr 
20 25 30 

cgc ggg gcc gcc acc tgt aac gtc ccg tgc gca gaa tgc gcc cag ggc 144 
Arg Gly Ala Ala Thr Cys Asn Val Pro Cys Ala Glu Cys Ala Gin Gly 
35 40 45 

ate gtg aga att etc acc gag agg gcc ctt tgt tgt acc gaa aag atg 192 
He Val Arg He Leu Thr Glu Arg Ala Leu Cys Cys Thr Glu Lys Met 
50 55 60 

ttc ata gcg tct gcc tgc age ggg gtc gtt ata ccg ccc cag ttg gcc 240 
Phe He Ala Ser Ala Cys Ser Gly Val Val He Pro Pro Gin Leu Ala 
65 70 75 80 

aga gtt ctg cac gac gtc tac gcg gaa atg aag gcc aag tgt ttg ggg 
Arg Val Leu His Asp Val Tyr Ala Glu Met Lys Ala Lys Cys Leu Gly 
85 90 95 

gcg tgg cgt cgt etc ata tgt tgc agg egg cct ate atg get att gcc 336 
Ala Trp Arg Arg Leu He Cys Cys Arg Arg Pro He Met Ala He Ala 
100 105 HO 

gac teg gtc etc gtc act tat aat acc ctg gat gcc gag gga aaa ctg 384 
Asp Ser Val Leu Val Thr Tyr Asn Thr Leu Asp Ala Glu Gly Lys Leu 
115 120 125 

gaa ctt aga etc aaa gca ctg tgc aaa ctt gtt ttt caa ccc ate ttt 432 
Glu Leu Arg Leu Lys Ala Leu Cys Lys Leu Val Phe Gin Pro He Phe 



288 



179 



130 135 140 

ctt caa aga ate tta gec cct atg cag eta ctg gec aac ggg aag atg 480 

Leu Gin Arg lie Leu Ala Pro Met Gin Leu Leu Ala Asn Gly Lys Met 

145 150 155 160 

gtt cct gac aat tat ttt acc ate acc ggt acg gec gag aag agg cgc 528 

Val Pro Asp Asn Tyr Phe Thr lie Thr Gly Thr Ala Glu Lys Arg Arg 

165 170 175 

cct gtc gtg act gga agt act age ggg atg acg tgt ccg gga age age 576 

Pro Val Val Thr Gly Ser Thr Ser Gly Met Thr Cys Pro Gly Ser Ser 

180 185 190 

ctt gtc ccc gat tec tta ate ctg cca gta tgc gaa ccg ggg ctt etc 624 

Leu Val Pro Asp Ser Leu lie Leu Pro Val Cys Glu Pro Gly Leu Leu 
195 200 205 

ccg gca ccc ctg gtt gac etc agt aat gtc tta gaa aat cca gaa ate 672 

Pro Ala Pro Leu Val Asp Leu Ser Asn Val Leu Glu Asn Pro Glu lie 

210 215 220 

ate etc age gec cca ccc ctg agt caa ttt gtc ate aca aac acg cac 720 

He Leu Ser Ala Pro Pro Leu Ser Gin Phe Val He Thr Asn Thr His 

225 230 235 240 

ccc agt ctg cct cag tea gtc age att att acg cca acc cag ggc gtt 768 

Pro Ser Leu Pro Gin Ser Val Ser lie He Thr Pro Thr Gin Gly Val 

245 250 255 

gtt ccc ggc caa tgt ttt atg gac acg tgg aaa gcg gtg tea cag age 816 

Val Pro Gly Gin Cys Phe Met Asp Thr Trp Lys Ala Val Ser Gin Ser 

260 265 270 

att cac cac cag gca cag acg cct att ttg gec gee gca eta acc ggt 864 

He His His Gin Ala Gin Thr Pro He Leu Ala Ala Ala Leu Thr Gly 
275 280 285 

teg aca tct gcg gee cct ggc ccg cat ate gca tgt tec cca gtt gee 912 

Ser Thr Ser Ala Ala Pro Gly Pro His He Ala Cys Ser Pro Val Ala 

290 295 300 

ggc acg tct egg cag gtg gaa ggg tec gcg ggc gtc gat tgc ggg aaa 960 

Gly Thr Ser Arg Gin Val Glu Gly Ser Ala Gly Val Asp Cys Gly Lys 

305 310 315 320 

cca gca tgc gtt ccg cag ccc gcg tta ccg ccc aat gtc ccc gee aag 1008 

Pro Ala Cys Val Pro Gin Pro Ala Leu Pro Pro Asn Val Pro Ala Lys 

325 330 335 

agg atg gaa acg gta gca cag ttg gga aac get ccc gta aaa aac gtc 1056 

Arg Met Glu Thr Val Ala Gin Leu Gly Asn Ala Pro Val Lys Asn Val 

340 345 350 

cac ate gga ggc cgc gta tac get cca ctg gtt aat ata cca ata ata 1104 

His He Gly Gly Arg Val Tyr Ala Pro Leu Val Asn He Pro lie He 
355 360 365 

gac tta acg tec ccg tea ggg tec ggc cag agt ccg gec gat ate gee 1152 

Asp Leu Thr Ser Pro Ser Gly Ser Gly Gin Ser Pro Ala Asp He Ala 

370 375 380 

180 



aac act cca gag tec cgc atg gcg gec ggc tct ccg ccc ttc gec gaa 

Asn Thr Pro Glu Ser Arg Met Ala Ala Gly Ser Pro Pro Phe Ala Glu 

385 390 395 400 

acc gec gca acg gtc ccc get aag aga aag cag cca cgc gag gac gtg 

Thr Ala Ala Thr Val Pro Ala Lys Arg Lys Gin Pro Arg Glu Asp Val 

405 410 415 

gca gac aaa aga ctg aag ggc gac gtt egg ggc gec gca aca gta aac 

Ala Asp Lys Arg Leu Lys Gly Asp Val Arg Gly Ala Ala Thr Val Asn 

420 425 430 

cac cct ttc ccg gga ccg tec ggg atg cgc gtt cgc gag cag ggc tta 

His Pro Phe Pro Gly Pro Ser Gly Met Arg Val Arg Glu Gin Gly Leu 

435 440 445 

ttc gat tta ate gaa age tec acg gat gta acc gcg aac gca tct gga 

Phe Asp Leu He Glu Ser Ser Thr Asp Val Thr Ala Asn Ala Ser Gly 

450 455 460 

ccc aaa aac gac gac gac atg eta gcg get ate eta cag gac ctg tat 

Pro Lys Asn Asp Asp Asp Met Leu Ala Ala He Leu Gin Asp Leu Tyr 

465 470 475 480 

gga eta cag tec ccc ccg gec ate gat tec ccc tec age aac teg gac 

Gly Leu Gin Ser Pro Pro Ala He Asp Ser Pro Ser Ser Asn Ser Asp 

485 490 495 

aat gag gag ata ttt cca gag gtt agt ccg cca tct age ggc cac gga 

Asn Glu Glu He Phe Pro Glu Val Ser Pro Pro Ser Ser Gly His Gly 

500 505 510 

teg cct tga 
Ser Pro 

515 



<210> 97 
<211> 514 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 97 

Met Glu Cys Ala Ser Leu Gly Pro He Ser Gly Leu He Ala Asp Leu 

15 10 15 

Asn Leu Phe Asn Leu Phe Cys Leu Tyr Arg Gly Ser Arg Val Lys Thr 

20 25 30 

Arg Gly Ala Ala Thr Cys Asn Val Pro Cys Ala Glu Cys Ala Gin Gly 

35 40 45 

He Val Arg He Leu Thr Glu Arg Ala Leu Cys Cys Thr Glu Lys Met 

50 55 60 

Phe lie Ala Ser Ala Cys Ser Gly Val Val He Pro Pro Gin Leu Ala 
65 70 75 80 

Arg Val Leu His Asp Val Tyr Ala Glu Met Lys Ala Lys Cys Leu Gly 

85 90 95 

Ala Trp Arg Arg Leu He Cys Cys Arg Arg Pro He Met Ala lie Ala 

100 105 HO 

Asp Ser Val Leu Val Thr Tyr Asn Thr Leu Asp Ala Glu Gly Lys Leu 

115 120 125 

Glu Leu Arg Leu Lys Ala Leu Cys Lys Leu Val Phe Gin Pro He Phe 

181 



130 

Leu Gin Arg lie 
145 

Val Pro Asp Asn 

Pro Val Val Thr 
180 

Leu Val Pro Asp 
195 

Pro Ala Pro Leu 
210 

He Leu Ser Ala 

225 

Pro Ser Leu Pro 

Val Pro Gly Gin 
260 

He His His Gin 
275 

Ser Thr Ser Ala 
290 

Gly Thr Ser Arg 
305 

Pro Ala Cys Val 

Arg Met Glu Thr 
340 

His He Gly Gly 
355 

Asp Leu Thr Ser 
370 

Asn Thr Pro Glu 
385 

Thr Ala Ala Thr 

Ala Asp Lys Arg 
420 

His Pro Phe Pro 
435 

Phe Asp Leu He 
450 

Pro Lys Asn Asp 
465 

Gly Leu Gin Ser 

Asn Glu Glu He 
500 

Ser Pro 



135 

Leu Ala Pro Net 
150 

Tyr Phe Thr He 
165 

Gly Ser Thr Ser 

Ser Leu He Leu 
200 

Val Asp Leu Ser 
215 

Pro Pro Leu Ser 
230 

Gin Ser Val Ser 
245 

Cys Phe Met Asp 

Ala Gin Thr Pro 
280 

Ala Pro Gly Pro 
295 

Gin Val Glu Gly 
310 

Pro Gin Pro Ala 

325 

Val Ala Gin Leu 

Arg Val Tyr Ala 
360 

Pro Ser Gly Ser 
375 

Ser Arg Met Ala 
390 

Val Pro Ala Lys 
405 

Leu Lys Gly Asp 

Gly Pro Ser Gly 
440 

Glu Ser Ser Thr 
455 

Asp Asp Met Leu 
470 

Pro Pro Ala He 
485 

Phe Pro Glu Val 



140 

Gin Leu Leu Ala 
155 

Thr Gly Thr Ala 
170 

Gly Met Thr Cys 
185 

Pro Val Cys Glu 

Asn Val Leu Glu 
220 

Gin Phe Val He 
235 

He He Thr Pro 
250 

Thr Trp Lys Ala 
265 

He Leu Ala Ala 

His He Ala Cys 
300 

Ser Ala Gly Val 
315 

Leu Pro Pro Asn 
330 

Gly Asn Ala Pro 
345 

Pro Leu Val Asn 

Gly Gin Ser Pro 
380 

Ala Gly Ser Pro 
395 

Arg Lys Gin Pro 
410 

Val Arg Gly Ala 
425 

Met Arg Val Arg 

Asp Val Thr Ala 
460 

Ala Ala He Leu 
475 

Asp Ser Pro Ser 
490 

Ser Pro Pro Ser 
505 



Asn Gly Lys Met 
160 

Glu Lys Arg Arg 
175 

Pro Gly Ser Ser 
190 

Pro Gly Leu Leu 
205 

Asn Pro Glu He 

Thr Asn Thr His 
240 

Thr Gin Gly Val 
255 

Val Ser Gin Ser 
270 

Ala Leu Thr Gly 
285 

Ser Pro Val Ala 

Asp Cys Gly Lys 
320 

Val Pro Ala Lys 
335 

Val Lys Asn Val 
350 

He Pro He He 
365 

Ala Asp He Ala 

Pro Phe Ala Glu 
400 

Arg Glu Asp Val 
415 

Ala Thr Val Asn 
430 

Glu Gin Gly Leu 
445 

Asn Ala Ser Gly 

Gin Asp Leu Tyr 
480 

Ser Asn Ser Asp 
495 

Ser Gly His Gly 
510 



<210> 98 

<211> 534 

<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (534) 

<400> 98 



182 



^ JL MS %S 13 



,M C;H M 



atg cct cgt gtg aaa aca caa ccc aag aga ccc caa gtg ctt gaa ttt 

Met Pro Arg Val Lys Thr Gin Pro Lys Arg Pro Gin Val Leu Glu Phe 

15 10 15 

atg cca tta gat etc cac ggt gga aca cac acg gag atg gat tct caa 

Met Pro Leu Asp Leu His Gly Gly Thr His Thr Glu Met Asp Ser Gin 

20 25 30 



aat aga eta etc aag gtg agt gcg egg ctg teg gac aaa aca aag age 
Asn Arg Leu Leu Lys Val Ser Ala Arg Leu Ser Asp Lys Thr Lys Ser 
145 150 155 160 

gee ctt cgc age cat cct tgc ttg cgt tgc tat tct ttg atg ttt aac 
Ala Leu Arg Ser His Pro Cys Leu Arg Cys Tyr Ser Leu Met Phe Asn 
165 170 175 

acg taa 
Thr 



48 



96 



aac ctg tgt cct gac ggc cag gat ctg etc ggg tct tat ate tat acg 144 

Asn Leu Cys Pro Asp Gly Gin Asp Leu Leu Gly Ser Tyr He Tyr Thr 

35 40 45 

gag aat aac ggc ccg ttt tec caa ata atg cac aat gga cag age aat 192 

Glu Asn Asn Gly Pro Phe Ser Gin He Met His Asn Gly Gin Ser Asn 

50 55 60 

acc ggg aca ggt gaa age ttc ggc age tac get gee ggc gac ggt ttt 24 0 

Thr Gly Thr Gly Glu Ser Phe Gly Ser Tyr Ala Ala Gly Asp Gly Phe 

65 70 75 80 

ctg ggc ggt tct gtg tea ggg atg tat gga aac aac acc gga gag ggc 

Leu Gly Gly Ser Val Ser Gly Met Tyr Gly Asn Asn Thr Gly Glu Gly 

85 90 95 

gcg tgt tct aaa aga ccg tec gcg tgc cgt aaa cgc teg get gca eta 

Ala Cys Ser Lys Arg Pro Ser Ala Cys Arg Lys Arg Ser Ala Ala Leu 

100 105 HO 

att cac gcg gcg tec gag gcg tct gtg gee gag caa ggc acc tea cag 

He His Ala Ala Ser Glu Ala Ser Val Ala Glu Gin Gly Thr Ser Gin 

115 120 125 

ggg gca cat gec gta tct gac egg ata ggc aga gac ggt ggc get gac 432 

Gly Ala His Ala Val Ser Asp Arg He Gly Arg Asp Gly Gly Ala Asp 

130 135 140 



288 



336 



384 



480 



528 



534 



<210> 99 
<211> 177 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 99 

Met Pro Arg Val Lys Thr Gin Pro Lys Arg Pro Gin Val Leu Glu Phe 

15 10 15 

Met Pro Leu Asp Leu His Gly Gly Thr His Thr Glu Met Asp Ser Gin 

20 25 30 

Asn Leu Cys Pro Asp Gly Gin Asp Leu Leu Gly Ser Tyr He Tyr Thr 

35 40 45 

Glu Asn Asn Gly Pro Phe Ser Gin lie Met His Asn Gly Gin Ser Asn 



183 
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50 55 60 

Thr Gly Thr Gly Glu Ser Phe Gly Ser Tyr Ala Ala Gly Asp Gly Phe 
65 70 75 80 

Leu Gly Gly Ser Val Ser Gly Met Tyr Gly Asn Asn Thr Gly Glu Gly 

85 90 95 

Ala Cys Ser Lys Arg Pro Ser Ala Cys Arg Lys Arg Ser Ala Ala Leu 

100 105 110 

He His Ala Ala Ser Glu Ala Ser Val Ala Glu Gin Gly Thr Ser Gin 

115 120 125 

Gly Ala His Ala Val Ser Asp Arg He Gly Arg Asp Gly Gly Ala Asp 

130 135 140 

Asn Arg Leu Leu Lys Val Ser Ala Arg Leu Ser Asp Lys Thr Lys Ser 
145 150 155 160 

Ala Leu Arg Ser His Pro Cys Leu Arg Cys Tyr Ser Leu Met Phe Asn 
165 170 175 

Thr 



<210> 100 
<211> 693 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) - - (693) 

<400> 100 

atg gga ttt ggg aac ata cgt ctg gga tgg agg tta tgc ttc atg gtc 4 8 

Met Gly Phe Gly Asn He Arg Leu Gly Trp Arg Leu Cys Phe Met Val 
15 10 15 

tgg gtg gcg tgg att gca egg gga egg teg gtg tgc cca acc tgg cac 96 
Trp Val Ala Trp He Ala Arg Gly Arg Ser Val Cys Pro Thr Trp His 
20 25 30 

ctg aca gat ggg aaa tac gag gcg gta tac agg cac tac etc gaa gag 144 
Leu Thr Asp Gly Lys Tyr Glu Ala Val Tyr Arg His Tyr Leu Glu Glu 
35 40 45 

tgc cgc aaa cat gaa ggc teg ggg age ctg gac ggt tec gga cag aca 192 
Cys Arg Lys His Glu Gly Ser Gly Ser Leu Asp Gly Ser Gly Gin Thr 
50 55 60 

aag ggg tct gga acc aaa gca acc acc gaa get aat ata teg ata aga 240 
Lys Gly Ser Gly Thr Lys Ala Thr Thr Glu Ala Asn He Ser He Arg 
65 70 75 80 

cct aac gtt gtc aca tea ggt caa aat aaa gag ccg cct ggg aca gca 288 
Pro Asn Val Val Thr Ser Gly Gin Asn Lys Glu Pro Pro Gly Thr Ala 
85 90 95 

ccg agg gec gaa tea tea cac gac ctg cca cgc ate aag cag gtt aac 336 
Pro Arg Ala Glu Ser Ser His Asp Leu Pro Arg He Lys Gin Val Asn 
100 105 HO 

get etc cga tta tea acc ccg gaa ttg gcg caa cca etc ccg gta gta 384 
Ala Leu Arg Leu Ser Thr Pro Glu Leu Ala Gin Pro Leu Pro Val Val 
115 120 125 



184 
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aaa teg act ccg cgc gag tea cag tea ggt ggg aca ccc tgg aac gcg 432 
Lys Ser Thr Pro Arg Glu Ser Gin Ser Gly Gly Thr Pro Trp Asn Ala 
130 135 140 



cgc ccc cac gcg ttc att atg cac aca aac gac atg etc aac cca tct 480 

Arg Pro His Ala Phe lie Met His Thr Asn Asp Met Leu Asn Pro Ser 

145 150 155 160 

gtg gtc ctg tct ttc aga gec ate cgt gcg egg tec aca cgc gat ace 528 

Val Val Leu Ser Phe Arg Ala lie Arg Ala Arg Ser Thr Arg Asp Thr 

165 170 175 

gag cag tec gtt cgc gat egg aac acg gtc acg acc age tat cgt ace 576 

Glu Gin Ser Val Arg Asp Arg Asn Thr Val Thr Thr Ser Tyr Arg Thr 

180 185 190 

cct ggc cgc cct tec etc ttt caa gec aga ccc teg tct cac ggt gcg 624 

Pro Gly Arg Pro Ser Leu Phe Gin Ala Arg Pro Ser Ser His Gly Ala 

195 200 205 

cgt eta ccc cct teg ccc cga acg atg gca aga tac gec gag teg cga 672 

Arg Leu Pro Pro Ser Pro Arg Thr Met Ala Arg Tyr Ala Glu Ser Arg 

210 ' 215 220 

aca ata tgc gac caa aat tga 693 
Thr lie Cys Asp Gin Asn 
225 230 



<210> 101 
<211> 230 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 



Met 


Gly 


Phe 


Gly 


Asn 


lie 


Arg 


Leu 


Gly 


Trp 


Arg 


Leu 


Cys 


Phe 


Met 


Val 


1 








5 










10 










15 




Trp 


Val 


Ala 


Trp 


lie 


Ala 


Arg 


Gly 


Arg 


Ser 


Val 


Cys 


Pro 


Thr 


Trp 


His 








20 










25 










30 






Leu 


Thr 


Asp 


Gly 


Lys 


Tyr 


Glu 


Ala 


Val 


Tyr 


Arg 


His 


Tyr 


Leu 


Glu 


Glu 






35 










40 










45 








Cys 


Arg 


Lys 


His 


Glu 


Gly 


Ser 


Gly 


Ser 


Leu 


Asp 


Gly 


Ser 


Gly 


Gin 


Thr 




50 










55 










60 










Lys 


Gly 


Ser 


Gly 


Thr 


Lys 


Ala 


Thr 


Thr 


Glu 


Ala 


Asn 


He 


Ser 


He 


Arg 


65 










70 










75 










80 


Pro 


Asn 


Val 


Val 


Thr 


Ser 


Gly 


Gin 


Asn 


Lys 


Glu 


Pro 


Pro 


Gly 


Thr 


Ala 










85 










90 










95 




Pro 


Arg 


Ala 


Glu 


Ser 


Ser 


His 


Asp 


Leu 


Pro 


Arg 


He 


Lys 


Gin 


Val 


Asn 








100 










105 










110 






Ala 


Leu 


Arg 


Leu 


Ser 


Thr 


Pro 


Glu 


Leu 


Ala 


Gin 


Pro 


Leu 


Pro 


Val 


Val 






115 










120 










125 








Lys 


Ser 


Thr 


Pro 


Arg 


Glu 


Ser 


Gin 


Ser 


Gly 


Gly 


Thr 


Pro 


Trp 


Asn 


Ala 




130 










135 










140 










Arg 


Pro 


His 


Ala 


Phe 


He 


Met 


His 


Thr 


Asn 


Asp 


Met 


Leu 


Asn 


Pro 


Ser 


145 










150 










155 










160 


Val 


Val 


Leu 


Ser 


Phe 


Arg 


Ala 


He 


Arg 


Ala 


Arg 


Ser 


Thr 


Arg 


Asp 


Thr 










165 










170 










175 




Glu 


Gin 


Ser 


Val 


Arg 


Asp 


Arg 


Asn 


Thr 


Val 


Thr 


Thr 


Ser 


Tyr 


Arg 


Thr 








180 










185 










190 






Pro 


Gly 


Arg 


Pro 


Ser 


Leu 


Phe 


Gin 


Ala 


Arg 


Pro 


Ser 


Ser 


His 


Gly 


Ala 



195 200 205 



185 
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Arg Leu Pro Pro Ser Pro Arg Thr Met Ala Arg Tyr Ala Glu Ser Arg 

210 215 220 

Thr lie Cys Asp Gin Asn 
225 230 



<210> 102 
<211> 420 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 
<2 21> CDS 
<222> (1) - . (420) 

<400> 102 

atg tct tec acg cgt ccc aag acg cgc gec ccc aaa aaa gag ctg aca 

Met Ser Ser Thr Arg Pro Lys Thr Arg Ala Pro Lys Lys Glu Leu Thr 

1 5 10 15 

atg gag gag tta gec get cag gtg caa aaa ttg tec gtt gaa aac aag 
Met Glu Glu Leu Ala Ala Gin Val Gin Lys Leu Ser Val Glu Asn Lys 
20 25 30 

cag etc aaa aag ctg ata aat tct ggg gat ccc acc agg tec ggc tec 
Gin Leu Lys Lys Leu lie Asn Ser Gly Asp Pro Thr Arg Ser Gly Ser 
35 40 45 

gac ccc gtc ate tct aac acc gaa aag gag gec aaa ate gca get gec 192 
Asp Pro Val He Ser Asn Thr Glu Lys Glu Ala Lys He Ala Ala Ala 
50 55 60 



gtt teg gee tta tgt aac gtg gca acc cgt aag att gaa get aag gtg 
Val Ser Ala Leu Cys Asn Val Ala Thr Arg Lys He Glu Ala Lys Val 



65 



70 75 80 



gec cag tec agg acc aga aat aac gac gcg gac tag 
Ala Gin Ser Arg Thr Arg Asn Asn Asp Ala Asp 

130 135 140 



48 



96 



144 



240 



288 



agg gcg gec acg gee aag gec gtg acg cgc ggg cag atg gaa gac gee 

Arg Ala Ala Thr Ala Lys Ala Val Thr Arg Gly Gin Met Glu Asp Ala 
85 90 95 

etc gec gga att agt ate cgc gtg gac gtg tea atg gac gag acc acc 336 

Leu Ala Gly He Ser He Arg Val Asp Val Ser Met Asp Glu Thr Thr 
100 105 110 

aga ggc gga ate get gee age gcg gac ggg gcg etc agg agg agg cgc 3 84 

Arg Gly Gly He Ala Ala Ser Ala Asp Gly Ala Leu Arg Arg Arg Arg 

115 120 125 



420 



<210> 103 
<211> 139 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 103 

Met Ser Ser Thr Arg Pro Lys Thr Arg Ala Pro Lys Lys Glu Leu Thr 

186 
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1 5 10 15 

Met Glu Glu Leu Ala Ala Gin Val Gin Lys Leu Ser Val Glu Asn Lys 

20 25 30 

Gin Leu Lys Lys Leu lie Asn Ser Gly Asp Pro Thr Arg Ser Gly Ser 

35 40 45 

Asp Pro Val He Ser Asn Thr Glu Lys Glu Ala Lys He Ala Ala Ala 

50 55 60 

Val Ser Ala Leu Cys Asn Val Ala Thr Arg Lys He Glu Ala Lys Val 
65 70 75. 80 

Arg Ala Ala Thr Ala Lys Ala Val Thr Arg Gly Gin Met Glu Asp Ala 

85 90 95 

Leu Ala Gly He Ser He Arg Val Asp Val Ser Met Asp Glu Thr Thr 

100 105 HO 

Arg Gly Gly He Ala Ala Ser Ala Asp Gly Ala Leu Arg Arg Arg Arg 

115 120 125 

Ala Gin Ser Arg Thr Arg Asn Asn Asp Ala Asp 
130 135 



<210> 104 

<211> 315 

<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) - - (315) 

<400> 104 

atg aca ggg tea att gtg etc gcg tta gec ctg ttg gcg tgc tta tat 48 

Met Thr Gly Ser He Val Leu Ala Leu Ala Leu Leu Ala Cys Leu Tyr 
1 - 5 10 15 



ctg tgt ctg ccg gtg tgc gec acc gtc acc acc tec teg act aca gga 

Leu Cys Leu Pro Val Cys Ala Thr Val Thr Thr Ser Ser Thr Thr Gly 

20 25 30 

acg ggc acg cct cct gtg act aca aac ccc tec gcg get ccc tec gta 

Thr Gly Thr Pro Pro Val Thr Thr Asn Pro Ser Ala Ala Pro Ser Val 

35 40 45 

acg ccc tct ttt tat gac tat gat tgc teg gcg gac acg tac cag ccc 

Thr Pro Ser Phe Tyr Asp Tyr Asp Cys Ser Ala Asp Thr Tyr Gin Pro 

50 55 60 

gtg ctg tct agt ttt teg agt ata tgg gcg gtg ate aac tct gtt ctg 

Val Leu Ser Ser Phe Ser Ser He Trp Ala Val He Asn Ser Val Leu 

65 70 75 80 

gtc gcg gtt gca aca ttt ctg tac eta aca tat atg tgc ttt ttt aaa 

Val Ala Val Ala Thr Phe Leu Tyr Leu Thr Tyr Met Cys Phe Phe Lys 

85 90 95 



96 



144 



192 



240 



288 



ttt gtt gaa acc gtg gcg cac gag tga 315 
Phe Val Glu Thr Val Ala His Glu 

100 105 



<210> 105 
<211> 104 



187 



<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 



<400> 105 

Met Thr Gly Ser He Val Leu Ala Leu Ala Leu Leu Ala Cys Leu Tyr 

15 10 15 

Leu Cys Leu Pro Val Cys Ala Thr Val Thr Thr Ser Ser Thr Thr Gly 

20 25 30 

Thr Gly Thr Pro Pro Val Thr Thr Asn Pro Ser Ala Ala Pro Ser Val 

35 40 45 

Thr Pro Ser Phe Tyr Asp Tyr Asp Cys Ser Ala Asp Thr Tyr Gin Pro 

50 55 60 

Val Leu Ser Ser Phe Ser Ser He Trp Ala Val He Asn Ser Val Leu 
65 70 75 80 

Val Ala Val Ala Thr Phe Leu Tyr Leu Thr Tyr Met Cys Phe Phe Lys 

85 90 95 

Phe Val Glu Thr Val Ala His Glu 
100 



<210> 106 
<211> 873 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (873) 

<400> 106 

atg gca gaa gtg acc gcc cat acg gtt ccg tac gcc ttt gac tec tgt 

Met Ala Glu Val Thr Ala His Thr Val Pro Tyr Ala Phe Asp Ser Cys 
15 10 15 

aag ttt gaa ata ata cca aaa aac aac tea tct cgt ata gcg tta cgc 
Lys Phe Glu He He Pro Lys Asn Asn Ser Ser Arg He Ala Leu Arg 
20 25 30 

aac aaa ttt ccc gtc gtg gtc aaa ccg gga gaa cct eta gtc gtg ccc 
Asn Lys Phe Pro Val Val Val Lys Pro Gly Glu Pro Leu Val Val Pro 
35 40 45 

ctg gga eta aaa att ate cgc gcg ccc cag tgc gca ttc ttt etc age 
Leu Gly Leu Lys He He Arg Ala Pro Gin Cys Ala Phe Phe Leu Ser 
50 55 60 

gga gcg ccg acg gat gag gtg tat tac cac acc gga ctt ata gat caa 
Gly Ala Pro Thr Asp Glu Val Tyr Tyr His Thr Gly Leu He Asp Gin 
65 70 75 80 

ggg tat cgt gga gaa ata aaa etc ate gtt etc aat aag act aaa cag 
Gly Tyr Arg Gly Glu He Lys Leu He Val Leu Asn Lys Thr Lys Gin 
85 90 95 

gtc gtc aca ctt tac egg gga gaa gtc aac gtc tea etc ate gcg ttt 
Val Val Thr Leu Tyr Arg Gly Glu Val Asn Val Ser Leu He Ala Phe 
100 105 110 

atg tac gcc teg ccc ggg ccc eta aaa tgc ccg ata ctg aac eta cca 
Met Tyr Ala Ser Pro Gly Pro Leu Lys Cys Pro He Leu Asn Leu Pro 
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115 120 125 

cat tab tct ctg gac get gga ttt gat gtt acg teg cca cat gec atg 432 

His Tyr Ser Leu Asp Ala Gly Phe Asp Val Thr Ser Pro His Ala Met 
130 135 140 

acc ata ccc cca ace gac aga acc ccg ttt acg ttg agt ttg tat tat 480 

Thr lie Pro Pro Thr Asp Arg Thr Pro Phe Thr Leu Ser Leu Tyr Tyr 

145 150 155 160 



aag agt cca cag etc age acc ccc cac gtc cca eta ate gtg ggg cgc 
Lys Ser Pro Gin Leu Ser Thr Pro His Val Pro Leu lie Val Gly Arg 
165 170 175 



gac cac gtc cca age ggg tgg aac att ctt aga tec cgc gta caa etc 
Asp His Val Pro Ser Gly Trp Asn lie Leu Arg Ser Arg Val Gin Leu 
225 230 235 240 

ggc age acc etc cag ata teg tgg gec aaa ate agg ttt acc gac gtg 
Gly Ser Thr Leu Gin He Ser Trp Ala Lys He Arg Phe Thr Asp Val 
245 250 255 

gec acg etc ccc aaa acc cac ccg etc aac tec cgc cac act caa age 
Ala Thr Leu Pro Lys Thr His Pro Leu Asn Ser Arg His Thr Gin Ser 
260 265 270 

caa acc gaa ccg gag acc gec cgc ggc gca aag ggg ttg ggg teg tea 
Gin Thr Glu Pro Glu Thr Ala Arg Gly Ala Lys Gly Leu Gly Ser Ser 
275 280 ■ 285 

ggg tta taa 
Gly Leu 
290 



<210> 107 
<211> 290 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 107 

Met Ala Glu Val Thr Ala His Thr Val Pro Tyr Ala Phe Asp Ser Cys 

1 5 10 15 

Lys Phe Glu He He Pro Lys Asn Asn Ser Ser Arg He Ala Leu Arg 

20 25 30 

Asn Lys Phe Pro Val Val Val Lys Pro Gly Glu Pro Leu Val Val Pro 

35 40 45 

Leu Gly Leu Lys He He Arg Ala Pro Gin Cys Ala Phe Phe Leu Ser 
50 55 60 



528 



tct gga ctg gcg aca aag ggg ctt acc gtc gac gcg acc aaa tgg acc 576 
Ser Gly Leu Ala Thr Lys Gly Leu Thr Val Asp Ala Thr Lys Trp Thr 
180 185 190 



624 



cag teg ttg gta cat ctg agg ttt tat aac ttt acc aaa gag cca ata 

Gin Ser Leu Val His Leu Arg Phe Tyr Asn Phe Thr Lys Glu Pro He 

195 200 205 

gat ata cca gca aac age cgc ata tgt cag gta gtg ttt ate cac gaa 672 

Asp He Pro Ala Asn Ser Arg He Cys Gin Val Val Phe He His Glu 

210 215 220 



720 



768 



816 



864 



873 



189 



Gly Ala Pro Thr Asp Glu Val Tyr Tyr His Thr Gly Leu lie Asp Gin 
65 70 75 80 

Gly Tyr Arg Gly Glu lie Lys Leu lie Val Leu Asn Lys Thr Lys Gin 

85 90 95 

Val Val Thr Leu Tyr Arg Gly Glu Val Asn Val Ser Leu He Ala Phe 

100 105 110 

Met Tyr Ala Ser Pro Gly Pro Leu Lys Cys Pro He Leu Asn Leu Pro 

115 120 125 

His Tyr Ser Leu Asp Ala Gly Phe Asp Val Thr Ser Pro His Ala Met 

130 135 140 

Thr He Pro Pro Thr Asp Arg Thr Pro Phe Thr Leu Ser Leu Tyr Tyr 
145 150 155 160 

Lys Ser Pro Gin Leu Ser Thr Pro His Val Pro Leu He Val Gly Arg 

165 170 175 

Ser Gly Leu Ala Thr Lys Gly Leu Thr Val Asp Ala Thr Lys Trp Thr 

180 185 190 

Gin Ser Leu Val His Leu Arg Phe Tyr Asn Phe Thr Lys Glu Pro He 

195 200 205 

Asp lie Pro Ala Asn Ser Arg He Cys Gin Val Val Phe He His Glu 

210 215 220 

Asp His Val Pro Ser Gly Trp Asn He Leu Arg Ser Arg Val Gin Leu 
225 230 235 240 

Gly Ser Thr Leu Gin He Ser Trp Ala Lys He Arg Phe Thr Asp Val 

245 250 255 

Ala Thr Leu Pro Lys Thr His Pro Leu Asn Ser Arg His Thr Gin Ser 

260 265 270 

Gin Thr Glu Pro Glu Thr Ala Arg Gly Ala Lys Gly Leu Gly Ser Ser 
275 280 285 

Gly Leu 
290 



<210> 108 
<211> 633 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (633) 

<400> 108 

atg gcc gcg cct ggt teg ttt tgg acg tgt tgc ggg ttt agt ccg ttt 

Met Ala Ala Pro Gly Ser Phe Trp Thr Cys Cys Gly Phe Ser Pro Phe 
15 10 15 

gga cgc gtg gga tgt caa tat cgt ccc ctg cca gat cct ctt aac gag 
Gly Arg Val Gly Cys Gin Tyr Arg Pro Leu Pro Asp Pro Leu Asn Glu 
20 25 30 

tgt cct act cac tgg egg ace gaa ata gcc atg ggt ctg ccc ccg gga 
Cys Pro Thr His Trp Arg Thr Glu lie Ala Met Gly Leu Pro Pro Gly 
35 40 45 

gtg gat atg gga gat gtg aag cag gca gag atg tgc acc gcc gcc ctt 
Val Asp Met Gly Asp Val Lys Gin Ala Glu Met Cys Thr Ala Ala Leu 
50 55 60 

egg cag acg tat ctg ttg gcc gtt caa tec aat aaa ata acc gag tat 
Arg Gin Thr Tyr Leu Leu Ala Val Gin Ser Asn Lys He Thr Glu Tyr 
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65 70 75 80 

ctt cgt cgt ttt gac gcg get agg gta ccc gcg ggg tgc cag gag acg 288 
Leu Arg Arg Phe Asp Ala Ala Arg Val Pro Ala Gly Cys Gin Glu Thr 
85 90 95 

gtt agg att caa ata tct aag eta aag tct ata cag aac gtt ate tgg 336 
Val Arg lie Gin lie Ser Lys Leu Lys Ser lie Gin Asn Val lie Trp 
100 105 110 

aac gec atg ttg tec ctg get ata ggg gat att acg gtg gac gag age 384 
Asn Ala Met Leu Ser Leu Ala lie Gly Asp lie Thr Val Asp Glu Ser 
115 120 125 

gee ttt cac gcg ctt ttg aat aaa egg gee gac gag acc gtg tea etc 432 
Ala Phe His Ala Leu Leu Asn Lys Arg Ala Asp Glu Thr Val Ser Leu 
130 135 140 

ctg gaa atg gaa aaa ctg get acg acg ate gcg tec gat gat tct gtc 480 
Leu Glu Met Glu Lys Leu Ala Thr Thr lie Ala Ser Asp Asp Ser Val 
145 150 155 160 

acg tgg get gcg gaa att aat aat gtg ctg gta gac acc gaa gcg tec 528 
Thr Trp Ala Ala Glu lie Asn Asn Val Leu Val Asp Thr Glu Ala Ser 
165 170 175 

tec aac ccc tec cat ccg gtg att cga caa ccg acg ccc cag ttg gcg 576 
Ser Asn Pro Ser His Pro Val lie Arg Gin Pro Thr Pro Gin Leu Ala 
180 185 190 

gtg gcg gat aat ate gtg cca gac aaa ata ata cag gat gcg caa get 624 
Val Ala Asp Asn lie Val Pro Asp Lys lie lie Gin Asp Ala Gin Ala 
195 200 205 

gat ggt taa 633 
Asp Gly 
210 



<210> 109 

<211> 210 

<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 



<400> 109 
Met Ala Ala Pro 
1 

Gly Arg Val Gly 
20 

Cys Pro Thr His 
35 

Val Asp Met Gly 
50 

Arg Gin Thr Tyr 
65 

Leu Arg Arg Phe 

Val Arg lie Gin 
100 

Asn Ala Met Leu 
115 



Gly Ser Phe Trp 
5 

Cys Gin Tyr Arg 

Trp Arg Thr Glu 
40 

Asp Val Lys Gin 
55 

Leu Leu Ala Val 
70 

Asp Ala Ala Arg 
85 

lie Ser Lys Leu 

Ser Leu Ala lie 
120 



Thr Cys Cys Gly 
10 

Pro Leu Pro Asp 
25 

lie Ala Met Gly 

Ala Glu Met Cys 
60 

Gin Ser Asn Lys 
75 

Val Pro Ala Gly 
90 

Lys Ser lie Gin 
105 

Gly Asp lie Thr 



Phe Ser Pro Phe 
15 

Pro Leu Asn Glu 
30 

Leu Pro Pro Gly 
45 

Thr Ala Ala Leu 

lie Thr Glu Tyr 
80 

Cys Gin Glu Thr 
95 

Asn Val lie Trp 
110 

Val Asp Glu Ser 
125 



191 



Ala Phe His Ala Leu Leu Asn Lys Arg Ala Asp Glu Thr Val Ser Leu 

130 135 140 

Leu Glu Met Glu Lys Leu Ala Thr Thr lie Ala Ser Asp Asp Ser Val 
145 150 155 160 

Thr Trp Ala Ala Glu He Asn Asn Val Leu Val Asp Thr Glu Ala Ser 

165 170 175 

Ser Asn Pro Ser His Pro Val He Arg Gin Pro Thr Pro Gin Leu Ala 

180 185 190 

Val Ala Asp Asn He Val Pro Asp Lys He He Gin Asp Ala Gin Ala 
195 200 205 

Asp Gly 
210 



<210> 110 
<211> 2487 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . {2487) 

<400> 110 

atg gta gat gaa att agg gca att ttc tct act agt gga gat atg gcc 48 

Met Val Asp Glu He Arg Ala He Phe Ser Thr Ser Gly Asp Met Ala 
1 5 10 15 

gaa gta att acg gat ata ctg act gaa acg caa gca acg gcg tec ttc 96 
Glu Val He Thr Asp He Leu Thr Glu Thr Gin Ala Thr Ala Ser Phe 
20 25 30 

ttc tgc gtg etc cac gat egg ggc gac gcg cct ata aat act cca cat 144 
Phe Cys Val Leu His Asp Arg Gly Asp Ala Pro He Asn Thr Pro His 
35 40 45 

gcc gta att aaa etc tgc ctg ccc gcc aag cgc cca ggc ggc ggg cca 192 
Ala Val He Lys Leu Cys Leu Pro Ala Lys Arg Pro Gly Gly Gly Pro 
50 55 60 

agg tgt tta ccg ttg atg gtg ctg aac eta ccg gcg tgg cag gtt aat 240 
Arg Cys Leu Pro Leu Met Val Leu Asn Leu Pro Ala Trp Gin Val Asn 
65 70 75 80 

eta ttc eta aca ggt gac gca cca ttg acc teg gat aac att aaa gac 288 
Leu Phe Leu Thr Gly Asp Ala Pro Leu Thr Ser Asp Asn He Lys Asp 
85 90 95 

cgc att gac ctg get cag acc gag gaa ata etc gaa ccc ata tta age 336 
Arg He Asp Leu Ala Gin Thr Glu Glu He Leu Glu Pro He Leu Ser 
100 105 HO 

gta ctg gca tgc aaa egg tec gcg cag cag acc aaa cat gac teg ttt 384 
Val Leu Ala Cys Lys Arg Ser Ala Gin Gin Thr Lys His Asp Ser Phe 
115 120 125 

aaa tct aag gtg gcc tgg ttc agg gca aag ttc gtc teg gcc eta aga 432 
Lys Ser Lys Val Ala Trp Phe Arg Ala Lys Phe Val Ser Ala Leu Arg 
130 135 140 



192 
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aaa gtg tac aaa atg acc cca tec ccc tat tgg atg ata acg ctg ctg 
Lys Val Tyr Lys Met Thr Pro Ser Pro Tyr Trp Met lie Thr Leu Leu 



480 



145 



150 155 160 



ggc teg ttc gag gca teg ttt gta ctg gee ggc acg ttt tat ttt ttc 528 

Gly Ser Phe Glu Ala Ser Phe Val Leu Ala Gly Thr Phe Tyr Phe Phe 

165 170 175 

cag tct cat ata tgc acg gcg gag acg ctg gtg cat eta acg agg tta 576 

Gin Ser His lie Cys Thr Ala Glu Thr Leu Val His Leu Thr Arg Leu 
180 185 190 

ttc age tec agt cag ggc cag age ttg gtc acc gta aac acc tat gac 624 

Phe Ser Ser Ser Gin Gly Gin Ser Leu Val Thr Val Asn Thr Tyr Asp 

195 200 205 

gaa tta ggg cgc gta ttc ggg egg tea gat ttc ctt gga att gtt cca 672 

Glu Leu Gly Arg Val Phe Gly Arg Ser Asp Phe Leu Gly lie Val Pro 

210 215 220 



aat ttt tgg gcg tat eta aaa tac aaa atg caa cag gac gac gtg gag 
Asn Phe Trp Ala Tyr Leu Lys Tyr Lys Met Gin Gin Asp Asp Val Glu 



225 



230 235 240 



720 



tec agg gee ate gat caa aca ate aac tec ata cgt ggg ggg ttg atg 768 
Ser Arg Ala lie Asp Gin Thr lie Asn Ser He Arg Gly Gly Leu Met 
245 250 255 



816 



864 



ctg tea cct cag gac etc gta cac ttc ate tac ctg tea ttt tac gag 
Leu Ser Pro Gin Asp Leu Val His Phe He Tyr Leu Ser Phe Tyr Glu 
260 265 270 

tgc atg aac gca cag acg ttc ctg teg tac tct cgt acc acg tec agt 
Cys Met Asn Ala Gin Thr Phe Leu Ser Tyr Ser Arg Thr Thr Ser Ser 
275 280 285 

tta ccg acc ccc gca acg gtt aac cct ccg cag ctg tgt egg cgc tta 912 
Leu Pro Thr Pro Ala Thr Val Asn Pro Pro Gin Leu Cys Arg Arg Leu 
290 295 300 

gaa gcg gat ttt aag gag cac gtg atg gca tat tac aat aaa gca age 960 
Glu Ala Asp Phe Lys Glu His Val Met Ala Tyr Tyr Asn Lys Ala Ser 
305 310 315 320 

tac ctg agt act tac ata acc att tta acc gta ccc get cct etc ccg 1008 
Tyr Leu Ser Thr Tyr He Thr He Leu Thr Val Pro Ala Pro Leu Pro 
325 330 335 

gac gga tac gaa aac ttt cag gaa eta gca tgt caa tac tgg tgt gga 1056 
Asp Gly Tyr Glu Asn Phe Gin Glu Leu Ala Cys Gin Tyr Trp Cys Gly 
340 345 350 

caa teg aga gac gtg gcg gaa ate atg act aga att aat gac cag tat 1104 
Gin Ser Arg Asp Val Ala Glu He Met Thr Arg He Asn Asp Gin Tyr 
355 360 365 

cca cag ctg aat eta aca aaa gat tta tec ggt etc ctt gac eta gcg 1152 
Pro Gin Leu Asn Leu Thr Lys Asp Leu Ser Gly Leu Leu Asp Leu Ala 
370 375 380 

gec eta gat cag tat tec gga ggc ccc aag gaa aac ctt ttt acg gtg 
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Ala Leu Asp Gin Tyr Ser Gly Gly Pro Lys Glu Asn Leu Phe Thr Val 
385 390 395 400 

get teg aga att ccc aca tac agg tgc gag ttt ttg aat aaa cag tat 
Ala Ser Arg He Pro Thr Tyr Arg Cys Glu Phe Leu Asn Lys Gin Tyr 
405 410 415 

ttt gtt eta atg cac gca gac tgt ata gac gcg tac tgg aaa caa aac 
Phe Val Leu Met His Ala Asp Cys He Asp Ala Tyr Trp Lys Gin Asn 
420 425 430 

att ate gtg ccc gaa gac gee caa ttg caa ggc ctg acg gat caa gat 
He He Val Pro Glu Asp Ala Gin Leu Gin Gly Leu Thr Asp Gin Asp 
435 440 445 

etc ace tec agg ata ttt tac tgc gac etc ggc eta tct eta cct aca 
Leu Thr Ser Arg He Phe Tyr Cys Asp Leu Gly Leu Ser Leu Pro Thr 
450 455 460 

ttt aaa caa caa att eta gtt tea cgc cac gag tat ttc aac cca cga 
Phe Lys Gin Gin He Leu Val Ser Arg His Glu Tyr Phe Asn Pro Arg 
465 470 475 480 

etc cca gtt tat aga tgg gtg ttg gat -ttt gat eta aag gta acc gaa 
Leu Pro Val Tyr Arg Trp Val Leu Asp Phe Asp Leu Lys Val Thr Glu 
485 490 495 

ggt aga egg act eta aac gat ate tac aac ata tgc gta acg ctg egg 
Gly Arg Arg Thr Leu Asn Asp He Tyr Asn He Cys Val Thr Leu Arg 
500 505 510 

cag gtc ata tta gaa acg ttg cag tta ata ggt cca eta aaa ccg aac 
Gin Val He Leu Glu Thr Leu Gin Leu He Gly Pro Leu Lys Pro Asn 
515 520 525 

cac ccc gtg tat ttt ttt aag teg gec tgt cca gee gta acc tgg ccg 
His Pro Val Tyr Phe Phe Lys Ser Ala Cys Pro Ala Val Thr Trp Pro 
530 535 540 



1248 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



1632 



gat gat ate tea gac acc gcg ttc tgt cac tgt gac gca aaa ata ggg 
Asp Asp He Ser Asp Thr Ala Phe Cys His Cys Asp Ala Lys He Gly 
545 550 555 560 



1680 



atg aga att gta acc ccg ttt ccg age gga tac tgt ttg gtg ggg tec 
Met Arg He Val Thr Pro Phe Pro Ser Gly Tyr Cys Leu Val Gly Ser 
565 570 575 



1728 



get cca etc gtg tec ctg acc gac att tta aac cgc gtg gtt aaa ctg 
Ala Pro Leu Val Ser Leu Thr Asp He Leu Asn Arg Val Val Lys Leu 
580 585 590 

gac acg egg eta gee tea gag tat cct ggg ate tta gaa gat aaa gga 
Asp Thr Arg Leu Ala Ser Glu Tyr Pro Gly He Leu Glu Asp Lys Gly 
595 600 605 

cca ttt gac tct ggc att tac gee aag ggg cga tgc gtg cgc gta cct 
Pro Phe Asp Ser Gly He Tyr Ala Lys Gly Arg Cys Val Arg Val Pro 
610 615 620 

cac tgc tat aag gtt ggc ccg ggg ggc gag ctg tea egg etc eta aaa 
His Cys Tyr Lys Val Gly Pro Gly Gly Glu Leu Ser Arg Leu Leu Lys 

194 



1776 



1824 



1872 



1920 



625 



630 



635 



640 



att att ate tgt cac ccc gaa gag tea gat aaa teg gcg tat ttg aaa 1968 

lie lie lie Cys His Pro Glu Glu Ser Asp Lys Ser Ala Tyr Leu Lys 
645 f 650 655 

aac gcg ttt aag gtt tct aat ctg eta cat cac gec cct gga gac tct 2016 

Asn Ala Phe Lys Val Ser Asn Leu Leu His His Ala Pro Gly Asp Ser 
660 665 670 

gtc acc aaa aac ggc cac ctg gtg tac gcg ate act gac gaa aac gag 2064 

Val Thr Lys Asn Gly His Leu Val Tyr Ala He Thr Asp Glu Asn Glu 

675 680 685 

ggc ttt eta gaa age aaa acc aag aat aat ctt ccc aaa aca ate acg 2112 

Gly Phe Leu Glu Ser Lys Thr Lys Asn Asn Leu Pro Lys Thr He Thr 

690 695 700 

gac ctg gec gaa aaa att gag cga acc aca gaa aaa cca eta att gat 2160 

Asp Leu Ala Glu Lys He Glu Arg Thr Thr Glu Lys Pro Leu He Asp 

705 710 715 720 

tgg gca gcg acc gca gtg tgg cca aaa eta cac gat acc ata cag cgt 2208 

Trp Ala Ala Thr Ala Val Trp Pro Lys Leu His Asp Thr He Gin Arg 
725 730 735 

ttc ttt ccg gat gac cgc ate ggc caa ttt gca tct gtg age ttc atg 2256 

Phe Phe Pro Asp Asp Arg He Gly Gin Phe Ala Ser Val Ser Phe Met 
740 745 750 

cac tec gga gac aac att ata caa gtt aaa cca caa aaa gga aac aac 2304 

His Ser Gly Asp Asn He He Gin Val Lys Pro Gin Lys Gly Asn Asn 

755 760 765 

ttt ttc tgt att aat cat aag cac cga aac cac act cag aca gtc cgt 2352 

Phe Phe Cys He Asn His Lys His Arg Asn His Thr Gin Thr Val Arg 

770 775 780 



gta ttt tta acc eta cat tec aca aaa gaa age gag gtc acg gtg acc 

Val Phe Leu Thr Leu His Ser Thr Lys Glu Ser Glu Val Thr Val Thr 

785 790 795 800 

ttt atg agt cag tgc ttt get gca aaa tgt aat cat aat age cca act 

Phe Met Ser Gin Cys Phe Ala Ala Lys Cys Asn His Asn Ser Pro Thr 

805 810 815 

gca cat ttt tea ttt atg gta ccc ata acc ggc acg taa 

Ala His Phe Ser Phe Met Val Pro He Thr Gly Thr 

820 825 



2400 



2448 



2487 



<210> 111 

<211> 828 

<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 



<400> 111 

Met Val Asp Glu He Arg Ala He Phe Ser Thr Ser Gly Asp Met Ala 

1 5 10 15 

Glu Val He Thr Asp He Leu Thr Glu Thr Gin Ala Thr Ala Ser Phe 
20 25 30 



195 



Phe Cys Val Leu His Asp Arg Gly Asp Ala Pro lie Asn Thr Pro His 

35 40 45 

Ala Val He Lys Leu Cys Leu Pro Ala Lys Arg Pro Gly Gly Gly Pro 

50 55 60 

Arg Cys Leu Pro Leu Met Val Leu Asn Leu Pro Ala Trp Gin Val Asn 
65 70 75 80 

Leu Phe Leu Thr Gly Asp Ala Pro Leu Thr Ser Asp Asn lie Lys Asp 

85 90 95 

Arg He Asp Leu Ala Gin Thr Glu Glu He Leu Glu Pro He Leu Ser 

100 105 110 

Val Leu Ala Cys Lys Arg Ser Ala Gin Gin Thr Lys His Asp Ser Phe 

115 120 125 

Lys Ser Lys Val Ala Trp Phe Arg Ala Lys Phe Val Ser Ala Leu Arg 

130 135 140 

Lys Val Tyr Lys Met Thr Pro Ser Pro Tyr Trp Met He Thr Leu Leu 
145 150 155 160 

Gly Ser Phe Glu Ala Ser Phe Val Leu Ala Gly Thr Phe Tyr Phe Phe 

165 170 175 

Gin Ser His He Cys Thr Ala Glu Thr Leu Val His Leu Thr Arg Leu 

180 185 190 

Phe Ser Ser Ser Gin Gly Gin Ser Leu Val Thr Val Asn Thr Tyr Asp 

195 200 205 

Glu Leu Gly Arg Val Phe Gly Arg Ser Asp Phe Leu Gly He Val Pro 

210 215 220 

Asn Phe Trp Ala Tyr Leu Lys Tyr Lys Met Gin Gin Asp Asp Val Glu 
225 230 235 240 

Ser Arg Ala He Asp Gin Thr lie Asn Ser He Arg Gly Gly Leu Met 

245 250 255 

Leu Ser Pro Gin Asp Leu Val His Phe lie Tyr Leu Ser Phe Tyr Glu 

260 265 270 

Cys Met Asn Ala Gin Thr Phe Leu Ser Tyr Ser Arg Thr Thr Ser Ser 

275 280 285 

Leu Pro Thr Pro Ala Thr Val Asn Pro Pro Gin Leu Cys Arg Arg Leu 

290 295 300 

Glu Ala Asp Phe Lys Glu His Val Met Ala Tyr Tyr Asn Lys Ala Ser 
305 310 315 320 

Tyr Leu Ser Thr Tyr He Thr He Leu Thr Val Pro Ala Pro Leu Pro 

325 330 335 

Asp Gly Tyr Glu Asn Phe Gin Glu Leu Ala Cys Gin Tyr Trp Cys Gly 

340 345 350 

Gin Ser Arg Asp Val Ala Glu He Met Thr Arg He Asn Asp Gin Tyr 

355 360 365 

Pro Gin Leu Asn Leu Thr Lys Asp Leu Ser Gly Leu Leu Asp Leu Ala 

370 375 380 

Ala Leu Asp Gin Tyr Ser Gly Gly Pro Lys Glu Asn Leu Phe Thr Val 
385 390 395 400 

Ala Ser Arg He Pro Thr Tyr Arg Cys Glu Phe Leu Asn Lys Gin Tyr 

405 410 415 

Phe Val Leu Met His Ala Asp Cys lie Asp Ala Tyr Trp Lys Gin Asn 

420 425 430 

He He Val Pro Glu Asp Ala Gin Leu Gin Gly Leu Thr Asp Gin Asp 

435 440 445 

Leu Thr Ser Arg He Phe Tyr Cys Asp Leu Gly Leu Ser Leu Pro Thr 

450 455 460 

Phe Lys Gin Gin He Leu Val Ser Arg His Glu Tyr Phe Asn Pro Arg 
465 470 475 480 

Leu Pro Val Tyr Arg Trp Val Leu Asp Phe Asp Leu Lys Val Thr Glu 

485 490 495 

Gly Arg Arg Thr Leu Asn Asp He Tyr Asn He Cys Val Thr Leu Arg 

500 505 510 

Gin Val lie Leu Glu Thr Leu Gin Leu He Gly Pro Leu Lys Pro Asn 
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<210> 112 
<211> 1329 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) - - (1329) 

<400> 112 

atg egg tat gtg ttt cac gca eta att tgt ttt ata ggg ggc ata tea 48 

Met Arg Tyr Val Phe His Ala Leu He Cys Phe He Gly Gly He Ser 
15 10 15 

teg teg gac ttc gac gac teg tec teg gac gaa atg gac gac ctg tec 96 
Ser Ser Asp Phe Asp Asp Ser Ser Ser Asp Glu Met Asp Asp Leu Ser 
20 25 30 



cca acg ccg gag cca gaa ccg tec aca acg ccg aac age ttt ccg gag 

197 



144 



Pro Thr Pro Glu Pro Glu Pro Ser Thr Thr Pro Asn Ser Phe Pro Glu 
35 40 _ 45 



ggg ccc aaa tea caa gtg gtg gcg tta cct aaa ate cgt aaa aga tct 192 
Gly Pro Lys Ser Gin Val Val Ala Leu Pro Lys lie Arg Lys Arg Ser 
50 55 60 



gac tat cgc gga aaa gec get tta acg egg age att aaa gag tct att 
Asp Tyr Arg Gly Lys Ala Ala Leu Thr Arg Ser lie Lys Glu Ser He 



145 



150 155 160 



gtg gee aaa aaa gtt age ctg get aaa eta aca age eta tac aaa, ccc 

Val Ala Lys Lys Val Ser Leu Ala Lys Leu Thr Ser Leu Tyr Lys Pro 
225 230 235 240 

ctg etc acg ttt gtg acg ggg aga aac aac cag gee cac tgg ttg gcg 

Leu Leu Thr Phe Val Thr Gly Arg Asn Asn Gin Ala His Trp Leu Ala 
245 250 255 

acg cgc aaa aac acg eta gcg tct gcg gga etc gag get etc gcg gee 

Thr Arg Lys Asn Thr Leu Ala Ser Ala Gly Leu Glu Ala Leu Ala Ala 

260 265 ■ 270 

ttt ate gag gaa ggt ctg gcg tgg gee cag gtg tgc gta tct caa aac 

Phe He Glu Glu Gly Leu Ala Trp Ala Gin Val Cys Val Ser Gin Asn 

198 



288 



336 



cga tct gag acg ccg gta aaa att gag cac aga tct cca ctt aac cgc 240 

Arg Ser Glu Thr Pro Val Lys He Glu His Arg Ser Pro Leu Asn Arg 

65 70 75 80 

teg cga tct cga tec aga acg egg tec ggg tec ggt caa cga tea aac 

Ser Arg Ser Arg Ser Arg Thr Arg Ser Gly Ser Gly Gin Arg Ser Asn 

85 90 95 

cag tea ggc aga tac gtc aag aga ttc aaa cca acg gtt gat gca ccc 

Gin Ser Gly Arg Tyr Val Lys Arg Phe Lys Pro Thr Val Asp Ala Pro 

100 105 HO 

cgt cat cga gaa ccg tgg cac agg ggc ggt aag gga aag gec ccg ttt 3 84 

Arg His Arg Glu Pro Trp His Arg Gly Gly Lys Gly Lys Ala Pro Phe 
115 120 125 

ate cgc aga gac gca atg get ggc cgc ggt cga cgc aca tac ggc cac 432 

He Arg Arg Asp Ala Met Ala Gly Arg Gly Arg Arg Thr Tyr Gly His 

130 135 140 



480 



528 



aaa aag atg cac ctt cca tec acc atg etc tct cgt gcg cac gat aaa 

Lys Lys Met His Leu Pro Ser Thr Met Leu Ser Arg Ala His Asp Lys 
165 170 175 

aag gta ttc gag gga ctg ttg cca cga cac ctg gga cag tgc ttt cag 576 

Lys Val Phe Glu Gly Leu Leu Pro Arg His Leu Gly Gin Cys Phe Gin 
180 185 190 

gtg tgc ctg ccg gcg ccg cca ccg ctg caa ccc gag gtg ttc acg gat 624 

Val Cys Leu Pro Ala Pro Pro Pro Leu Gin Pro Glu Val Phe Thr Asp 
195 200 205 

cga cag ctt acc get ata gtc aag tct ggc ggg cgc aga gac gcg ctg 672 
Arg Gin Leu Thr Ala He Val Lys Ser Gly Gly Arg Arg Asp Ala Leu 

210 215 220 



720 



768 



816 



864 



275 



280 



285 



aga teg ttg aac gac age aac eta gat ate att eta gac agt agt cag 

Arg Ser Leu Asn Asp Ser Asn Leu Asp lie lie Leu Asp Ser Ser Gin 
290 295 300 

age gtg tgc aec tgg ttc ate tec aaa ate agg cat ttg cac ata cag 

Ser Val Cys Thr Trp Phe* He Ser Lys He Arg His Leu His He Gin 

305 310 315 320 

tgc ttc ttg gag aat cag ggt gag gtt age ctg gtg aag cag eta act 

Cys Phe Leu Glu Asn Gin Gly Glu Val Ser Leu Val Lys Gin Leu Thr 
325 330 335 

tac etc gtg tgt ata aat aac cgc ctg gcg gag gcg gcg aac ctg gcg 

Tyr Leu Val Cys He Asn Asn Arg Leu Ala Glu Ala Ala Asn Leu Ala 
340 345 350 

gga gag gtg aaa ctg aat ttt aaa etc gga atg ctg att ggg ttc gee 

Gly Glu Val Lys Leu Asn Phe Lys Leu Gly Met Leu He Gly Phe Ala 
355 360 365 

eta acc ttg ccg gcg etc ctt gca gag cat aag ttg tct ggc gag age 

Leu Thr Leu Pro Ala Leu Leu Ala Glu His Lys Leu Ser Gly Glu Ser 
370 375 380 

ctg tac ctg ttt agg tct ttc ctg gag aaa tat aga ccg gga gat gtc 

Leu Tyr Leu Phe Arg Ser Phe Leu Glu Lys Tyr Arg Pro Gly Asp Val 

385 390 395 400 

atg gga etc tta aat tea ata gtg gtg gag cac tac acc aaa tgt cgt 

Met Gly Leu Leu Asn Ser He Val Val Glu His Tyr Thr Lys Cys Arg 
405 410 415 

age gcg gag tgt gtc att aca acc cac gee atg gta gga tct ggc gaa 

Ser Ala Glu Cys Val He Thr Thr His Ala Met Val Gly Ser Gly Glu 
420 425 430 

aac aac aag gga ctt ttt ttc ttt cca gtg taa 
Asn Asn Lys Gly Leu Phe Phe Phe Pro Val 
435 440 



<210> 113 
<211> 442 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 113 

Met Arg Tyr Val Phe His Ala Leu He Cys Phe He Gly Gly He Ser 

1 5 10 15 

Ser Ser Asp Phe Asp Asp Ser Ser Ser Asp Glu Met Asp Asp Leu Ser 

20 25 30 

Pro Thr Pro Glu Pro Glu Pro Ser Thr Thr Pro Asn Ser Phe Pro Glu 

35 40 45 

Gly Pro Lys Ser Gin Val Val Ala Leu Pro Lys lie Arg Lys Arg Ser 

50 55 60 

Arg Ser Glu Thr Pro Val Lys He Glu His Arg Ser Pro Leu Asn Arg 
65 70 75 80 

Ser Arg Ser Arg Ser Arg Thr Arg Ser Gly Ser Gly Gin Arg Ser Asn 
85 90 95 



Gin Ser Gly Arg Tyr Val Lys Arg Phe Lys Pro Thr Val Asp Ala Pro 

100 105 HO 

Arg His Arg Glu Pro Trp His Arg Gly Gly Lys Gly Lys Ala Pro Phe 

115 120 125 

lie Arg Arg Asp Ala Met Ala Gly Arg Gly Arg Arg Thr Tyr Gly His 

130 135 140 

Asp Tyr Arg Gly Lys Ala Ala Leu Thr Arg Ser lie Lys Glu Ser He 
145 150 155 160 

Lys Lys Met His Leu Pro Ser Thr Met Leu Ser Arg Ala His Asp Lys 

165 170 175 

Lys Val Phe Glu Gly Leu Leu Pro Arg His Leu Gly Gin Cys Phe Gin 

180 185 190 

Val Cys Leu Pro Ala Pro Pro Pro Leu Gin Pro Glu Val Phe Thr Asp 

195 200 205 

Arg Gin Leu Thr Ala He Val Lys Ser Gly Gly Arg Arg Asp Ala Leu 

210 215 220 

Val Ala Lys Lys Val Ser Leu Ala Lys Leu Thr Ser Leu Tyr Lys Pro 
225 230 235 240 

Leu Leu Thr Phe Val Thr Gly Arg Asn Asn Gin Ala His Trp Leu Ala 

245 250 255 

Thr Arg Lys Asn Thr Leu Ala Ser Ala Gly Leu Glu Ala Leu Ala Ala 

260 265 270 

Phe He Glu Glu Gly Leu Ala Trp Ala Gin Val Cys Val Ser Gin Asn 

275 280 285 

Arg Ser Leu Asn Asp Ser Asn Leu Asp He He Leu Asp Ser Ser Gin 

290 295 300 

Ser Val Cys Thr Trp Phe He Ser Lys He Arg His Leu His He Gin 
305 310 315 320 

Cys Phe Leu Glu Asn Gin Gly Glu Val Ser Leu Val Lys Gin Leu Thr 

325 330 335 

Tyr Leu Val Cys He Asn Asn Arg Leu Ala Glu Ala Ala Asn Leu Ala 

340 345 350 

Gly Glu Val Lys Leu Asn Phe Lys Leu Gly Met Leu He Gly Phe Ala 

355 360 365 

Leu Thr Leu Pro Ala Leu Leu Ala Glu His Lys Leu Ser Gly Glu Ser 

370 375 380 

Leu Tyr Leu Phe Arg Ser Phe Leu Glu Lys Tyr Arg Pro Gly Asp Val 
385 390 395 400 

Met Gly Leu Leu Asn Ser He Val Val Glu His Tyr Thr Lys Cys Arg 

405 410 415 

Ser Ala Glu Cys Val He Thr Thr His Ala Met Val Gly Ser Gly Glu 

420 425 430 

Asn Asn Lys Gly Leu Phe Phe Phe Pro Val 
435 440 



<210> 114 
<211> 1248 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . - (1248) 

<400> 114 

atg get acc tgg cgc cca cct cag age ggt ggt ccg tec gec atg gga 

Met Ala Thr Trp Arg Pro Pro Gin Ser Gly Gly Pro Ser Ala Met Gly 
1 5 10 15 



200 



ctt cga gaa tgg ate gtt act cac gca aat ctt gca acg tat teg gga 96 

Leu Arg Glu Trp lie Val Thr His Ala Asn Leu Ala Thr Tyr Ser Gly 
20 25 30 

ctg ttt tgg gca gac gac gaa aaa acc agg gtc gtg ctg gcg act acg 144 

Leu Phe Trp Ala Asp Asp Glu Lys Thr Arg Val Val Leu Ala Thr Thr 
35 40 45 

acc ccg tgg tct gtg gga ttt gat tac ctt egg gat ggg aag atg tac 192 

Thr Pro Trp Ser Val Gly Phe Asp Tyr Leu Arg Asp Gly Lys Met Tyr 
50 55 60 

gag gac tac tgc aac cag aga aac ate ccc ctt ccg tec ggg agg tct 24 0 

Glu Asp Tyr Cys Asn Gin Arg Asn He Pro Leu Pro Ser Gly Arg Ser 

65 70 75 80 

agg ctg ggc cag gec aag gcg cgc eta ttg ggt gcg ata aga aag agt 288 

Arg Leu Gly Gin Ala Lys Ala Arg Leu Leu Gly Ala He Arg Lys Ser 
85 90 95 

gec tat ttc ata gaa gaa aag gac gtt ttg aga egg agt ttt tea ttt 336 

Ala Tyr Phe He Glu Glu Lys Asp Val Leu Arg Arg Ser Phe Ser Phe 
100 105 110 

gca aac gtg gtc ttc agg ttg cgc tec gat gaa gag atg tta tgc agg 384 

Ala Asn Val Val Phe Arg Leu Arg Ser Asp Glu Glu Met Leu Cys Arg 
115 120 125 

ttg tgc cct egg gee tct ggg gta gcg gca gaa ctg egg ggc ttg agg 432 

Leu Cys Pro Arg Ala Ser Gly Val Ala Ala Glu Leu Arg Gly Leu Arg 
130 135 140 

ttt egg atg ttt aag cgc aag ggc gec gac gag gcg ggc agg gta tct 480 

Phe Arg Met Phe Lys Arg Lys Gly Ala Asp Glu Ala Gly Arg Val Ser 

145 150 155 160 



gaa tac acg gta aag cag ctg etc ggg eta ctt cgt acc cgt ccc gec 
Glu Tyr Thr Val Lys Gin Leu Leu Gly Leu Leu Arg Thr Arg Pro Ala 
165 170 175 



528 



ggt acc ttc acc atg acc gec ccg gec acc gag gcg tct gca acc gca 576 

Gly Thr Phe Thr Met Thr Ala Pro Ala Thr Glu Ala Ser Ala Thr Ala 
180 185 190 

acc gec tec ggt gaa gac ggt agg cag gac aat agt caa ggg ggc gcg 624 

Thr Ala Ser Gly Glu Asp Gly Arg Gin Asp Asn Ser Gin Gly Gly Ala 
195 200 205 

gtt gcg ttg ccc ggg gaa cac gee ctg ccg eta tct gca teg age ggt 672 

Val Ala Leu Pro Gly Glu His Ala Leu Pro Leu Ser Ala Ser Ser Gly 
210 215 220 

ctt tct gcg tgt ttg gcg cca tct gtt gac gac ccg tgg gga ttt atg 720 

Leu Ser Ala Cys Leu Ala Pro Ser Val Asp Asp Pro Trp Gly Phe Met 
225 230 235 240 

cat att cag gtt tat tat tat ggg gtc ctt caa gca cag act ttt acc 768 

His He Gin Val Tyr Tyr Tyr Gly Val Leu Gin Ala Gin Thr Phe Thr 

245 250 255 

cat tct ggg atg ggc gtt cgt tta tec acg aga cca acg gat aaa aat 816 



201 
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His Ser Gly Met Gly Val Arg Leu Ser Thr Arg Pro Thr Asp Lys Asn 
260 265 270 

gaa cac cac gtc tgc atg get ccc ggg ccg ctt cag ttg tgg ttg ccc 864 

Glu His His Val Cys Met Ala Pro Gly Pro Leu Gin Leu Trp Leu Pro 

275 280 285 

ccg gcg cct tat atg gat gac gat ttt atg ttg age aga ctt gtg aac 912 

Pro Ala Pro Tyr Met Asp Asp Asp Phe Met Leu Ser Arg Leu Val Asn 

290 295 300 

gcg tta cac gec ctg gag gac ggc att gtt ttg tgc agt tgt cag tat . 960 

Ala Leu His Ala Leu Glu Asp Gly lie Val Leu Cys Ser Cys Gin Tyr 

305 310 315 320 

ggg att atg atg aat ggc tat gga ttt ctt aat tta tgg ttc cgt ggc 1008 

Gly lie Met Met Asn Gly Tyr Gly Phe Leu Asn Leu Trp Phe Arg Gly 
325 330 335 

aat acc tea aac acc tct gaa cca aga cgc gtg cca tct ggg gtg gga 1056 

Asn Thr Ser Asn Thr Ser Glu Pro Arg Arg Val Pro Ser Gly Val Gly 
340 345 350 



cat cgc gtt ttt gat acc gat gaa tac atg tta aag tta gec cag tea 
His Arg Val Phe Asp Thr Asp Glu Tyr Met Leu Lys Leu Ala Gin Ser 
355 360 365 



gtg tct gec tgg tec eta tac gag gag gag gat caa tea cag gcg ccg 

Val Ser Ala Trp Ser Leu Tyr Glu Glu Glu Asp Gin Ser Gin Ala Pro 

385 390 395 400 

ata tgt att gtt gtc cac cag cgt gaa ata tat egg cac ttt gaa tga 

He Cys He Val Val His Gin Arg Glu He Tyr Arg His Phe Glu 

405 410 415 



1104 



cca cga ccc tct gat ccc ggg cca ccc gat ccg ttt gca cag att tgg 1152 
Pro Arg Pro Ser Asp Pro Gly Pro Pro Asp Pro Phe Ala Gin He Trp 
370 375 380 



1200 



1248 



<210> 115 
<211> 415 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 



<400> 115 
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390 










395 










400 


He 


Cys 


He 


Val 


Val 


His 


Gin 


Arg 


Glu 


He 


Tyr 


Arg 


His 


Phe 


Glu 





405 410 415 



<210> 116 
<211> 1248 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . - (1248) 

<400> 116 

atg gcg ggc cgt gga gtc gat ate agg gec tgg etc gtt gec gcg gtc 
Met Ala Gly Arg Gly Val Asp lie Arg Ala Trp Leu Val Ala Ala Val 
15 10 15 

gaa teg ggc gag tac cgc gga ttg gtt tgg gag aat gag gac aag acc 
Glu Ser Gly Glu Tyr Arg Gly Leu Val Trp Glu Asn Glu Asp Lys Thr 
20 25 30 



48 



96 



gtt gtc egg gtc ccg tgg aac aaa gta act gca gat egg tec gta tgg 144 
Val Val Arg Val Pro Trp Asn Lys Val Thr Ala Asp Arg Ser Val Trp 
35 40 45 
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aat age gag aag ttt ttt gat gac tac tgc aac atg cga ggc ata tgt 192 

Asn Ser Glu Lys Phe Phe Asp Asp Tyr Cys Asn Met Arg Gly lie Cys 

50 55 60 

cag ggc gaa aaa ccg tec cac tat ggc aga ttc agg aaa atg aga ttc 240 

Gin Gly Glu Lys Pro Ser His Tyr Gly Arg Phe Arg Lys Met Arg Phe 
65 70 75 80 

ttg tac gac atg aga cac cac aaa agt ata cga gag ctt aaa ttt att 288 

Leu Tyr Asp Met Arg His His Lys Ser lie Arg Glu Leu Lys^.Phe lie 

85 90 95 

aat aag gec tat ggg cga ccg ggt get cga tat aga ctg ttt cgc ctg 336 

Asn Lys Ala Tyr Gly Arg Pro Gly Ala Arg Tyr Arg Leu Phe Arg Leu 

100 105 110 

ttg ccg gaa ccg gtg gta agt tgt gca atg tgt aat ctg atg tct tec 384 

Leu Pro Glu Pro Val Val Ser Cys Ala Met Cys Asn Leu Met Ser Ser 

115 120 125 

ace gaa act cag tgc ctt ggt tta ate agt gag ttc caa tac gac cag 432 

Thr Glu Thr Gin Cys Leu Gly Leu lie Ser Glu Phe Gin Tyr Asp Gin 

130 135 140 

999 ggt ggg tct gga cgt gaa aga aga egg gtt ttt acc gca acg gta 480 

Gly Gly Gly Ser Gly Arg Glu Arg Arg Arg Val Phe Thr Ala Thr Val 

145 150 155 160 

ttg get egg agt egg atg gac aaa aac aag cgt gtg cgc gag cat egg 528 

Leu Ala Arg Ser Arg Met Asp Lys Asn Lys Arg Val Arg Glu His Arg 

165 170 175 

ctg cca ggg gec ate caa tta acg ttt eta tat ttt gga gag acc gtt 576 

Leu Pro Gly Ala lie Gin Leu Thr Phe Leu Tyr Phe Gly Glu Thr Val 

180 185 190 

ggc etc gaa cgt gtt cat get gga ata egg gtc tgc agt cga cca tgt 624 

Gly Leu Glu Arg Val His Ala Gly lie Arg Val Cys Ser Arg Pro Cys 

195 200 205 

ccc gtc etc get ggc cac gcg tgt tgt ttc caa gat gag agg act ttg 672 

Pro Val Leu Ala Gly His Ala Cys Cys Phe Gin Asp Glu Arg Thr Leu 

210 215 220 

ttt tta ccg tea ccg ggg gtc gtt gac tgt cag ttt gec aga gaa gat 720 

Phe Leu Pro Ser Pro Gly Val Val Asp Cys Gin Phe Ala Arg Glu Asp 

225 230 235 240 

eta agg gtt atg cac aaa aag tgc gag aag ggt gtg ttg att acc ctt 768 

Leu Arg Val Met His Lys Lys Cys Glu Lys Gly Val Leu lie Thr Leu 

245 250 255 

acg gat act ggc att tgc gtg aag aac ctg gag aac egg gaa atg aag 816 

Thr Asp Thr Gly lie Cys Val Lys Asn Leu Glu Asn Arg Glu Met Lys 

260 265 270 

gtg ctt act aac aat gag gag gag tat aag gac ctg cca tec agg caa 864 

Val Leu Thr Asn Asn Glu Glu Glu Tyr Lys Asp Leu Pro Ser Arg Gin 

275 280 285 
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ccc gtt cag gtt ttt gat atg gtg gat tat ctt aga get ctt gca agg 912 

Pro Val Gin Val Phe Asp Met Val Asp Tyr Leu Arg Ala Leu Ala Arg 
290 295 300 

tec ccg aac cct gga gac gaa cca cct aga gat tat gca cag ate gee 960 

Ser Pro Asn Pro Gly Asp Glu Pro Pro Arg Asp Tyr Ala Gin lie Ala 

305 310 315 320 

ctg tgt ctg teg gtc cag teg ccg aat ccc gtc gat gcg ccc ate gee 1008 

Leu Cys Leu Ser Val Gin Ser Pro Asn Pro Val Asp Ala Pro lie Ala 

325 330 335 

atg aga ctg cgt tac gtg tgc gaa acc teg tct gta tgc ggt aca gag 1056 

Met Arg Leu Arg Tyr Val Cys Glu Thr Ser Ser Val Cys Gly Thr Glu 

340 345 350 

ggt tgt ttt tac ccc ggg acg act gta aca tea gag ggt cgc acc gac 1104 

Gly Cys Phe Tyr Pro Gly Thr Thr Val Thr Ser Glu Gly Arg Thr Asp 

355 360 365 

tgt tea ttc cag atg gaa gat cca gga gag ggg acg age caa teg cat 1152 

Cys Ser Phe Gin Met Glu Asp Pro Gly Glu Gly Thr Ser Gin Ser His 
370 375 380 

gac ccg gca gta gag ctg ggc gac tec gga cca gat tea atg gac gat 1200 

Asp Pro Ala Val Glu Leu Gly Asp Ser Gly Pro Asp Ser Met Asp Asp 

385 390 395 400 

ccg gac gcg gga acg age ggt gaa gat gat ggg gtg gcg tgc teg tga 124 8 

Pro Asp Ala Gly Thr Ser Gly Glu Asp Asp Gly Val Ala Cys Ser 

405 410 415 



<210> 117 
<211> 415 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 117 

Met Ala Gly Arg Gly Val Asp lie Arg Ala Trp Leu Val Ala Ala Val 

1 5 10 15 

Glu Ser Gly Glu Tyr Arg Gly Leu Val Trp Glu Asn Glu Asp Lys Thr 

20 25 30 

Val Val Arg Val Pro Trp Asn Lys Val Thr Ala Asp Arg Ser Val Trp 

35 40 45 

Asn Ser Glu Lys Phe Phe Asp Asp Tyr Cys Asn Met Arg Gly lie Cys 

50 55 60 

Gin Gly Glu Lys Pro Ser His Tyr Gly Arg Phe Arg Lys Met Arg Phe 
65 70 75 80 

Leu Tyr Asp Met Arg His His Lys Ser lie Arg Glu Leu Lys Phe lie 

85 90 95 

Asn Lys Ala Tyr Gly Arg Pro Gly Ala Arg Tyr Arg Leu Phe Arg Leu 

100 105 110 

Leu Pro Glu Pro Val Val Ser Cys Ala Met Cys Asn Leu Met Ser Ser 

115 120 125 

Thr Glu Thr Gin Cys Leu Gly Leu He Ser Glu Phe Gin Tyr Asp Gin 

130 135 140 

Gly Gly Gly Ser Gly Arg Glu Arg Arg Arg Val Phe Thr Ala Thr Val 
145 150 155 160 

Leu Ala Arg Ser Arg Met Asp Lys Asn Lys Arg Val Arg Glu His Arg 
165 170 175 
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Leu Pro Gly Ala lie Gin Leu Thr Phe Leu Tyr Phe Gly Glu Thr Val 

180 185 190 

Gly Leu Glu Arg Val His Ala Gly lie Arg Val Cys Ser Arg Pro Cys 

195 200 205 

Pro Val Leu Ala Gly His Ala Cys Cys Phe Gin Asp Glu Arg Thr Leu 

210 215 220 

Phe Leu Pro Ser Pro Gly Val Val Asp Cys Gin Phe Ala Arg Glu Asp 
225 230 235 240 

Leu Arg Val Met His Lys Lys Cys Glu Lys Gly Val Leu lie Thr Leu 

245 250 255 

Thr Asp Thr Gly lie Cys Val Lys Asn Leu Glu Asn Arg Glu Met Lys 

260 265 270 

Val Leu Thr Asn Asn Glu Glu Glu Tyr Lys Asp Leu Pro Ser Arg Gin 

275 280 285 

Pro Val Gin Val Phe Asp Met Val Asp Tyr Leu Arg Ala Leu Ala Arg 

290 295 300 

Ser Pro Asn Pro Gly Asp Glu Pro Pro Arg Asp Tyr Ala Gin lie Ala 
305 310 315 320 

Leu Cys Leu Ser Val Gin Ser Pro Asn Pro Val Asp Ala Pro lie Ala 

325 330 335 

Met Arg Leu Arg Tyr Val Cys Glu Thr Ser Ser Val Cys Gly Thr Glu 

340 345 350 

Gly Cys Phe Tyr Pro Gly Thr Thr Val Thr Ser Glu Gly Arg Thr Asp 

355 360 365 

Cys Ser Phe Gin Met Glu Asp Pro Gly Glu Gly Thr Ser Gin Ser His 

370 375 380 

Asp Pro Ala Val Glu Leu Gly Asp Ser Gly Pro Asp Ser Met Asp Asp 
385 390 395 400 

Pro Asp Ala Gly Thr Ser Gly Glu Asp Asp Gly Val Ala Cys Ser 
405 410 415 



<210> 118 
<211> 1056 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (1056) 

<400> 118 

atg gag egg ccg gtc cga gtg acc aaa cca tec age ctt cga gga tgg 

Met Glu Arg Pro Val Arg Val Thr Lys Pro Ser Ser Leu Arg Gly Trp 
1 5 10 15 



ctg gtg gaa tgc tgc gag aca ggt cgt cat ccc gga atg egg tgg gtt 

Leu Val Glu Cys Cys Glu Thr Gly Arg His Pro Gly Met Arg Trp Val 
20 25 30 

gac gag gaa aga act tta ate cgc ata ccg tgg aat cac gac aga ggg 

Asp Glu Glu Arg Thr Leu He Arg He Pro Trp Asn His Asp Arg Gly 
35 40 45 

age aga gga gtc gag gag gcg gaa aaa aac att ttt att gac tac tgt 

Ser Arg Gly Val Glu Glu Ala Glu Lys Asn He Phe He Asp Tyr Cys 
50 55 60 



cga tec agg ggc ate ttg cac gcg get ggc agg gaa tta acc get aag 
Arg Ser Arg Gly He Leu His Ala Ala Gly Arg Glu Leu Thr Ala Lys 



65 70 75 80 

gaa tgc aaa aac tgg eta tec age gec ata egg cat agt caa acc gtg 288 
Glu Cys Lys Asn Trp Leu Ser Ser Ala lie Arg His Ser Gin Thr Val 
85 90 95 

teg gac gtg tea act aag gac aac ctt teg act cca tat ccg gac aga 336 
Ser Asp Val Ser Thr Lys Asp Asn Leu Ser Thr Pro Tyr Pro Asp Arg 
100 105 110 

tgc cga ate att cgc ctt tta cca att acc gtg agg teg tgt get cgt 384 
Cys Arg He He Arg Leu Leu Pro He Thr Val Arg Ser Cys Ala Arg 
115 120 125 

tgc gat cag gec tec ggc act act get atg tta egg ggg ttg cgc gag 432 
Cys Asp Gin Ala Ser Gly, Thr Thr Ala Met Leu Arg Gly Leu Arg Glu 
130 135 140 

gag gec gtg aat aag ttt ggt ccg gtt ggc gec ggt gtt egg tac act 480 
Glu Ala Val Asn Lys Phe Gly Pro Val Gly Ala Gly Val Arg Tyr Thr 
145 150 155 160 

ggt gcg gtg gga gee ggg ggt gaa cag tgt tgg atg eta cgt ata atg 52 8 
Gly Ala Val Gly Ala Gly Gly Glu Gin Cys Trp Met Leu Arg He Met 
165 170 175 

ttt tac tat tat ggg gac aga gtt ggt gaa gtg gtg act gaa teg tec 576 
Phe Tyr Tyr Tyr Gly Asp Arg Val Gly Glu Val Val Thr Glu Ser Ser 
180 185 190 

aac ggc ate cgt gtg etc cca ttg tec gaa cgt cgc cct caa ggg cac 624 
Asn Gly He Arg Val Leu Pro Leu Ser Glu Arg Arg Pro Gin Gly His 
195 200 205 

ata tgc gec gca ccg ate gec gag cag gca ctg gtt ccg gag att cca 672 
He Cys Ala Ala Pro He Ala Glu Gin Ala Leu Val Pro Glu He Pro 
210 215 220 

ggc cac tta gca gag ttt cag gee gag gcg ttg cgc ttc ctg gat aaa 720 
Gly His Leu Ala Glu Phe Gin Ala Glu Ala Leu Arg Phe Leu Asp Lys 
225 230 235 240 

gat ctt ttg cgc ggg ttg gcg ttt tgg gec gac ccc tec ggc att tac 768 
Asp Leu Leu Arg Gly Leu Ala Phe Trp Ala Asp Pro Ser Gly He Tyr 
245 250 255 

att aga tgg ttg ggg cat tec ctg gcg ttc gtt caa ggg aac gtc gaa 816 
He Arg Trp Leu Gly His Ser Leu Ala Phe Val Gin Gly Asn Val Glu 
260 265 270 

tct acg ggg gcg gtt gcg gtc ctg teg tgc gee aac gcg tgt cgt gca 864 
Ser Thr Gly Ala Val Ala Val Leu Ser Cys Ala Asn Ala Cys Arg Ala 
275 280 285 

ttt aac ttg gtt gac tat atg aca gee atg gec aga aca tct ccg gac 912 
Phe Asn Leu Val Asp Tyr Met Thr Ala Met Ala Arg Thr Ser Pro Asp 
290 295 300 

gga gcg gcg ctt cca caa get tgt gtt tac eta tat ttt ggg ggc gta 960 
Gly Ala Ala Leu Pro Gin Ala Cys Val Tyr Leu Tyr Phe Gly Gly Val 
305 310 315 320 



207 



cct aca ccc gag ggc gga gtg caa age act gta ccg eta att att cag 

Pro Thr Pro Glu Gly Gly Val Gin Ser Thr Val Pro Leu lie He Gin 
325 330 335 

ctg tgg cac gag tgt ctg tgg egg gcg ttg tec gec get aat gtt taa 

Leu Trp His Glu Cys Leu Trp Arg Ala Leu Ser Ala Ala Asn Val 
340 345 350 



<210> 119 
<211> 351 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 119 

Met Glu Arg Pro Val Arg Val Thr Lys Pro Ser Ser Leu Arg Gly Trp 

15 10 15 

Leu Val Glu Cys Cys Glu Thr Gly Arg His Pro Gly Met Arg Trp Val 

20 25 30 

Asp Glu Glu Arg Thr Leu He Arg He Pro Trp Asn His Asp Arg Gly 

35 40 45 

Ser Arg Gly Val Glu Glu Ala Glu Lys Asn He Phe He Asp Tyr Cys 

50 55 60 

Arg Ser Arg Gly He Leu His Ala Ala Gly Arg Glu Leu Thr Ala Lys 
65 70 75 80 

Glu Cys Lys Asn Trp Leu Ser Ser Ala He Arg His Ser Gin Thr Val 

85 90 95 

Ser Asp Val Ser Thr Lys Asp Asn Leu Ser Thr Pro Tyr Pro Asp Arg 

100 105 HO 

Cys Arg He He Arg Leu Leu Pro He Thr Val Arg Ser Cys Ala Arg 

115 120 125 

Cys Asp Gin Ala Ser Gly Thr Thr Ala Met Leu Arg Gly Leu Arg Glu 

130 135 140 

Glu Ala Val Asn Lys Phe Gly Pro Val Gly Ala Gly Val Arg Tyr Thr 
145 150 155 160 

Gly Ala Val Gly Ala Gly Gly Glu Gin Cys Trp Met Leu Arg lie Met 

165 170 175 

Phe Tyr Tyr Tyr Gly Asp Arg Val Gly Glu Val Val Thr Glu Ser Ser 

180 185 190 

Asn Gly He Arg Val Leu Pro Leu Ser Glu Arg Arg Pro Gin Gly His 

195 200 205 

He Cys Ala Ala Pro He Ala Glu Gin Ala Leu Val Pro Glu He Pro 

210 215 220 

Gly His Leu Ala Glu Phe Gin Ala Glu Ala Leu Arg Phe Leu Asp Lys 
225 230 235 240 

Asp Leu Leu Arg Gly Leu Ala Phe Trp Ala Asp Pro Ser Gly He Tyr 

245 250 255 

He Arg Trp Leu Gly His Ser Leu Ala Phe Val Gin Gly Asn Val Glu 

260 265 270 

Ser Thr Gly Ala Val Ala Val Leu Ser Cys Ala Asn Ala Cys Arg Ala 

275 280 285 

Phe Asn Leu Val Asp Tyr Met Thr Ala Met Ala Arg Thr Ser Pro Asp 

290 295 300 

Gly Ala Ala Leu Pro Gin Ala Cys Val Tyr Leu Tyr Phe Gly Gly Val 
305 310 315 320 

Pro Thr Pro Glu Gly Gly Val Gin Ser Thr Val Pro Leu He He Gin 

325 330 335 

Leu Trp His Glu Cys Leu Trp Arg Ala Leu Ser Ala Ala Asn Val 
340 345 350 
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<210> 120 
<211> 762 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . - (762) 



<400> 120 

atg gat tct gga tgc tat gcg tgt ate ctt gat gag aac tct gaa ggg 

Met Asp Ser Gly Cys Tyr Ala Cys lie Leu Asp Glu Asn Ser Glu Gly 
1 5 10 15 

att ata aat tat ttg gag cag gtg tgt ggc ate ggg tta gag ccg gga 
He He Asn Tyr Leu Glu Gin Val Cys Gly He Gly Leu Glu Pro Gly 
20 25 30 

atg ccg ctg ccc gee ccc ttg ccc acg ttg gtc cct ccg acg aga tea 
Met Pro Leu Pro Ala Pro Leu Pro Thr Leu Val Pro Pro Thr Arg Ser 
35 40 45 

gcg tac gec cga gca cat cgt etc ggg gtc cca gaa gcg ccg ctg cct 
Ala Tyr Ala Arg Ala His Arg Leu Gly Val Pro Glu Ala Pro Leu Pro 
50 55 60 



48 



96 



144 



192 



cat caa att gtg ccg ttt tgg egg ttg aga att cag gtg ttt tat ttt 
His Gin He Val Pro Phe Trp Arg Leu Arg He Gin Val Phe Tyr Phe 

70 75 80 



65 



240 



ggc gtt ctg gca ctg gat cac acg tec caa gat egg aga ggt gtt cgt 
Gly Val Leu Ala Leu Asp His Thr Ser Gin Asp Arg Arg Gly Val Arg 
85 90 95 

ttg cat ccg cgc ccg gtt ccc cat ccc ggt cat ctg tgt ttc tat ggc 
Leu His Pro Arg Pro Val Pro His Pro Gly His Leu Cys Phe Tyr Gly 
100 105 HO 



288 



336 



acc ggg ttc act gtc tgg ttt cca tct cca gat cgt gaa aaa ctg acc 
Thr Gly Phe Thr Val Trp Phe Pro Ser Pro Asp Arg Glu Lys Leu Thr 
115 120 125 



384 



gca gaa cag ata acg caa ata aaa acg atg ctg gtc gcg tat aac gag 
Ala Glu Gin He Thr Gin He Lys Thr Met Leu Val Ala Tyr Asn Glu 

135 140 



130 



432 



ggc ate tac gta cac ggc aac gaa acc gga gtg tac gtc gat aat agg 
Gly He Tyr Val His Gly Asn Glu Thr Gly Val Tyr Val Asp Asn Arg 
145 150 155 160 

aac agg gaa acg ctg tac gcg gcg ggt aac gac tgt aat ggt gat att 
Asn Arg Glu Thr Leu Tyr Ala Ala Gly Asn Asp Cys Asn Gly Asp He 
165 170 175 



480 



528 



ate caa cga gag gtc atg ttc etc tct aaa cag aaa att ttt aac ttc 
He Gin Arg Glu Val Met Phe Leu Ser Lys Gin Lys He Phe Asn Phe 
180 185 190 



576 



atg ggt ttc atg aga aag ctt gee cgc tct ccg gga cca gag age cat 

209 



624 



Met Gly Phe Met Arg Lys Leu Ala Arg Ser Pro Gly Pro Glu Ser His 
195 200 205 



gcg ccg tgc aac ggg gca aca ctg tat etc tea cag cag cct ggt gec 

Ala Pro Cys Asn Gly Ala Thr Leu Tyr Leu Ser Gin Gin Pro Gly Ala 

210 215 220 

caa gag age cca caa gtt cct att tct gtg gtg gtc tgt cag gac gag 

Gin Glu Ser Pro Gin Val Pro He Ser Val Val Val Cys Gin Asp Glu 
225 230 235 240 

ctg gtg cag gga cac atg aat cct tec egg tgg tgc tea taa 

Leu Val Gin Gly His Met Asn Pro Ser Arg Trp Cys Ser 
245 250 



<210> 121 
<211> 253 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 



<400> 121 



Met 


Asp 


Ser 


Gly 


Cys 


Tyr 


Ala 


Cys 


He 


Leu 


Asp 


Glu 


Asn 


Ser 


Glu 


Gly 


1 






5 










10 










15 




He 


He 


Asn 


Tyr 


Leu 


Glu 


Gin 


Val 


Cys 


Gly 


lie 


Gly 


Leu 


Glu 


Pro 


Gly 








20 










25 










30 






Met 


Pro 


Leu 


Pro 


Ala 


Pro 


Leu 


Pro 


Thr 


Leu 


Val 


Pro 


Pro 


Thr 


Arg 


Ser 






35 










40 










45 








Ala 


Tyr 


Ala 


Arg 


Ala 


His 


Arg 


Leu 


Gly 


Val 


Pro 


Glu 


Ala 


Pro 


Leu 


Pro 




50 










55 










60 










His 


Gin 


He 


Val 


Pro 


Phe 


Trp 


Arg 


Leu 


Arg 


He 


Gin 


Val 


Phe 


Tyr 


Phe 


65 










70 










75 










80 


Gly 


Val 


Leu 


Ala 


Leu 


Asp 


His 


Thr 


Ser 


Gin 


Asp 


Arg 


Arg 


Gly 


val 


Arg 








85 










90 










95 




Leu 


His 


Pro 


Arg 


Pro 


Val 


Pro 


His 


Pro 


Gly 


His 


Leu 


Cys 


Phe 


Tyr 


Gly 








100 










105 










110 






Thr 


Gly 


Phe 


Thr 


Val 


Trp 


Phe 


Pro 


Ser 


Pro 


Asp 


Arg 


Glu 


Lys 


Leu 


Thr 






115 










120 










125 








Ala 


Glu 


Gin 


He 


Thr 


Gin 


He 


Lys 


Thr 


Met 


Leu 


Val 


Ala 


Tyr 


Asn 


Glu 




130 










135 










140 










Gly 


He 


Tyr 


Val 


His 


Gly 


Asn 


Glu 


Thr 


Gly 


Val 


Tyr 


Val 


Asp 


Asn 


Arg 


145 










150 










155 










160 


Asn 


Arg 


Glu 


Thr 


Leu 


Tyr 


Ala 


Ala 


Gly 


Asn 


Asp 


Cys 


Asn 


Gly 


Asp 


He 








165 










170 










175 




He 


Gin 


Arg 


Glu 


Val 


Met 


Phe 


Leu 


Ser 


Lys 


Gin 


Lys 


He 


Phe 


Asn 


Phe 








180 










185 










190 






Met 


Gly 


Phe 


Met 


Arg 


Lys 


Leu 


Ala 


Arg 


Ser 


Pro 


Gly 


Pro 


Glu 


Ser 


His 






195 










200 










205 








Ala 


Pro 


Cys 


Asn 


Gly 


Ala 


Thr 


Leu 


Tyr 


Leu 


Ser 


Gin 


Gin 


Pro 


Gly 


Ala 




210 










215 










220 










Gin 


Glu 


Ser 


Pro 


Gin 


Val 


Pro 


He 


Ser 


Val 


Val 


Val 


Cys 


Gin 


Asp 


Glu 


225 










230 










235 










240 


Leu 


Val 


Gin 


Gly 


His 


Met 


Asn 


Pro 


Ser 


Arg 


Trp 


Cys 


Ser 









245 250 



<210> 122 

<211> 1158 

<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 



<220> 
<221> CDS 
<222> (1) . 



(1158) 



acg gcg ctg gtc ttt agg ttg cgt tea tec gaa gag atg tct tgt ccg 
Thr Ala Leu Val Phe Arg Leu Arg Ser Ser Glu Glu Met Ser Cys Pro 
115 120 125 



48 



96 



<400> 122 

atg gec get ggg gaa teg aga agg ggc cca tct cgt tat ggc atg gec 

Met Ala Ala Gly Glu Ser Arg Arg Gly Pro Ser Arg Tyr Gly Met Ala 

15 10 15 

etc aag gaa tgg ctg acg ttc aaa gec gat agt ggt ctg tac ccg ggc 

Leu Lys Glu Trp Leu Thr Phe Lys Ala Asp Ser Gly Leu Tyr Pro Gly 

20 25 30 

ctg ttt tgg gca gac gag caa aag acg agg ctg gtg ttg gca get aca 144 

Leu Phe Trp Ala Asp Glu Gin Lys Thr Arg Leu Val Leu Ala Ala Thr 

35 40 45 

cca ccg teg ttt ccc aat tat gat tat caa cgc gac ggt caa cac tat 192 

Pro Pro Ser Phe Pro Asn Tyr Asp Tyr Gin Arg Asp Gly Gin His Tyr 

50 55 60 

gat gec tac tgt gag tta aga cac ate cca ctg ccg tct gga agg age 

Asp Ala Tyr Cys Glu Leu Arg His He Pro Leu Pro Ser Gly Arg Ser 

65 70 75 80 

agg ctg tgt caa gec agg ggg cgc ctg ctg gga gee gta aga aag age 

Arg Leu Cys Gin Ala Arg Gly Arg Leu Leu Gly Ala Val Arg Lys Ser 

85 90 95 

aag tac ttt gag gaa gat aag gaa ttc cct acc gat caa ttt ccg ttc 336 

Lys Tyr Phe Glu Glu Asp Lys Glu Phe Pro Thr Asp Gin Phe Pro Phe 

100 105 HO 



240 



288 



384 



gtg tgt ccg cgc gtg tgc gcg etc cgc ctt gag ttg cgc aat atg egg 432 • 
Val Cys Pro Arg Val Cys Ala Leu Arg Leu Glu Leu Arg Asn Met Arg 
130 135 140 



ttt gec atg etc ggg cgc gga atg ctt cac gec ctt teg gga ccg tec 480 

Phe Ala Met Leu Gly Arg Gly Met Leu His Ala Leu Ser Gly Pro Ser 
145 150 155 160 

gta agt gat caa gag aga egg tat egg gaa ggt cat caa gat gga cac 52 8 

Val Ser Asp Gin Glu Arg Arg Tyr Arg Glu Gly His Gin Asp Gly His 
165 170 175 

gac gcg cag gac gac gat gca gcg tat tct teg ggc ctt ctg cgc gee 576 

Asp Ala Gin Asp Asp Asp Ala Ala Tyr Ser Ser Gly Leu Leu Arg Ala 
180 185 190 

aga ctt atg gcg tgt get gcg cca tct gee ggc gat cca tgg ggt cac 624 

Arg Leu Met Ala Cys Ala Ala Pro Ser Ala Gly Asp Pro Trp Gly His 
195 200 205 

atg cat ata aag att tat tac tat ggg caa ttg cag get gag ctg ttg 672 

Met His He Lys He Tyr Tyr Tyr Gly Gin Leu Gin Ala Glu Leu Leu 

210 215 220 



j\. %3 D.Ji ii.j! ::;;;!* ilji a:;;:: ; u .,j 



aca gcc acc ggg cag gga ate agg eta tct tea aaa cca aca aat aag 720 

Thr Ala Thr Gly Gin Gly lie Arg Leu Ser Ser Lys Pro Thr Asn Lys 

225 230 235 240 

gcg gga cac cac gtg tgt gtt ttg gat gga ccc ctt cag gcg tgg ttt 768 

Ala Gly His His Val Cys Val Leu Asp Gly Pro Leu Gin Ala Trp Phe 

245 250 255 



cct ccg ata cca cag acg act gag teg tct gtg gtg cag aga etc gag 

Pro Pro lie Pro Gin Thr Thr Glu Ser Ser Val Val Gin Arg Leu Glu 
260 265 270 

gac get ttg aaa tgg etc gtg gat gga ata att ttc tgc age acc age 

Asp Ala Leu Lys Trp Leu Val Asp Gly lie He Phe Cys Ser Thr Ser 
275 " 280 285 

agg gga att atg ttc acg att acc ggg ggt cct aac gtg tgg ttt cag 

Arg Gly He Met Phe Thr He Thr Gly Gly Pro Asn Val Trp Phe Gin 
290 295 300 

gga aat acc gtg gaa ccg tac agt ttg cca cat cgt gcc tac acc ggg 

Gly Asn Thr Val Glu Pro Tyr Ser Leu Pro His Arg Ala Tyr Thr Gly 

305 310 315 320 



gaa taa 

Glu 

385 



816 



864 



912 



960 



1008 



1056 



atg cac gtg tgg gca ttt gac act gat cgt tat ttg ctt gat atg gcc 

Met His Val Trp Ala Phe Asp Thr Asp Arg Tyr Leu Leu Asp Met Ala 

325 330 335 

agg tec ccc tec ccg cgc gac acg ggt ccc ccc gcg gcc ttt gtt aaa 

Arg Ser Pro Ser Pro Arg Asp Thr Gly Pro Pro Ala Ala Phe Val Lys 
340 345 350 

etc tgg gtt teg ggt tgt tec ctg ggc gag gag cga aac tct teg agg 1104 

Leu Trp Val Ser Gly Cys Ser Leu Gly Glu Glu Arg Asn Ser Ser Arg 
355 360 365 

get cca eta tct att att gtt tat caa aca gaa ate tat agg cac ttt 1152 

Ala Pro Leu Ser He He Val Tyr Gin Thr Glu He Tyr Arg His Phe 
370 375 380 



1158 



<210> 123 

<211> 385 

<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 



<400> 123 

Met Ala Ala Gly Glu Ser Arg Arg Gly Pro Ser Arg Tyr Gly Met Ala 

1 5 10 15 

Leu Lys Glu Trp Leu Thr Phe Lys Ala Asp Ser Gly Leu Tyr Pro Gly 

20 25 30 

Leu Phe Trp Ala Asp Glu Gin Lys Thr Arg Leu Val Leu Ala Ala Thr 

35 40 45 

Pro Pro Ser Phe Pro Asn Tyr Asp Tyr Gin Arg Asp Gly Gin His Tyr 

50 55 60 

Asp Ala Tyr Cys Glu Leu Arg His He Pro Leu Pro Ser Gly Arg Ser 

212 



65 70 75 80 

Arg Leu Cys Gin Ala Arg Gly Arg Leu Leu Gly Ala Val Arg Lys Ser 

85 90 95 

Lys Tyr Phe Glu Glu Asp Lys Glu Phe Pro Thr Asp Gin Phe Pro Phe 

100 105 110 

Thr Ala Leu Val Phe Arg Leu Arg Ser Ser Glu Glu Met Ser Cys Pro 

115 120 125 

Val Cys Pro Arg Val Cys Ala Leu Arg Leu Glu Leu Arg Asn Met Arg 

130 135 140 

Phe Ala Met Leu Gly Arg Gly Met Leu His Ala Leu Ser Gly Pro Ser 
145 150 155 160 

Val Ser Asp Gin Glu Arg Arg Tyr Arg Glu Gly His Gin Asp Gly His 

165 170 175 

Asp Ala Gin Asp Asp Asp Ala Ala Tyr Ser Ser Gly Leu Leu Arg Ala 

180 185 190 

Arg Leu Met Ala Cys Ala Ala Pro Ser Ala Gly Asp Pro Trp Gly His 

195 200 205 

Met His lie Lys lie Tyr Tyr Tyr Gly Gin Leu Gin Ala Glu Leu Leu 

210 215 220 

Thr Ala Thr Gly Gin Gly He Arg Leu Ser Ser Lys Pro Thr Asn Lys 
225 230 235 240 

Ala Gly His His Val Cys Val Leu Asp Gly Pro Leu Gin Ala Trp Phe 

245 250 255 

Pro Pro He Pro Gin Thr Thr Glu Ser Ser Val Val Gin Arg Leu Glu 

260 265 270 

Asp Ala Leu Lys Trp Leu Val Asp Gly He He Phe Cys Ser Thr Ser 

275 280 285 

Arg Gly He Met Phe Thr He Thr Gly Gly Pro Asn Val Trp Phe Gin 

290 295 300 

Gly Asn Thr Val Glu Pro Tyr Ser Leu Pro His Arg Ala Tyr Thr Gly 
305 310 315 320 

Met His Val Trp Ala Phe Asp Thr Asp Arg Tyr Leu Leu Asp Met Ala 

325 330 335 

Arg Ser Pro Ser Pro Arg Asp Thr Gly Pro Pro Ala Ala Phe Val Lys 

340 345 350 

Leu Trp Val Ser Gly Cys Ser Leu Gly Glu Glu Arg Asn Ser Ser Arg 

355 360 365 

Ala Pro Leu Ser He He Val Tyr Gin Thr Glu He Tyr Arg His Phe 
370 375 380 

Glu 
385 



<210> 124 
<211> 1173 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) - - (1173) 

<400> 124 

atg gcg gaa cgc gat atg gat tta aaa get tgg ttc ate gag gec gtg 

Met Ala Glu Arg Asp Met Asp Leu Lys Ala Trp Phe He Glu. Ala Val 
15 10 15 

gag tct aag cga tac ccc gga gtg gaa tgg gat gac gag gac aag acc 
Glu Ser Lys Arg Tyr Pro Gly Val Glu Trp Asp Asp Glu Asp Lys Thr 
20 25 30 



213 
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agg aca egg egg ccg agg get aag gcg cca ctg aga gtc tec gtt tat 

Arg Thr Arg Arg Pro Arg Ala Lys Ala Pro Leu Arg Val Ser Val Tyr 
145 150 155 160 

aaa egg aaa tec aaa cga etc cag gac tct tct gcg caa cct gtg eta 

Lys Arg Lys Ser Lys Arg Leu Gin Asp Ser Ser Ala Gin Pro Val Leu 

165 170 175 

ggg get gtt gag gta tec ttt ttt tac ttt gga gag aac gtc gga gtt 

Gly Ala Val Glu Val Ser Phe Phe Tyr Phe Gly Glu Asn Val Gly Val 
180 185 190 

caa att ttg cga gee ggc teg ggt gtg cgt ata tgt ggc ctt cct gat 

Gin lie Leu Arg Ala Gly Ser Gly Val Arg lie Cys Gly Leu Pro Asp 
195 200 205 



ttc ttg ccg teg tec cag ttg att ccc tgc gag ttt gec agg gca gat 

Phe Leu Pro Ser Ser Gin Leu lie Pro Cys Glu Phe Ala Arg Ala Asp 

225 230 235 240 

ttg cag gec ctg cag aaa aca tgc gag aga ggt ttg ate tgt gtt atg 

Leu Gin Ala Leu Gin Lys Thr Cys Glu Arg Gly Leu lie Cys Val Met 
245 250 255 

acg gag tct gga ate tgt gtg aaa aac ctg gaa gaa cge aac atg acg 

Thr Glu Ser Gly lie Cys Val Lys Asn Leu Glu Glu Arg Asn Met Thr 
260 265 270 



att ata cgt gtc ccg tgg aac egg tgc ace gac agt cgc gtc gat gaa 144 

lie lie Arg Val Pro Trp Asn Arg Cys Thr Asp Ser Arg Val Asp Glu 

35 40 45 

gat tat aac aag ata ttt gac gat ttt tgc tea gcg cgt ggc gtg tgt 192 

Asp Tyr Asn Lys lie Phe Asp Asp Phe Cys Ser Ala Arg Gly Val Cys 

50 55 60 

caa acc ggt tea cat gee cag aaa ttt aag aaa att agg atg tta tat 240 

Gin Thr Gly Ser His Ala Gin Lys Phe Lys Lys lie Arg Met Leu Tyr 

65 70 75 80 

gca gtg aga tct cac agg tat ttg agg gag ctg aca ccg ccg age aag 2 88 

Ala Val Arg Ser His Arg Tyr Leu Arg Glu Leu Thr Pro Pro Ser Lys 

85 90 95 

gec ggg ggc gtc tct ggg gaa aga tac aga etc ttt caa ttg ctt cct 336 

Ala Gly Gly Val Ser Gly Glu Arg Tyr Arg Leu Phe Gin Leu Leu Pro 

100 105 HO 

gag gtt acg gtg ggc tgc gat ctg tgt aac etc ate gcg acc aca teg 384 

Glu Val Thr Val Gly Cys Asp Leu Cys Asn Leu lie Ala Thr Thr Ser 

115 120 125 

ttg cat age tgt tec atg ggc agt tgc gtt cga gag gat gtt ttc gag 432 

Leu His Ser Cys Ser Met Gly Ser Cys Val Arg Glu Asp Val Phe Glu 

130 135 140 



480 



528 



576 



624 



ccc aaa cgt ccc gga cac ctg tgt tgt gec gat aat cca ttg acg tgt 672 
Pro Lys Arg Pro Gly His Leu Cys Cys Ala Asp Asn Pro Leu Thr Cys 
210 215 220 



720 



768 



816 



214 



gca ttg acc aac tat tct gaa aat tac tat gag ctg egg cct teg cag 

Ala Leu Thr Asn Tyr Ser.Glu Asn Tyr Tyr Glu Leu Arg Pro Ser Gin 

275 280 285 

ccg etc cag gcg ttc gat etc ttg cat tat ttg agg gag ctg gcg aga 

Pro Leu Gin Ala Phe Asp Leu Leu His Tyr Leu Arg Glu Leu Ala Arg 
290 295 300 

teg cca act cca gga gat gtc ccg ccg cgt gac tgt gcg tgg att ttc 

Ser Pro Thr Pro Gly Asp Val Pro Pro Arg Asp Cys Ala Trp lie Phe 

305 310 315 320 

atg tgc ccg tct act cag tec gaa aat aca tgg gat get ccc att gec 

Met Cys Pro Ser Thr Gin Ser Glu Asn Thr Trp Asp Ala Pro lie Ala 
325 330 335 

ctg aaa ctt cgc tac gtg tgc aat gat gat gtg tea gat gat gtc age 

Leu Lys Leu Arg Tyr Val Cys Asn Asp Asp Val Ser Asp Asp Val Ser 
340 345 350 

aac ggt gec gcg gga gat gac age ggg gac gag gga ccg tct gga gcg 

Asn Gly Ala Ala Gly Asp Asp Ser Gly Asp Glu Gly Pro Ser Gly Ala 

355 360 365 

ggt gtc ggt get teg gga aca acg gga age aca tct gta tct acc etc 

Gly Val Gly Ala Ser Gly Thr Thr Gly Ser Thr Ser Val Ser Thr Leu 
370 375 380 



gca ccg tat ggg agg aag taa 
Ala Pro Tyr Gly Arg Lys 
385 390 



<210> 125 
<211> 390 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 125 

Met Ala Glu Arg Asp Met Asp Leu Lys Ala Trp Phe lie Glu Ala Val 

15 10 15 

Glu Ser Lys Arg Tyr Pro Gly Val Glu Trp Asp Asp Glu Asp Lys Thr 

20 25 30 

He He Arg Val Pro Trp Asn Arg Cys Thr Asp Ser Arg Val Asp Glu 

35 40 45 

Asp Tyr Asn Lys He Phe Asp Asp Phe Cys Ser Ala Arg Gly Val Cys 

50 55 60 

Gin Thr Gly Ser His Ala Gin Lys Phe Lys Lys He Arg Met Leu Tyr 
65 70 75 80 

Ala Val Arg Ser His Arg Tyr Leu Arg Glu Leu Thr Pro Pro Ser Lys 

85 90 95 

Ala Gly Gly Val Ser Gly Glu Arg Tyr Arg Leu Phe Gin Leu Leu Pro 

100 105 110 

Glu Val Thr Val Gly Cys Asp Leu Cys Asn Leu He Ala Thr Thr Ser 

115 120 125 

Leu His Ser Cys Ser Met Gly Ser Cys Val Arg Glu Asp Val Phe Glu 

130 135 140 

Arg Thr Arg Arg Pro Arg Ala Lys Ala Pro Leu Arg Val Ser Val Tyr 
145 150 155 160 

Lys Arg Lys Ser Lys Arg Leu Gin Asp Ser Ser Ala Gin Pro Val Leu 
165 170 175 



Gly 


Ala 


Val 


Glu 
180 


Val 


Ser 


Phe 


Phe 


Tyr 
185 


Phe 


Gly 


Glu 


Asn 


Val 
190 


Gly 


Val 


Gin 


lie 


Leu 
195 


Arg 


Ala 


Gly 


Ser 


Gly 
200 


Val 


Arg 


He 


Cys 


Gly 
205 


Leu 


Pro 


Asp 


Pro 


Lys 
210 


Arg 


Pro 


Gly 


His 


Leu 
215 


Cys 


Cys 


Ala 


Asp 


Asn 
220 


Pro 


Leu 


Thr 


Cys 


Phe 


Leu 


Pro 


Ser 


Ser 


Gin 


Leu 


He 


Pro 


Cys 


Glu 


Phe 


Ala 


Arg 


Ala 


Asp 


225 










230 










235 










240 


Leu 


Gin 


Ala 


Leu 


Gin 
245 


Lys 


Thr 


Cys 


Glu 


Arg 
250 


Gly 


Leu 


He 


Cys 


Val 

255 


Met 


Thr 


Glu 


Ser 


Gly 
260 


He 


Cys 


Val 


Lys 


Asn 
265 


Leu 


Glu 


Glu 


Arg 


Asn 
270 


Met 


Thr 


Ala 


Leu 


Thr 
275 


Asn 


Tyr 


Ser 


Glu 


Asn 
280 


Tyr 


Tyr 


Glu 


Leu 


Arg 
285 


Pro 


Ser 


Gin 


Pro 


Leu 
290 


Gin 


Ala 


Phe 


Asp 


Leu 

295 


Leu 


His 


Tyr 


Leu 


Arg 
300 


Glu 


Leu 


Ala 


Arg 


Ser 


Pro 


Thr 


Pro 


Gly 


Asp 


Val 


Pro 


Pro 


Arg 


Asp 


Cys 


Ala 


Trp 


He 


Phe 


305 










310 










315 










320 


Met 


Cys 


Pro 


Ser 


Thr 

325 


Gin 


Ser 


Glu 


Asn 


Thr 
330 


Trp 


Asp 


Ala 


Pro 


He 
335 


Ala 


Leu 


Lys 


Leu 


Arg 
340 


Tyr 


Val 


Cys 


Asn 


Asp 
345 


Asp 


Val 


Ser 


Asp 


Asp 
350 


Val 


Ser 


Asn 


Gly 


Ala 
355 


Ala 


Gly 


Asp 


Asp 


Ser 
360 


Gly 


Asp 


Glu 


Gly 


Pro 
365 


Ser 


Gly 


Ala 


Gly 


Val 
370 


Gly 


Ala 


Ser 


Gly 


Thr 
375 


Thr 


Gly 


Ser 


Thr 


Ser 
380 


Val 


Ser 


Thr 


Leu 


Ala 


Pro 


Tyr 


Gly 


Arg 


Lys 























385 390 



<210> 126 
<211> 1068 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (1068) 

<400> 126 

atg gcg gaa gga egg gca ggg teg ate cgc gta aac egg ccg tct ggc 48 

Met Ala Glu Gly Arg Ala Gly Ser lie Arg Val Asn Arg Pro Ser Gly 
15 10 15 

ctg agg gca tgg ttg etc gat tgt tgc gat aat gat aag cat cct gga 96 
Leu , Arg Ala Trp Leu Leu Asp Cys Cys Asp Asn Asp Lys His Pro Gly 
20 25 30 

atg cat tgg ctg gat gag gaa aag act ctg gtt cgt tta cct tgg aat 144 
Met His Trp Leu Asp Glu Glu Lys Thr Leu Val Arg Leu Pro Trp Asn 
35 40 45 

cat tta aag ggc gcg ggt ggc gtc tct gac gat gag aga aac atg tat 192 
His Leu Lys Gly Ala Gly Gly Val Ser Asp Asp Glu Arg Asn Met Tyr 
50 55 60 

ctg gac tat tgt caa ttt aaa ggc ata egg cag acc ggt aac aga cga 240 
Leu Asp Tyr Cys Gin Phe Lys Gly He Arg Gin Thr Gly Asn Arg Arg 
65 70 75 80 



216 



ttg age gtc agg gaa tgt aag aac tgg ttg gec age get ata cgc cac 288 
Leu Ser Val Arg Glu Cys Lys Asn Trp Leu Ala Ser Ala lie Arg His 
85 90 95 

agt cag acc gtc gaa gat gta tec act gag gag aac ctg teg gcg ccg 336 
Ser Gin Thr Val Glu Asp Val Ser Thr Glu Glu Asn Leu Ser Ala Pro 
100 105 110 

gcg cct aac agg tgc cgc gtt att cgc ctg ttg ccg ate ttt gta cga 384 
Ala Pro Asn Arg Cys Arg Val lie Arg Leu Leu Pro He Phe Val Arg 
115 120 125 

tct tgc ccg etc tgt aac gaa gcg gat gee acc ggc ggc atg etc ctg 432 
Ser Cys Pro Leu Cys Asn Glu Ala Asp Ala Thr Gly Gly Met Leu Leu 
130 135 140 

gac gta cgc aac gag gta acc gee aga ttc egg tat etc ggt gec ggg 480 
Asp Val Arg Asn Glu Val Thr Ala Arg Phe Arg Tyr Leu Gly Ala Gly 
145 150 155 160 

atg gag tac gaa ggc gec gtg gga ggc gac ggg gag cag tgt tgg atg 52 8 
Met Glu Tyr Glu Gly Ala Val Gly Gly Asp Gly Glu Gin Cys Trp Met 
165 170 175 

eta egg ctg gtt gtt tat tac tat ggg egg ttg gtt ggg aac atg gag 576 
Leu Arg Leu Val Val Tyr Tyr Tyr Gly Arg Leu Val Gly Asn Met Glu 
180 185 190 

gtc ggg teg ccc aac ggc gtt cgc ctg ttg ccg gcg cca aag cga cca 624 
Val Gly Ser Pro Asn Gly Val Arg Leu Leu Pro Ala Pro Lys Arg Pro 
195 200 205 

ctg caa gga cac gtt tgt gcg ggc att egg ccg gaa cag gec ctg ttg 672 
Leu Gin Gly His Val Cys Ala Gly He Arg Pro Glu Gin Ala Leu Leu 
210 215 220 



ccg cat acc cca cag gat atg ttt cct cac cag acg age atg eta aag 
Pro His Thr Pro Gin Asp Met Phe Pro His Gin Thr Ser Met Leu Lys 
225 230 235 \ 240 



720 



tgg ctg ggg aag gag ate ata cgc ggg ttg atg att tac gca gac ggg 768 

Trp Leu Gly Lys Glu He He Arg Gly Leu Met He Tyr Ala Asp Gly 
245 250 255 

tct ggg att tac att egg tat atg ggt cac gtt cca gec ttc ctg ctg 816 

Ser Gly He Tyr He Arg Tyr Met Gly His Val Pro Ala Phe Leu Leu 
260 265 270 

ggt aac gga ggt teg ctg gag ccg gtg gat ata att aac aac gcg cga 864 

Gly Asn Gly Gly Ser Leu Glu Pro Val Asp He He Asn Asn Ala Arg 
275 280 285 

gtt ctg cgc gtg ttt teg etc gec caa tat ctg agt gcg gtg tea gec 912 

Val Leu Arg Val Phe Ser Leu Ala Gin Tyr Leu Ser Ala Val Ser Ala 

290 295 300 

acc ccg cca cac gga acg cga ttt cca gec gee tat gcg tct etc cac 960 

Thr Pro Pro His Gly Thr Arg Phe Pro Ala Ala Tyr Ala Ser Leu His 
305 310 315 320 

eta gga ggc gtt ccc act ccg gaa ggc gag ccg tgt ccc aca ate ccc 1008 

217 



i 



Leu Gly Gly Val Pro Thr Pro Glu Gly Glu Pro Cys Pro Thr lie Pro 
325 330 335 



ctg tec att caa att tgg cac gag tgt ctg tgg egg gcg tgc ggg gat 
Leu Ser He Gin He Trp His Glu Cys Leu Trp Arg Ala Cys Gly Asp 
340 345 350 

gcg gec cag tga 
Ala Ala Gin 
355 



<210> 127 
<211> 355 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 127 

Met Ala Glu Gly Arg Ala Gly Ser He Arg Val Asn Arg Pro Ser Gly 

15 10 15 

Leu Arg Ala Trp Leu Leu Asp Cys Cys Asp Asn Asp Lys His Pro Gly 

20 25 30 

Met His Trp Leu Asp Glu Glu Lys Thr Leu Val Arg Leu Pro Trp Asn 

35 40 45 

His Leu Lys Gly Ala Gly Gly Val Ser Asp Asp Glu Arg Asn Met Tyr 

50 55 60 

Leu Asp Tyr Cys Gin Phe Lys Gly He Arg Gin Thr Gly Asn Arg Arg 
65 70 75 80 

Leu Ser Val Arg Glu Cys Lys Asn Trp Leu Ala Ser Ala He Arg His 

85 90 95 

Ser Gin Thr Val Glu Asp Val Ser Thr Glu Glu Asn Leu Ser Ala Pro 

100 105 HO 

Ala Pro Asn Arg Cys Arg Val He Arg Leu Leu Pro He Phe Val Arg 

115 120 125 

Ser Cys Pro Leu Cys Asn Glu Ala Asp Ala Thr Gly Gly Met Leu Leu 

130 135 140 

Asp Val Arg Asn Glu Val Thr Ala Arg Phe Arg Tyr Leu Gly Ala Gly 
145 150 155 160 

Met Glu Tyr Glu Gly Ala Val Gly Gly Asp Gly Glu Gin Cys Trp Met 

165 170 175 

Leu Arg Leu Val Val Tyr Tyr Tyr Gly Arg Leu Val Gly Asn Met Glu 

180 185 190 

Val Gly Ser Pro Asn Gly Val Arg Leu Leu Pro Ala Pro Lys Arg Pro 

195 200 205 

Leu Gin Gly His Val Cys Ala Gly He Arg Pro Glu Gin Ala Leu Leu 

210 215 220 

Pro His Thr Pro Gin Asp Met Phe Pro His Gin Thr Ser Met Leu Lys 
225 230 235 240 

Trp Leu Gly Lys Glu He He Arg Gly Leu Met He Tyr Ala Asp Gly 

245 250 255 

Ser Gly He Tyr He Arg Tyr Met Gly His Val Pro Ala Phe Leu Leu 

260 265 270 

Gly Asn Gly Gly Ser Leu Glu Pro Val Asp He He Asn Asn Ala Arg 

275 280 285 

Val Leu Arg Val Phe Ser Leu Ala Gin Tyr Leu Ser Ala Val Ser Ala 

290 295 300 

Thr Pro Pro His Gly Thr Arg Phe Pro Ala Ala Tyr Ala Ser Leu His 
305 310 315 320 

Leu Gly Gly Val Pro Thr Pro Glu Gly Glu Pro Cys Pro Thr He Pro 

325 330 335 

Leu Ser He Gin He Trp His Glu Cys Leu Trp Arg Ala Cys Gly Asp 

218 



340 



345 



350 



Ala Ala Gin 
355 



<210> 128 
<211> 1095 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (1095) 

<400> 128 

atg acc gag ata gaa ate act cac aat cac ctg cga aga tgg att att 48 
Met Thr Glu lie Glu lie Thr His Asn His Leu Arg Arg Trp lie lie 
1 5 10 15 

tec aat ttg gaa gcg aac acg ttt ccc gaa cac ttg tgt tgg tgc gac 96 
Ser Asn Leu Glu Ala Asn Thr Phe Pro Glu His Leu Cys Trp Cys Asp 
20 25 30 

gag gaa aag agg age ttt agg ata tea tgg cac cga ggt atg agt ggc 144 
Glu Glu Lys Arg Ser Phe Arg lie Ser Trp His Arg Gly Met Ser Gly 
35 40 45 

atg caa cct gtg gtt gcg tac tgc ctg gat agg gat ctg gaa tgt gga 192 
Met Gin Pro Val Val Ala Tyr Cys Leu Asp Arg Asp Leu Glu Cys Gly 
50 55 60 

cgc cag cac aac gtg teg gag tgc egg aaa aga ctg eta agg gtg ctt 24 0 
Arg Gin His Asn Val Ser Glu Cys Arg Lys Arg Leu Leu Arg Val Leu 
65 70 75 80 

egg gaa aac get ggg ttt gaa cag gac gat gec egg gca aca aca acg 2 88 
Arg Glu Asn Ala Gly Phe Glu Gin Asp Asp Ala Arg Ala Thr Thr Thr 
85 90 95 

cgt ttt ggc gga gaa aga ttt ttt tac ttg aga ccc gec gtg gat ccg 336 
Arg Phe Gly Gly Glu Arg Phe Phe Tyr Leu Arg Pro Ala Val Asp Pro 
100 105 110 

ctg tgt tat gec tgc ate tta gat agt cat teg gag acc gtc eta aat 384 
Leu Cys Tyr Ala Cys lie Leu Asp Ser His Ser Glu Thr Val Leu Asn 
115 120 125 

tac tta gag gcg gee tgc gtc cat ggg ctt gag cca ggg acg ccc ctg 432 
Tyr Leu Glu Ala Ala Cys Val His Gly Leu Glu Pro Gly Thr Pro Leu 
130 135 140 

cct cct ccc get ccg gcg gag get gac ggt gcg gcg egg tec gta tat 480 
Pro Pro Pro Ala Pro Ala Glu Ala Asp Gly Ala Ala Arg Ser Val Tyr 
145 150 155 160 

gee agg gec gee agg ttg gcg acg gtt gca cca ccg cat cca gac cag 528 
Ala Arg Ala Ala Arg Leu Ala Thr Val Ala. Pro Pro His Pro Asp Gin 
165 170 175 



ate act cca ttt tgg egg eta cgt ata cga gtt ttt tat ttt ggg teg 576 

219 
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lie Thr Pro Phe Trp Arg Leu Arg lie Arg Val Phe Tyr Phe Gly Ser 
180 185 190 

ctg gta gcg gaa cac acg tec cag gac cgt aga ggg gta cga ttg cat 624 

Leu Val Ala Glu His Thr Ser Gin Asp Arg Arg Gly Val Arg Leu His 
195 200 205 

aag cgc caa gac ccc aaa ccg ggt cac gag tgt ttc tat ggg aca gcg 672 

Lys Arg Gin Asp Pro Lys Pro Gly His Glu Cys Phe Tyr Gly Thr Ala 
210 215 220 

tat aag atg tgg ctt cca aaa cct cag ttg gat ggg ccg eta acg ccg 72 0 

Tyr Lys Met Trp Leu Pro Lys Pro Gin Leu Asp Gly Pro Leu Thr Pro 
225 230 235 240 

gag cag aga gaa acc gtg tgt gag att ata aac ggg tgt gag gag ggc 76 8 

Glu Gin Arg Glu Thr Val Cys Glu lie lie Asn Gly Cys Glu Glu Gly 
245 250 255 

gtc ttt ttg cat ggc aat gag ctg ggg atg tat gtg gat aac aga acc 816 

Val Phe Leu His Gly Asn Glu Leu Gly Met Tyr Val Asp Asn Arg Thr 
260 265 270 

agg cac acg gtt cgc tgc gca ggg aac gac gca gag ggg aac cac gca 864 

Arg His Thr Val Arg Cys Ala Gly Asn Asp Ala Glu Gly Asn His Ala 
275 280 285 

caa egg get gtg cga tec tct gtc aaa tct caa ate ttc tat gtt atg 912 

Gin Arg Ala Val Arg Ser Ser Val Lys Ser Gin lie Phe Tyr Val Met 
290 295 300 

ggt eta ctg cgc aga etc gec egg tea ccc gtt ccc ggc gac act gtt 960 

Gly Leu Leu Arg Arg Leu Ala Arg Ser Pro Val Pro Gly Asp Thr Val 
305 310 315 320 

ccc age aac gca gtc act ctt tac ctt ggg ggt cgc cct ggc tec agt 1008 

Pro Ser Asn Ala Val Thr Leu Tyr Leu Gly Gly Arg Pro Gly Ser Ser 
325 330 335 

aaa aga ccc cag gtc cct gtc act ttg gtg ate tgt cag gat gaa ttg 1056 
Lys Arg Pro Gin Val Pro Val Thr Leu Val lie Cys Gin Asp Glu Leu 
340 345 350 

act cat ggt gac att egg gcg get egg tgg att ttg tag 1095 
Thr His Gly Asp lie Arg Ala Ala Arg Trp lie Leu 

355 360 365 



<210> 129 
<211> 364 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 129 

Met Thr Glu lie Glu lie Thr His Asn His Leu Arg Arg Trp lie lie 

1 5 10 15 

Ser Asn Leu Glu Ala Asn Thr Phe Pro Glu His Leu Cys Trp Cys Asp 

20 25 30 

Glu Glu Lys Arg Ser Phe Arg He Ser Trp His Arg Gly Met Ser Gly 

35 40 45 

Met Gin Pro Val Val Ala Tyr Cys Leu Asp Arg Asp Leu Glu Cys Gly 

220 
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50 55 60 

Arg Gin His Asn Val Ser Glu Cys Arg Lys Arg Leu Leu Arg Val Leu 
65 70 75 80 

Arg Glu Asn Ala Gly Phe Glu Gin Asp Asp Ala Arg Ala Thr Thr Thr 

85 90 ; 95 

Arg Phe Gly Gly Glu Arg Phe Phe Tyr Leu Arg Pro Ala Val Asp Pro 

100 105 110 

Leu Cys Tyr Ala Cys He Leu Asp Ser His Ser Glu Thr Val Leu Asn 

115 120 125 

Tyr Leu Glu Ala Ala Cys Val His Gly Leu Glu Pro Gly Thr Pro Leu 

130 135 140 

Pro Pro Pro Ala Pro Ala Glu Ala Asp Gly Ala Ala Arg Ser Val Tyr 
145 150 155 160 

Ala Arg Ala Ala Arg Leu Ala Thr Val Ala Pro Pro His Pro Asp Gin 

165 170 175 

He Thr Pro Phe Trp Arg Leu Arg He Arg Val Phe Tyr Phe Gly Ser 

180 185 190 

Leu Val Ala Glu His Thr Ser Gin Asp Arg Arg Gly Val Arg Leu His 

195 200 205 

Lys Arg Gin Asp Pro Lys Pro Gly His Glu Cys Phe Tyr Gly Thr Ala 

210 215 220 

Tyr Lys Met Trp Leu Pro Lys Pro Gin Leu Asp Gly Pro Leu Thr Pro 
225 230 235 240 

Glu Gin Arg Glu Thr Val Cys Glu He He Asn Gly Cys Glu Glu Gly 

245 250 255 

Val Phe Leu His Gly Asn Glu Leu Gly Met Tyr Val Asp Asn Arg Thr 

260 265 270 

Arg His Thr Val Arg Cys Ala Gly Asn Asp Ala Glu Gly Asn His Ala 

275 280 285 

Gin Arg Ala Val Arg Ser Ser Val Lys Ser Gin He Phe Tyr Val Met 

290 295 300 

Gly Leu Leu Arg Arg Leu Ala Arg Ser Pro Val Pro Gly Asp Thr Val 
305 310 315 320 

Pro Ser Asn Ala Val Thr Leu Tyr Leu Gly Gly Arg Pro Gly Ser Ser 

325 330 335 

Lys Arg Pro Gin Val Pro Val Thr Leu Val He Cys Gin Asp Glu Leu 

340 345 350 

Thr His Gly Asp He Arg Ala Ala Arg Trp He Leu 
355 360 



<210> 130 
<211> 1083 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . - (1083) 

<400> 130 

atg ggc acg tac acg teg gag gcg tec etc gec tgg eta tct ttt atg 

Met Gly Thr Tyr Thr Ser Glu Ala Ser Leu Ala Trp Leu Ser Phe Met 

15 10 15 

age ggg acg gtc teg get tec ccg ttt att ctg tgt ttt att tat cac 
Ser Gly Thr Val Ser Ala Ser Pro Phe He Leu Cys Phe He Tyr His 
20 25 30 

teg ctg tat ttt gta gag ccc ctg att age gtt gag aac att ata ttc 

221 



48 



96 



144 



Ser Leu Tyr Phe Val Glu Pro Leu lie Ser Val Glu Asn He He Phe 

35 40 45 

tec tgg ggc gcg gtt ggc tta cat ggg ctg ctt ttg eta ttc tgt ata 192 

Ser Trp Gly Ala Val Gly Leu His Gly Leu Leu Leu Leu Phe Cys He 

50 55 60 

ttt ggg eta ccg gec tgg etc teg egg cag gtg gat gta ccg tgc ace 240 

Phe Gly Leu Pro Ala Trp Leu Ser Arg Gin Val Asp Val Pro Cys Thr 

65 70 75 80 

ate teg gcg ttt ctt att acc gec ggt tct atg gca tec ace etc ggc 288 

He Ser Ala Phe Leu lie Thr Ala Gly Ser Met Ala Ser Thr Leu Gly 

85 90 95 

gtt gac ctt cca tgg gtt cac gtt tec att ttt gtg ggg teg tgc ctg 336 

Val Asp Leu Pro Trp Val His Val Ser He Phe Val Gly Ser Cys Leu 

100 105 110 

tgc ctg ctg etc tgt gtt gtt gcg gcg aat gac gtg gta tat ttg tgc 3 84 

Cys Leu Leu Leu Cys Val Val Ala Ala Asn Asp Val Val Tyr Leu Cys 

115 120 125 

ccc acg att gec cac aga tac tac gaa ctt gga ttt tta gca gcg ttc 432 

Pro Thr He Ala His Arg Tyr Tyr Glu Leu Gly Phe Leu Ala Ala Phe 

130 135 140 

teg gtg tat tat ttt tta gtt ctt aag aac ctt ttt tta gec ccg gtg 480 

Ser Val Tyr Tyr Phe Leu Val Leu Lys Asn Leu Phe Leu Ala Pro Val 

145 150 155 160 

ttt ttg ctg ccc ctg gtg gec ttt ata gta ggt ggc gtt tgt tea etc 528 

Phe Leu Leu Pro Leu Val Ala Phe He Val Gly Gly Val Cys Ser Leu 

165 170 175 

aga gcg ctg egg tea cat ccg ctt tat gag gcg ggc ctg cag cga cgc 576 

Arg Ala Leu Arg Ser His Pro Leu Tyr Glu Ala Gly Leu Gin Arg Arg 

180 185 190 

cat gca att ttt teg etc acc tct agg egg tac ate aca tac tct ata 624 

His Ala He Phe Ser Leu Thr Ser Arg Arg Tyr He Thr Tyr Ser He 

195 200 205 

aag caa gcg ctt gag gtg tgt gga tgg gac ttt tat etc gtg act gtt 672 

Lys Gin Ala Leu Glu Val Cys Gly Trp Asp Phe Tyr Leu Val Thr Val 

210 215 220 

ttg att gga ggg gcg gec gcg ggt acg ctg tct gtt ggc ctg acg acg 720 

Leu He Gly Gly Ala Ala Ala Gly Thr Leu Ser Val Gly Leu Thr Thr 

225 230 235 240 

cct ctg etc etc gga ctt gta cac tat ttt ttt gtc ttc cac gtt ggg 768 

Pro Leu Leu Leu Gly Leu Val His Tyr Phe Phe Val Phe His Val Gly 

245 250 255 

ctt ttt tgt tgc etc gga etc gtg ttt egg tea aac gtt tta gee ctg 816 

Leu Phe Cys Cys Leu Gly Leu Val Phe Arg Ser Asn Val Leu Ala Leu 

260 265 ' 270 



gtt tat gtt tta gca gec get gtg ctg ctg acg etc acg cac gtc ttg 
Val Tyr Val Leu Ala Ala Ala Val Leu Leu Thr Leu Thr His Val Leu 



864 



222 



275 



280 



285 



ggg cca gga acg cat aat ttg ttc acc aga gtg tgt gtg ttt acg gtt 

Gly Pro Gly Thr His Asn Leu Phe Thr Arg Val Cys Val Phe Thr Val 
290 295 300 

ttt tta ttg act atg ttt ggg gcg att gga tgc gaa tta caa ata ate 

Phe Leu Leu Thr Met Phe Gly Ala lie Gly Cys Glu Leu Gin lie lie 
305 310 315 320 

aga aaa aaa eta cag cgt gec gcg aac teg cca aga ata gtc ttg ggg 

Arg Lys Lys Leu Gin Arg Ala Ala Asn Ser Pro Arg lie Val Leu Gly 
325 330 335 

gtg tgt gee tgc gga aac ctt ctg atg gcg gtg gtt ttt ttc tec tta 

Val Cys Ala Cys Gly Asn Leu Leu Met Ala Val Val Phe Phe Ser Leu 
340 345 350 

aat aaa gtt gag ctt ggt gec ctt taa 

Asn Lys Val Glu Leu Gly Ala Leu 

355 360 



<210> 131 
<211> 360 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 131 

Met Gly Thr Tyr Thr Ser Glu Ala Ser Leu Ala Trp Leu Ser Phe Met 

1 5 10 15 

Ser Gly Thr Val Ser Ala Ser Pro Phe lie Leu Cys Phe lie Tyr His 

20 25 30 

Ser Leu Tyr Phe Val Glu Pro Leu lie Ser Val Glu Asn lie lie Phe 

35 40 45 

Ser Trp Gly Ala Val Gly Leu His Gly Leu Leu Leu Leu Phe Cys He 

50 55 60 

Phe Gly Leu Pro Ala Trp Leu Ser Arg Gin Val Asp Val Pro Cys Thr 
65 70 75 80 

He Ser Ala Phe Leu He Thr Ala Gly Ser Met Ala Ser Thr Leu Gly 

85 90 95 

Val Asp Leu Pro Trp Val His Val Ser He Phe Val Gly Ser Cys Leu 

100 105 HO 

Cys Leu Leu Leu Cys Val Val Ala Ala Asn Asp Val Val Tyr Leu Cys 

115 120 125 

Pro Thr He Ala His Arg Tyr Tyr Glu Leu Gly Phe Leu Ala Ala Phe 

130 135 140 

Ser Val Tyr Tyr Phe Leu Val Leu Lys Asn Leu Phe Leu Ala Pro Val 
145 150 155 160 

Phe Leu Leu Pro Leu Val Ala Phe He Val Gly Gly Val Cys Ser Leu 

165 170 175 

Arg Ala Leu Arg Ser His Pro Leu Tyr Glu Ala Gly Leu Gin Arg Arg 

180 185 190 

His Ala He Phe Ser Leu Thr Ser Arg Arg Tyr He Thr Tyr Ser He 

195 200 205 

Lys Gin Ala Leu Glu Val Cys Gly Trp Asp Phe Tyr Leu Val Thr Val 

210 215 220 

Leu He Gly Gly Ala Ala Ala Gly Thr Leu Ser Val Gly Leu Thr Thr 
225 230 235 240 

Pro Leu Leu Leu Gly Leu Val His Tyr Phe Phe Val Phe His Val Gly 
245 250 255 
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Leu Phe Cys Cys 
260 

Val Tyr Val Leu 
275 

Gly Pro Gly Thr 
290 

Phe Leu Leu Thr 
305 

Arg Lys Lys Leu 

Val Cys Ala Cys 
340 

Asn Lys Val Glu 
355 



Leu Gly Leu Val 

Ala Ala Ala Val 
280 

His Asn Leu Phe 
295 

Met Phe Gly Ala 
310 

Gin Arg Ala Ala 
325 

Gly Asn Leu Leu 

Leu Gly Ala Leu 
360 



Phe Arg Ser Asn 
265 

Leu Leu Thr Leu 

Thr Arg Val Cys 
300 

lie Gly Cys Glu 
315 

Asn Ser Pro Arg 
330 

Met Ala Val Val 
345 



Val Leu Ala Leu 
270 

Thr His Val Leu 
285 

Val Phe Thr Val 

Leu Gin lie lie 
320 

lie Val Leu Gly 
335 

Phe Phe Ser Leu 
350 



<210> 132 
<211> 1185 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) - - (1185) 

<400> 132 

atg cct gtc teg ttt cat tac ggc gec agg gtg gac gtc gat gcg ctt 48 

Met Pro Val Ser Phe His Tyr Gly Ala Arg Val Asp Val Asp Ala Leu 

15 10 15 

ggc age att age agg gtt tac gac cac att aag ggc ate gtt aag aaa 96 
Gly Ser He Ser Arg Val Tyr Asp His He Lys Gly lie Val Lys Lys 
20 25 30 

ggc gtt ata cag ate age ggc cag ggc agg gec ccc gtc ctt age gtg 144 
Gly Val He Gin He Ser Gly Gin Gly Arg Ala Pro Val Leu Ser Val 
35 40 45 

etc tec age gtg ggg gac gcg ggc gtt etc ggt etc agg ctt aaa aat 192 
Leu Ser Ser Val Gly Asp Ala Gly Val Leu Gly Leu Arg Leu Lys Asn 
50 55 60 

gca ctg gcg ccg ctg atg gta tat tct gac atg acg gac gag gtt age 240 
Ala Leu Ala Pro Leu Met Val Tyr Ser Asp Met Thr Asp Glu Val Ser 
65 70 75 80 

ttt age ttt cga aac acc tec ctt ggg aac acg ttc aca cac acc cgt 288 
Phe Ser Phe Arg Asn Thr Ser Leu Gly Asn Thr Phe Thr His Thr Arg 
85 90 95 



gag atg ttt gga gtt aac att gcg gaa atg aac gtg gcg ttc tac cac 
Glu Met Phe Gly Val Asn He Ala Glu Met Asn Val Ala Phe Tyr His 
100 105 110 



336 



cac ggg gac gag teg gac gcg gaa gga aag ccg cag ttt gtt agg acg 3 84 

His Gly Asp Glu Ser Asp Ala Glu Gly Lys Pro Gin Phe Val Arg Thr 

115 120 125 

acc ate gec tac ggg gac aat cac acg age acc gtg cac aag age gtc 432 

Thr He Ala Tyr Gly Asp Asn His Thr Ser Thr Val His Lys Ser Val 



224 



130 135 140 

gtg gac gaa ccc aac eta ccc teg ttc cac gac agg eta gag cag gee 4 80 

Val Asp Glu Pro Asn Leu Pro Ser Phe His Asp Arg Leu Glu Gin Ala 

145 150 155 160 



ggg acg ggg aat cgc ctg ttc ctt acg gtt aag acg ctg act ttg ttg 
Gly Thr Gly Asn Arg Leu Phe Leu Thr Val Lys Thr Leu Thr Leu Leu 



165 170 175 



528 



tta aag tgg eta agg cag cag aaa acc agg gec aag cag gtg gtc acc 576 

Leu Lys Trp Leu Arg Gin Gin Lys Thr Arg Ala Lys Gin Val Val Thr 

180 185 190 

gtc teg tta age gaa acg ctg gca gtt gec acg ttt acg gtc gac ggg 624 

Val Ser Leu Ser Glu Thr Leu Ala Val Ala Thr Phe Thr Val Asp Gly 

195 200 205 

gtt agt aaa att att gat ttt aaa ccc gac acc ccg gac gec aag tgg 672 

Val Ser Lys lie lie Asp Phe Lys Pro Asp Thr Pro Asp Ala Lys Trp 

210 215 220 

act tgc get aga gga aga aag ctg gac gtg ggt gtg gtt tct age gac 72 0 

Thr Cys Ala Arg Gly Arg Lys Leu Asp Val Gly Val Val Ser Ser Asp 

225 230 235 240 

ctg acg act cat gtg agt tta gag tct ctg gtg gec gca ttg aac gca 768 

Leu Thr Thr His Val Ser Leu Glu Ser Leu Val Ala Ala Leu Asn Ala 
245 250 255 

tgt aag att ccg gga ttt ttt ttg cct ggt ttc cgc tgg cac gec aat 816 

Cys Lys He Pro Gly Phe Phe Leu Pro Gly Phe Arg Trp His Ala Asn 

260 265 270 

gag att ctg gaa gtt gag ggc ctg ccg ctg aca gat age ctt get gac 864 

Glu He Leu Glu Val Glu Gly Leu Pro Leu Thr Asp Ser Leu Ala Asp 

275 280 285 

gtc agg ttg ggc gtc atg eta tta aag gta gat ccg acg gac egg aat 912 

Val Arg Leu Gly Val Met Leu Leu Lys Val Asp Pro Thr Asp Arg Asn 

290 295 300 

aat gee gtc ccg ggc aat etc teg gaa ggt gee gat ccc gag ggg gtt 960 

Asn Ala Val Pro Gly Asn Leu Ser Glu Gly Ala Asp Pro Glu Gly Val 

305 310 315 320 

cct gag etc ccg tec cct ccg aga acg cct gac ctg gac tta aaa gag 1008 

Pro Glu Leu Pro Ser Pro Pro Arg Thr Pro Asp Leu Asp Leu Lys Glu 
325 330 335 

cag tgc gtc cca att gec gag gac ggt gec gag ccg aca gac ggg ggc 1056 

Gin Cys Val Pro He Ala Glu Asp Gly Ala Glu Pro Thr Asp Gly Gly 

340 345 35 0 

gec aaa tea ctg egg acg age ggc tct cga ccg gaa aaa aaa cac ggc 1104 

Ala Lys Ser Leu Arg Thr Ser Gly Ser Arg Pro Glu Lys Lys His Gly 

355 360 365 

aag cga aaa cac age age tea ccc agt cgc ggg aaa gga aaa acc aag 1152 

Lys Arg Lys His Ser Ser Ser Pro Ser Arg Gly Lys Gly Lys Thr Lys 

370 375 380 



225 



acg cct cgc gcg acg ttc aac ccg ttg ttt taa 

Thr Pro Arg Ala Thr Phe Asn Pro Leu Phe 

385 390 395 



<210> 133 
<211> 394 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 



<400> 133 



Met 


Pro 


Val 


Ser 


Phe 


His 


Tyr 


Gly 


Ala 


Arg 


Val 


Asp 


Val 


Asp 


Ala 


Leu 


1 








5 










10 










15 




Gly 


Ser 


He 


Ser 


Arg 


Val 


Tyr 


Asp 


His 


He 


Lys 


Gly 


He 


Val 


Lys 


Lys 






20 










25 










30 






Gly 


Val 


He 


Gin 


He 


Ser 


Gly 


Gin 


Gly 


Arg 


Ala 


Pro 


Val 


Leu 


Ser 


Val 




35 










40 










45 








Leu 


Ser 


Ser 


Val 


Gly 


Asp 


Ala 


Gly 


Val 


Leu 


Gly 


Leu 


Arg 


Leu 


Lys 


Asn 




50 










55 










60 










Ala 


Leu 


Ala 


Pro 


Leu 


Met 


Val 


Tyr 


Ser 


Asp 


Met 


Thr 


Asp 


Glu 


Val 


Ser 


65 










70 










75 










80 


Phe 


Ser 


Phe 


Arg 


Asn 


Thr 


Ser 


Leu 


Gly 


Asn 


Thr 


Phe 


Thr 


His 


Thr 


Arg 










85 










90 










95 




Glu 


Met 


Phe 


Gly 


Val 


Asn 


He 


Ala 


Glu 


Met 


Asn 


Val 


Ala 


Phe 


Tyr 


His 








100 










105 










110 






His 


Gly 


Asp 


Glu 


Ser 


Asp 


Ala 


Glu 


Gly 


Lys 


Pro 


Gin 


Phe 


Val 


Arg 


Thr 






115 










120 










125 








Thr 


He 


Ala 


Tyr 


Gly 


Asp 


Asn 


His 


Thr 


Ser 


Thr 


Val 


His 


Lys 


Ser 


Val 




130 










135 










140 










Val 


Asp 


Glu 


Pro 


Asn 


Leu 


Pro 


Ser 


Phe 


His 


Asp 


Arg 


Leu 


Glu 


Gin 


Ala 


145 










150 










155 










160 


Gly 


Thr 


Gly 


Asn 


Arg 


Leu 


Phe 


Leu 


Thr 


Val 


Lys 


Thr 


Leu 


Thr 


Leu 


Leu 










165 










170 










175 




Leu 


Lys 


Trp 


Leu 


Arg 


Gin 


Gin 


Lys 


Thr 


Arg 


Ala 


Lys 


Gin 


Val 


Val 


Thr 








180 










185 










190. 






Val 


Ser 


Leu 


Ser 


Glu 


Thr 


Leu 


Ala 


Val 


Ala 


Thr 


Phe 


Thr 


Val 


Asp 


Gly 






195 










200 










205 








Val 


Ser 


Lys 


He 


lie 


Asp 


Phe 


Lys 


Pro 


Asp 


Thr 


Pro 


Asp 


Ala 


Lys 


Trp 




210 










215 










2 2 0 










Thr 


Cys 


Ala 


Arg 


Gly 


Arg 


Lys 


Leu 


Asp 


Val 


Gly 


Val 


Val 


Ser 


Ser 


Asp 


225 








230 










235 










240 


Leu 


Thr 


Thr 


His 


Val 


Ser 


Leu 


Glu 


Ser 


Leu 


Val 


Ala 


Ala 


Leu 


Asn 


Ala 










245 










250 










255 




Cys 


Lys 


He 


Pro 


Gly 


Phe 


Phe 


Leu 


Pro 


Gly 


Phe 


Arg 


Trp 


His 


Ala 


Asn 






260 










265 










270 






Glu 


He 


Leu 


Glu 


Val 


Glu 


Gly 


Leu 


Pro 


Leu 


Thr 


Asp 


Ser 


Leu 


Ala 


Asp 






275 










280 










285 








Val 


Arg 


Leu 


Gly 


Val 


Met 


Leu 


Leu 


Lys 


Val 


Asp 


Pro 


Thr 


Asp 


Arg 


Asn 




290 










295 










300 










Asn 


Ala 


Val 


Pro 


Gly 


Asn 


Leu 


Ser 


Glu 


Gly 


Ala 


Asp 


Pro 


Glu 


Gly 


Val 


305 










310 










315 










320 


Pro 


Glu 


Leu 


Pro 


Ser 


Pro 


Pro 


Arg 


Thr 


Pro 


Asp 


Leu 


Asp 


Leu 


Lys 


Glu 










325 










330 










335 




Gin 


Cys 


Val 


Pro 


He 


Ala 


Glu 


Asp 


Gly 


Ala 


Glu 


Pro 


Thr 


Asp 


Gly 


Gly 






340 










345 










350 






Ala 


Lys 


Ser 


Leu 


Arg 


Thr 


Ser 


Gly 


Ser 


Arg 


Pro 


Glu 


Lys 


Lys 


His 


Gly 






355 










360 










365 








Lys 


Arg 


Lys 


His 


Ser 


Ser 


Ser 


Pro 


Ser 


Arg 


Gly 


Lys 


Gly 


Lys 


Thr 


Lys 




370 










375 










380 










Thr 


Pro 


Arg 


Ala 


Thr 


Phe 


Asn 


Pro 


Leu 


Phe 















385 390 



<210> 134 
<211> 945 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) - . (945) 

<400> 134 

atg ttt ggc ctg tea ata gta acc gec gec atg gag tec cca gat egg 

Met Phe Gly Leu Ser lie Val Thr Ala Ala Met Glu Ser Pro Asp Arg 

15 10 15 

ttt ctg tac gcg age gac cac cct ggc ttt tta gca etc acc cag gag 
Phe Leu Tyr Ala Ser Asp His Pro Gly Phe Leu Ala Leu Thr Gin Glu 
20 25 30 

aca tgg cag aac aga tgg ttc ccc agt caa att tec ctg cac gag gac 
Thr Trp Gin Asn Arg Trp Phe Pro Ser Gin lie Ser Leu His Glu Asp 
35 40 45 



ate gaa gac eta gtt aaa gag ttc age aat cac gac gtg acc cac tat 
He Glu Asp Leu Val Lys Glu Phe Ser Asn His Asp Val Thr His Tyr 
85 90 95 



ctg ctg ttc ctg gtt ate gaa ggg ata tac ttt ata agt tea ttt tac 
Leu Leu Phe Leu Val He Glu Gly He Tyr Phe He Ser Ser Phe Tyr 
165 170 175 



48 



96 



144 



tec gat gag gtg cga eta ttg agt ccc acc gat agg gag ttt tac cag 192 

Ser Asp Glu Val Arg Leu Leu Ser Pro Thr Asp Arg Glu Phe Tyr Gin 
50 55 60 

ttt etc ttc aca ttt ttg ggt atg gca gag agt ctg gta aac ttt aac 240 

Phe Leu Phe Thr Phe Leu Gly Met Ala Glu Ser Leu Val Asn Phe Asn 

65 70 75 80 



288 



tac gec gaa cag gta gec atg gaa aac ate cac ggc aaa gtg tat gec ' 336 

Tyr Ala Glu Gin Val Ala Met Glu Asn He His Gly Lys Val Tyr Ala 
100 105 110 

aac ata ctg aac tta ttt ttt ggc ggt aat egg ggg gac tta atg att 384 

Asn He Leu Asn Leu Phe Phe Gly Gly Asn Arg Gly Asp Leu Met He 
115 120 125 

tat gec aaa aaa ata gtc gag gac get acc ctg gcg aag aag att gac 432 

Tyr Ala Lys Lys He Val Glu Asp Ala Thr Leu Ala Lys Lys He Asp 
130 135 140 

tgg ctg cat tec cga gtg cgc aag gec acc acg egg gec gag aag gtg 480 

Trp Leu His Ser Arg Val Arg Lys Ala Thr Thr Arg Ala Glu Lys Val 

145 150 155 160 



528 



tct ata gga ctt ttt cgc ctt cgc gga ate atg cgc ggg gtg tgc ctg 576 
Ser He Gly Leu Phe Arg Leu Arg Gly He Met Arg Gly Val Cys Leu 
180 185 190 



227 
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gaa tat gtc aca tec gtt cat aat gat ctg tag 
Glu Tyr Val Thr Ser Val His Asn Asp Leu 
305 310 315 



get aac gac tac ate tec agg gac gag ctg eta cac ace agg gee gec 624 

Ala Asn Asp Tyr lie Ser Arg Asp Glu Leu Leu His Thr Arg Ala Ala 
195 200 205 

age ttg ctg tac aac acc atg ate tec cga gac gaa tec ccc tec gtc 672 

Ser Leu Leu Tyr Asn Thr Met lie Ser Arg Asp Glu Ser Pro Ser Val 
210 215 220 

gca tac ate cac ggc etc ttt cgc gaa gec gtg gag ata gag act ctg 720 

Ala Tyr He His Gly Leu Phe Arg Glu Ala Val Glu He Glu Thr Leu 
225 230 235 240 

ttt att cgt tec aag age cga gac gtg acc atg gtg aac gtc gga gac 76 8 

Phe He Arg Ser Lys Ser Arg Asp Val Thr Met Val Asn Val Gly Asp 
245 250 255 

ate gag cag ttt ttg cag gec acg gec gac agg att eta aag tea ata 816 

He Glu Gin Phe Leu Gin Ala Thr Ala Asp Arg He Leu Lys Ser He 
260 265 270 

aac ata ccg ccc ctg ttc ggt gec cgc ccc cca aac gcg tgt ccc ttg 864 

Asn He Pro Pro Leu Phe Gly Ala Arg Pro Pro Asn Ala Cys Pro Leu 
275 280 285 

teg tat acg agt gec aag agt gtg aac ttt ttt gag cgt gat aat tct 912 

Ser Tyr Thr Ser Ala Lys Ser Val Asn Phe Phe Glu Arg Asp Asn Ser 
290 295 300 



945 



<210> 135 
<211> 314 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 135 

Met Phe Gly Leu Ser He Val Thr Ala Ala Met Glu Ser Pro Asp Arg 

1 5 10 15 

Phe Leu Tyr Ala Ser Asp His Pro Gly Phe Leu Ala Leu Thr Gin Glu 

20 25 30 

Thr Trp Gin Asn Arg Trp Phe Pro Ser Gin He Ser Leu His Glu Asp 

35 40 45 

Ser Asp Glu Val Arg Leu Leu Ser Pro Thr Asp Arg Glu Phe Tyr Gin 

50 55 60 

Phe Leu Phe Thr Phe Leu Gly Met Ala Glu Ser Leu Val Asn Phe Asn 
65 70 75 80 

He Glu Asp Leu Val Lys Glu Phe Ser Asn His Asp Val Thr His Tyr 

85 90 95 

Tyr Ala Glu Gin Val Ala Met Glu Asn He His Gly Lys Val Tyr Ala 

100 105 HO 

Asn He Leu Asn Leu Phe Phe Gly Gly Asn Arg Gly Asp Leu Met He 

115 120 125 

Tyr Ala Lys Lys He Val Glu Asp Ala Thr Leu Ala Lys Lys He Asp 

130 135 140 

Trp Leu His Ser Arg Val Arg Lys Ala Thr Thr Arg Ala Glu Lys Val 
145 150 155 160 

Leu Leu Phe Leu Val He Glu Gly He Tyr Phe He Ser Ser Phe Tyr 



228 



Ser He Gly Leu 
180 

Ala Asn Asp Tyr 
195 

Ser Leu Leu Tyr 
210 

Ala Tyr He His 
225 

Phe He Arg Ser 

He Glu Gin Phe 
260 

Asn He Pro Pro 
275 

Ser Tyr Thr Ser 
290 

Glu Tyr Val Thr 
305 



165 

Phe Arg Leu Arg 

He Ser Arg Asp 
200 

Asn Thr Met He 
215 

Gly Leu Phe Arg 
230 

Lys Ser Arg Asp 
245 

Leu Gin Ala Thr 

Leu Phe Gly Ala 
280 

Ala Lys Ser Val 
295 

Ser Val His Asn 
310 



170 

Gly He Met Arg 
185 

Glu Leu Leu His 

Ser Arg Asp Glu 
220 

Glu Ala Val Glu 
235 

Val Thr Met Val 
250 

Ala Asp Arg He 
265 

Arg Pro Pro Asn 

Asn Phe Phe Glu 
300 

Asp Leu 



175 

Gly Val Cys Leu 
190 

Thr Arg Ala Ala 
205 

Ser Pro Ser Val 

He Glu Thr Leu 
240 

Asn Val Gly Asp 
255 

Leu Lys Ser He 
270 

Ala Cys Pro Leu 
285 

Arg Asp Asn Ser 



<210> 136 
<211> 2367 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) - . (2367) 

<400> 136 

atg aac acc gaa act tct ttc tct gcg gca aaa tct gcc aaa cct tta 

Met Asn Thr Glu Thr Ser Phe Ser Ala Ala Lys Ser Ala Lys Pro Leu 
1 5 10 15 

acc eta gtt acc gac gcg gga act ggg ggg tgt age tea age ctt gac 
Thr Leu Val Thr Asp Ala Gly Thr Gly Gly Cys Ser Ser Ser Leu Asp 
20 25 30 

cct gaa cga tgc gcc gaa tea etc gtg aat tct tta aag gca aca ctt 
Pro Glu Arg Cys Ala Glu Ser Leu Val Asn Ser Leu Lys Ala Thr Leu 
35 40 45 

gga tgg gac ata gag gcc aac tea eta acc ggt etc ctt tgg cac aga 
Gly Trp Asp He Glu Ala Asn Ser Leu Thr Gly Leu Leu Trp His Arg 
50 55 60 

ata atg gag gac agg tgc ctg gtt acg gtg cga gac tat etc gcg gta 
He Met Glu Asp Arg Cys Leu Val Thr Val Arg Asp Tyr Leu Ala Val 
65 70 75 80 

ttc ggg gaa agg eta age gac gag gtt cgc gcg ttt atg tea aaa cac 
Phe Gly Glu Arg Leu Ser Asp Glu Val Arg Ala Phe Met Ser Lys His 
85 90 95 

gaa gcg gcc ctt gac ggt ctt tta cag gac ttt aag cag age aag gca 
Glu Ala Ala Leu Asp Gly Leu Leu Gin Asp Phe Lys Gin Ser Lys Ala 
100 105 HO 



tac aca aat ttc gtt aac tgc ggc tac ctg tct gcc gtt agg ttt tac 
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atg gcg acc gaa gac aaa acc etc aaa gca ctt cat gag gaa atg tec 
Met Ala Thr Glu Asp Lys Thr Leu Lys Ala Leu His Glu Glu Met Ser 
225 230 235 240 

ccc ctt etc get age aag tea ggc gtg ggg ata gat gtc teg tct ttt 
Pro Leu Leu Ala Ser Lys Ser Gly Val Gly lie Asp Val Ser Ser Phe 
245 250 255 

gcg gaa cac aag aac ate acc age tgt tta aaa tta ata aac gec cac 
Ala Glu His Lys Asn lie Thr Ser Cys Leu Lys Leu lie Asn Ala His 
260 265 270 

g tt ggg tat /ttc aac gat aac aat ate cgt cct gtt ggt gee age gec 
Val Gly Tyr Phe Asn Asp Asn Asn lie Arg Pro Val Gly Ala Ser Ala 
275 280 285 

tac atg gaa eta tgg cat cat cag att tgt gat ttt ttg aac gcg aag 
Tyr Met Glu Leu Trp His His Gin lie Cys Asp Phe Leu Asn Ala Lys 
290 295 300 

atg ccg gag aat cag gag egg tgt cac aac ctg ttt cag ggc gtg tgt 
Met Pro Glu Asn Gin Glu Arg Cys His Asn Leu Phe Gin Gly Val Cys 
305 310 315 320 

gtg ccg gag ctt ttt ttc cga ctg tac gaa aca aac cca gat ggc cag 
Val Pro Glu Leu Phe Phe Arg Leu Tyr Glu Thr Asn Pro Asp Gly Gin 
325 330 335 

tgg cac ctg ttt gcg cca gag gtg gcg cca aat etc eta aaa etc tac 
Trp His Leu Phe Ala Pro Glu Val Ala Pro Asn Leu Leu Lys Leu Tyr 
340 345 350 



432 



480 



Tyr Thr Asn Phe Val Asn Cys Gly Tyr Leu Ser Ala Val Arg Phe Tyr 
115 120 125 

gat acg tac gtg ctg aga acc cag ggt tct tea ccc att ttt gaa age 
Asp Thr Tyr Val Leu Arg Thr Gin Gly Ser Ser Pro lie Phe Glu Ser 
130 135 140 

gtg gec cag atg ttt atg agg gtg get gtt ttt gtg gee tgt cag tgt 
Val Ala Gin Met Phe Met Arg Val Ala Val Phe Val Ala Cys Gin Cys 
145 150 155 160 

ata aaa ttt ccg tgt ttg egg gag acc etc aga cac ctg gtg gag age 528 
Xl e Ly S phe Pro Cys Leu Arg Glu Thr Leu Arg His Leu Val Glu Ser 
165 170 175 

gaa acg gag ctg gat gag atg tac ctg gtt ggt tac gcg ttc cat tac 576 
Glu Thr Glu Leu Asp Glu Met Tyr Leu Val Gly Tyr Ala Phe His Tyr 
180 185 190 

ate teg tea cag att gtg tgc tgc gca acg cct gta tta agg tea gec 624 
lie Ser Ser Gin lie Val Cys Cys Ala Thr Pro Val Leu Arg Ser Ala 
195 200 205 

ggg ctt cgc gga ggt cag ctt tct age tgt ttt att tta aag ccg tec 672 
Gly Leu Arg Gly Gly Gin Leu Ser Ser Cys Phe lie Leu Lys Pro Ser 
210 215 220 



720 



768 



816 



864 



912 



960 



1008 



1056 



ggt gec gaa ttt gag att gaa tac aac aga eta gtg gec get ggc aaa 1104 
Gly Ala Glu Phe Glu lie Glu Tyr Asn Arg Leu Val Ala Ala Gly Lys 



230 



i ll :v$ o o «i »o ::r;ni o in: 



355 



360 



365 



cac tct age teg ttg ccg tta aaa teg atg atg tat gee ctt att aac 
His Ser Ser Ser Leu Pro Leu Lys Ser Met Met Tyr Ala Leu lie Asn 
370 375 380 



1152 



act gtc att aag acc ggc agt ccc tac gtc ctg eta aag gag gcg etc 
Thr Val lie Lys Thr Gly Ser Pro Tyr Val Leu Leu Lys Glu Ala Leu 
385 390 395 400 



1200 



aac aaa cac cac tgg tgt gag acg caa ggc age gec ata aac tgt tct 
Asn Lys His His Trp Cys Glu Thr Gin Gly Ser Ala lie Asn Cys Ser 
405 410 415 



1248 



aat etc tgt gcg gaa ate gtg cag caa ccc gag ggt cag get tec gtg 
Asn Leu Cys Ala Glu lie Val Gin Gin Pro Glu Gly Gin Ala Ser Val 
420 425 430 



1296 



tgt aac ttg gca aac att age eta cca aag tgc etc aga ccc cac agg 
Cys Asn Leu Ala Asn lie Ser Leu Pro Lys Cys Leu Arg Pro His Arg 
435 440 445 



1344 



gga gag tct ggt gtg gag cct ggg aag ggg gat gta aca ttt ggt ttt 
Gly Glu Ser Gly Val Glu Pro Gly Lys Gly Asp Val Thr Phe Gly Phe 
450 455 460 



1392 



gaa ctg etc gat gat gec gtg gag gec gca gtt ate att gtc aat gcg 
Glu Leu Leu Asp Asp Ala Val Glu Ala Ala Val lie lie Val Asn Ala 
465 470 475 480 



1440 



tgc att ttg ggc ggg acg gcg. cct acc gag age gtt agg cgc ggc cag 
Cys lie Leu Gly Gly Thr Ala Pro Thr Glu Ser Val Arg Arg Gly Gin 
485 490 495 



1488 



gag gaa cga tea atg ggc att ggc gtc caa gga ctg gec gac gtg ttt 
Glu Glu Arg Ser Met Gly lie Gly Val Gin Gly Leu Ala Asp Val Phe 
500 505 510 



1536 



gec gaa ctg ggc ttt ggc tac ctt gac gee gag age gec aag ctg gac 
Ala Glu Leu Gly Phe Gly Tyr Leu Asp Ala Glu Ser Ala Lys Leu Asp 
515 520 525 



1584 



gtg gag att ttt cag gca atg tat ttt acg gcg gtg caa acc agt cat 
Val Glu lie Phe Gin Ala Met Tyr Phe Thr Ala Val Gin Thr Ser His 
530 535 540 



1632 



gag att gtt tta etc ggc gag ggg act ccg ttt cgt gga tgg gag egg 
Glu lie Val Leu Leu Gly Glu Gly Thr Pro Phe Arg Gly Trp Glu Arg 
545 550 555 560 



1680 



agt cga ttg get cag ggt gtg ttt cac tgg caa acg tgg gac ggc gtt 
Ser Arg Leu Ala Gin Gly Val Phe His Trp Gin Thr Trp Asp Gly Val 
565 570 575 



1728 



aag ccg tec cac cct ccc ttg gag aga tgg gag cag ctg ggg cgc age 
Lys Pro Ser His Pro Pro Leu Glu Arg Trp Glu Gin Leu Gly Arg Ser 
580 585 590 



1776 



att gee cag cat gga att ttc aac age cag ttt ttg gcg ctg atg cca 
lie Ala Gin His Gly He Phe Asn Ser Gin Phe Leu Ala Leu Met Pro 
595 600 605 



1824 



231 
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acg gcc gga acc tct cag ctt acg ggg tac acc gaa gca ttt tat ccg 1872 

Thr Ala Gly Thr Ser Gin Leu Thr Gly Tyr Thr Glu Ala Phe Tyr Pro 

610 615 620 

ttc ttt gcc aat ate gca teg aag gtc acc age aaa gaa gag ate ttg 1920 

Phe Phe Ala Asn He Ala Ser Lys Val Thr Ser Lys Glu Glu He Leu 

625 630 635 640 



aag cca aac gtg act ttt ttt aag aga gtg aag cct ggt gac ctg cga 
Lys Pro Asn Val Thr Phe Phe Lys Arg Val Lys Pro Gly Asp Leu Arg 
645 650 655 



cag teg ctt aat ttc ttt tta aag gag gag cag gcc acg egg gcg agt 

Gin Ser Leu Asn Phe Phe Leu Lys Glu Glu Gin Ala Thr Arg Ala Ser 

705 710 715 720 

tat att cga gac ttg etc ctg acc gga tac agg eta ggg tta aaa acc 

Tyr He Arg Asp Leu Leu Leu Thr Gly Tyr Arg Leu Gly Leu Lys Thr 

725 730 735 

atg ctg tat tat tgt cga att cag aaa cag aca aaa ctt aat gcc ctg 

Met Leu Tyr Tyr Cys Arg lie Gin Lys Gin Thr Lys Leu Asn Ala Leu 

740 745 750 



1968 



acg gtg cgc cgt tac gga ggt gac gtg get tea ttt cca gaa ccc ctg 2016 

Thr Val Arg Arg Tyr Gly Gly Asp Val Ala Ser Phe Pro Glu Pro Leu 

660 665 670 

aag gac aga tac aag att ttt ttg acg gcc ttt gat tac tgc cca att 2064 

Lys Asp Arg Tyr Lys He Phe Leu Thr Ala Phe Asp Tyr Cys Pro He 

675 680 685 

aag cag ttg gag egg gcc ggt gcg egg get cca ttt gta gac cag agt 2112 

Lys Gin Leu Glu Arg Ala Gly Ala Arg Ala Pro Phe Val Asp Gin Ser 

690 695 700 



2160 



2208 



2256 



cag tgc tta gat cag gtc gtg ggt gac aac ate tct tea gag ggg gcg 2304 
Gin Cys Leu Asp Gin Val Val Gly Asp Asn He Ser Ser Glu Gly Ala 
755 760 765 

gag tct aac tgc gta caa aag gcc gac ggt gag egg acc aag gta tgt 2352 
Glu Ser Asn Cys Val Gin Lys Ala Asp Gly Glu Arg Thr Lys Val Cys 
770 775 780 

ttg gcc tgt caa tag 2367 

Leu Ala Cys Gin 

785 



<210> 137 
<211> 788 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 137 

Met Asn Thr Glu Thr Ser Phe Ser Ala Ala Lys Ser Ala Lys Pro Leu 

15 10 15 

Thr Leu Val Thr Asp Ala Gly Thr Gly Gly Cys Ser Ser Ser Leu Asp 

20 25 30 

Pro Glu Arg Cys Ala Glu Ser Leu Val Asn Ser Leu Lys Ala Thr Leu 

232 



35 40 45 

Gly Trp Asp lie Glu Ala Asn Ser Leu Thr Gly Leu Leu Trp His Arg 

50 55 60 

lie Met Glu Asp Arg Cys Leu Val Thr Val Arg Asp Tyr Leu Ala Val 
65 70 75 80 

Phe Gly Glu Arg Leu Ser Asp Glu Val Arg Ala Phe Met Ser Lys His 

85 90 95 

Glu Ala Ala Leu Asp Gly Leu Leu Gin Asp Phe Lys Gin Ser Lys Ala 

100 105 110 

Tyr Thr Asn Phe Val Asn Cys Gly Tyr Leu Ser Ala Val Arg Phe Tyr 

115 120 125 

Asp Thr Tyr Val Leu Arg Thr Gin Gly Ser Ser Pro lie Phe Glu Ser 

130 135 140 

Val Ala Gin Met Phe Met Arg Val Ala Val Phe Val Ala Cys Gin Cys 
145 150 155 160 

lie Lys Phe Pro Cys Leu Arg Glu Thr Leu Arg His Leu Val Glu Ser 

165 170 175 

Glu Thr Glu Leu Asp Glu Met Tyr Leu Val Gly Tyr Ala Phe His Tyr 

180 185 190 

lie Ser Ser Gin lie Val Cys Cys Ala Thr Pro Val Leu Arg Ser Ala 

195 200 205 

Gly Leu Arg Gly Gly Gin Leu Ser Ser Cys Phe lie Leu Lys Pro Ser 

210 215 220 

Met Ala Thr Glu Asp Lys Thr Leu Lys Ala Leu His Glu Glu Met Ser 
225 230 235 240 

Pro Leu Leu Ala Ser Lys Ser Gly Val Gly lie Asp Val Ser Ser Phe 

245 250 255 

Ala Glu His Lys Asn lie Thr Ser Cys Leu Lys Leu lie Asn Ala His 

260 265 270 

Val Gly Tyr Phe Asn Asp Asn Asn lie Arg Pro Val Gly Ala Ser Ala 

275 280 285 

Tyr Met Glu Leu Trp His His Gin lie Cys Asp Phe Leu Asn Ala Lys 

290 295 300 

Met Pro Glu Asn Gin Glu Arg Cys His Asn Leu Phe Gin Gly Val Cys 
305 310 315 320 

Val Pro Glu Leu Phe Phe Arg Leu Tyr Glu Thr Asn Pro Asp Gly Gin 

325 330 335 

Trp His Leu Phe Ala Pro Glu Val Ala Pro Asn Leu Leu Lys Leu Tyr 

340 345 350 

Gly Ala Glu Phe Glu lie Glu Tyr Asn Arg Leu Val Ala Ala Gly Lys 

355 360 365 

His Ser Ser Ser Leu Pro Leu Lys Ser Met Met Tyr Ala Leu lie Asn 

370 375 380 

Thr Val lie Lys Thr Gly Ser Pro Tyr Val Leu Leu Lys Glu Ala Leu 
385 390 395 400 

Asn Lys His His Trp Cys Glu Thr Gin Gly Ser Ala lie Asn Cys Ser 

405 410 415 

Asn Leu Cys Ala Glu lie Val Gin Gin Pro Glu Gly Gin Ala Ser Val 

420 425 430 

Cys Asn Leu Ala Asn lie Ser Leu Pro Lys Cys Leu Arg Pro His Arg 

435 440 445 

Gly Glu Ser Gly Val Glu Pro Gly Lys Gly Asp Val Thr Phe Gly Phe 

450 455 460 

Glu Leu Leu Asp Asp Ala Val Glu Ala Ala Val lie lie Val Asn Ala 
465 470 475 480 

Cys lie Leu Gly Gly Thr Ala Pro Thr Glu Ser Val Arg Arg Gly Gin 

485 490 495 

Glu Glu Arg Ser Met Gly He Gly Val Gin Gly Leu Ala Asp Val Phe 

500 505 510 

Ala Glu Leu Gly Phe Gly Tyr Leu Asp Ala Glu Ser Ala Lys Leu Asp 
515 520 525 
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Val Glu He Phe Gin Ala Met Tyr Phe Thr Ala Val Gin Thr Ser His 

530 535 540 

Glu He Val Leu Leu Gly Glu Gly Thr Pro Phe Arg Gly Trp Glu Arg 
545 550 555 560 

Ser Arg Leu Ala Gin Gly Val Phe His Trp Gin Thr Trp Asp Gly Val 

565 570 575 

Lys Pro Ser His Pro Pro Leu Glu Arg Trp Glu Gin Leu Gly Arg Ser 

580 585 590 

He Ala Gin His Gly He Phe Asn Ser Gin Phe Leu Ala Leu Met Pro 

595 600 605 

Thr Ala Gly Thr Ser Gin Leu Thr Gly Tyr Thr Glu Ala Phe Tyr Pro 

610 615 620 

Phe Phe Ala Asn He Ala Ser Lys Val Thr Ser Lys Glu Glu He Leu 
625 630 635 640 

Lys Pro Asn Val Thr Phe Phe Lys Arg Val Lys Pro Gly Asp Leu Arg 

645 650 655 

Thr Val Arg Arg Tyr Gly Gly Asp Val Ala Ser Phe Pro Glu Pro Leu 

660 665 670 

Lys Asp Arg Tyr Lys He Phe Leu Thr Ala Phe Asp Tyr Cys Pro He 

675 680 685 

Lys Gin Leu Glu Arg Ala Gly Ala Arg Ala Pro Phe Val Asp Gin Ser 

690 695 700 

Gin Ser Leu Asn Phe Phe Leu Lys Glu Glu Gin Ala Thr Arg Ala Ser 
705 710 715 720 

Tyr He Arg Asp Leu Leu Leu Thr Gly Tyr Arg Leu Gly Leu Lys Thr 

725 730 735 

Met Leu Tyr Tyr Cys Arg He Gin Lys Gin Thr Lys Leu Asn Ala Leu 

740 745 750 

Gin Cys Leu Asp Gin Val Val Gly Asp Asn He Ser Ser Glu Gly Ala 

755 760 765 

Glu Ser Asn Cys Val Gin Lys Ala Asp Gly Glu Arg Thr Lys Val Cys 

770 775 780 

Leu Ala Cys Gin 
785 



<210> 138 
<211> 996 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 
<221> CDS 
<222> (1) . - (996) 

<400> 138 

atg aag acc agg gac gca aac gtg aac aag ctg aat gat agt ttg atg 

Met Lys Thr Arg Asp Ala Asn Val Asn Lys Leu Asn Asp Ser Leu Met 

15 10 15 

cgt eta ctg ccg ccg cct cct cac egg gtt teg tta tct cgc ggt cgc 
Arg Leu Leu Pro Pro Pro Pro His Arg Val Ser Leu Ser Arg Gly Arg 
20 25 30 

gac ttt tct aag ggt gtt cgt gat ctg etc tct aaa tat gtg gtg teg 
Asp Phe Ser Lys Gly Val Arg Asp Leu Leu Ser Lys Tyr Val Val Ser 
35 40 45 

acc acc act ggg gtg gag gec ate aaa gat gga ttt tta tct gtc teg 192 
Thr Thr Thr Gly Val Glu Ala He Lys Asp Gly Phe Leu Ser Val Ser 



48 



96 



144 



234 
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50 



55 



60 



cct aca tgt caa acg tat gga gat ttt ttg att tat tct cag acc atg 
Pro Thr Cys Gin Thr Tyr Gly Asp Phe Leu He Tyr Ser Gin Thr Met 
65 70 75 80 



240 



agt tea cag gaa cct egg ggc acc tac ctg ttt tec ttt aaa cag act 
Ser Ser Gin Glu Pro Arg Gly Thr Tyr Leu Phe Ser Phe Lys Gin Thr 
85 90 95 



288 



gac act ggg tec tec att gac atg tta ttc acg cct acg teg ctg gcg 
Asp Thr Gly Ser Ser He Asp Met Leu Phe Thr Pro Thr Ser Leu Ala 
100 105 . 110 



336 



cga ctg tct agg atg gac gcg gac tct gec ccc cag acc aac aga ata 
Arg Leu Ser Arg Met Asp Ala Asp Ser Ala Pro Gin Thr Asn Arg He 
115 120 125 



384 



gcg tgt gtt tgg tac ggc cac gag age ggt eta ctt gac gee ata ccc 
Ala Cys Val Trp Tyr Gly His Glu Ser Gly Leu Leu Asp Ala lie Pro 
130 135 140 



432 



aat ttt gaa gaa ttg ttg gaa acc ggc teg ctt cat cag ttt ctt gcg 
Asn Phe Glu Glu Leu Leu Glu Thr Gly Ser Leu His Gin Phe Leu Ala 
145 150 155 160 



480 



ccg gtc ggg ccg ttg gtt caa acg gtc cac agt acg ttt gtc acc aag 
Pro Val Gly Pro Leu Val Gin Thr Val His Ser Thr Phe Val Thr Lys 
165 170 175 



528 



gtg acg tct gcg tta aag ggc aac gta gtc gec aga gag ccc gtt gtt 
Val Thr Ser Ala Leu Lys Gly Asn Val Val Ala Arg Glu Pro Val Val 
180 185 190 



576 



aca cat att ggc ctt acg ctt cct agt gac atg ttt gtg gat tta gat 
Thr His He Gly Leu Thr Leu Pro Ser Asp Met Phe Val Asp Leu Asp 
195 200 205 



624 



gac tea tgc ccg tec age ctg cgc gac gag cct ttg ccc gec cac teg 
Asp Ser Cys Pro Ser Ser Leu Arg Asp Glu Pro Leu Pro Ala His Ser 
210 215 220 



672 



tct att tat gtt tgc tta acc tac att agg gta aat aat aga ccg gcg 
Ser He Tyr Val Cys Leu Thr Tyr He Arg Val Asn Asn Arg Pro Ala 
225 230 235 240 



720 



eta gga ctt ggg ttt ttt aag tct ggt aaa gga tat tgt gaa ata get 
Leu Gly Leu Gly Phe Phe Lys Ser Gly Lys Gly Tyr Cys Glu He Ala 
245 250 255 



768 



gee caa ttg agg gat ttt tac age ggc gtg att cgc acc aaa tac att 
Ala Gin Leu Arg Asp Phe Tyr Ser Gly Val He Arg Thr Lys Tyr He 
260 265 270 



816 



cag ctg cag aat gac ctg tat ata aat agg ctt gcg ttc ggc gta gtt 
Gin Leu Gin Asn Asp Leu Tyr He Asn Arg Leu Ala Phe Gly Val Val 
275 280 285 



864 



t9 c a 9 a ctt 99 a a 9 c 9 fc 9 cct tcc 99 1 tta caa cca tct ttt caa age 
Cys Arg Leu Gly Ser Val Pro Ser Gly Leu Gin Pro Ser Phe Gin Ser 
290 295 300 

235 



912 
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ctt cac ttt aaa ggc get get tta ccg gtg tta aaa ttt acg gag ttt 960 
Leu His Phe Lys Gly Ala Ala Leu Pro Val Leu Lys Phe Thr Glu Phe 
305 310 315 320 



gtt tec aat ccg ggg tct tgg aag eta ttt ctt taa 
Val Ser Asn Pro Gly Ser Trp Lys Leu Phe Leu 
325 330 



<210> 139 
<211> 331 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 139 

Met Lys Thr Arg Asp Ala Asn Val Asn Lys Leu Asn Asp Ser Leu Met 

! 5 10 15 

Arg Leu Leu Pro Pro Pro Pro His Arg Val Ser Leu Ser Arg Gly Arg 

20 25 30 

Asp Phe Ser Lys Gly Val Arg Asp Leu Leu Ser Lys Tyr Val Val Ser 

35 40 45 

Thr Thr Thr Gly Val Glu Ala lie Lys Asp Gly Phe Leu Ser Val Ser 

50 55 60 

Pro Thr Cys Gin Thr Tyr Gly Asp Phe Leu lie Tyr Ser Gin Thr Met 
65 70 75 80 

Ser Ser Gin Glu Pro Arg Gly Thr Tyr Leu Phe Ser Phe Lys Gin Thr 

85 90 95 

Asp Thr Gly Ser Ser lie Asp Met Leu Phe Thr Pro Thr Ser Leu Ala 

100 105 110 

Arg Leu Ser Arg Met Asp Ala Asp Ser Ala Pro Gin Thr Asn Arg lie 

115 120 125 

Ala Cys Val Trp Tyr Gly His Glu Ser Gly Leu Leu Asp Ala lie Pro 

130 135 140 

Asn Phe Glu Glu Leu Leu Glu Thr Gly Ser Leu His Gin Phe Leu Ala 
145 150 155 160 

Pro Val Gly Pro Leu Val Gin Thr Val His Ser Thr Phe Val Thr Lys 

165 170 175 

Val Thr Ser Ala Leu Lys Gly Asn Val Val Ala Arg Glu Pro Val Val 

180 185 190 

Thr His lie Gly Leu Thr Leu Pro Ser Asp Met Phe Val Asp Leu Asp 

195 200 205 

Asp Ser Cys Pro Ser Ser Leu Arg Asp Glu Pro Leu Pro Ala His Ser 

210 215 220 

Ser lie Tyr Val Cys Leu Thr Tyr lie Arg Val Asn Asn Arg Pro Ala 
225 230 235 240 

Leu Gly Leu Gly Phe Phe Lys Ser Gly Lys Gly Tyr Cys Glu lie Ala 

245 250 255 

Ala Gin Leu Arg Asp Phe Tyr Ser Gly Val He Arg Thr Lys Tyr He 

260 265 270 

Gin Leu Gin Asn Asp Leu Tyr He Asn Arg Leu Ala Phe Gly Val Val 

275 280 285 

Cys Arg Leu Gly Ser Val Pro Ser Gly Leu Gin Pro Ser Phe Gin Ser 

290 295 300 

Leu His Phe Lys Gly Ala Ala Leu Pro Val Leu Lys Phe Thr Glu Phe 
305 310 315 320 

Val Ser Asn Pro Gly Ser Trp Lys Leu Phe Leu 
325 330 



996 



236 
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<210> 140 
<211> 2820 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 



<220> 

<221> CDS 

<222> (1) - - (2820) 



<400> 140 

atg gcg teg tea att cct gec gca egg get gat aac ggt gac gag aac 4 8 

Met Ala Ser Ser lie Pro Ala Ala Arg Ala Asp Asn Gly Asp Glu Asn 

15 10 15 



acc ggc ggg ctt tac aaa tta act gac aac etc eta acg tgc ace gga 96 
Thr Gly Gly Leu Tyr Lys Leu Thr Asp Asn Leu Leu Thr Cys Thr Gly 
20 25 30 



teg eta caa cag ctt aaa etc ctg atg gag ttc caa eta aaa cca eta 144 
Ser Leu Gin Gin Leu Lys Leu Leu Met Glu Phe Gin Leu Lys Pro Leu 
35 40 45 



cca acc gca cac ctt tta age atg ccc acc gtg acc egg ttt tta aat 192 
Pro Thr Ala His Leu Leu Ser Met Pro Thr Val Thr Arg Phe Leu Asn 
50 55 60 



act gca ttc aaa ata gac aac ccc ctg gtt tea ttt att caa aaa cac 240 
Thr Ala Phe Lys lie Asp Asn Pro Leu Val Ser Phe lie Gin Lys His 
65 70 75 80 



cct gtg ttt ttt tta atg aga gtc gec agg ctt ccg gag cca gtc att 288 
Pro Val Phe Phe Leu Met Arg Val Ala Arg Leu Pro Glu Pro Val lie 
85 90 95 



aca gac cac caa age gca gaa acg tea aca ggc ata eta tec gag gtt 336 
Thr Asp His Gin Ser Ala Glu Thr Ser Thr Gly lie Leu Ser Glu Val 
100 105 110 



gtg aat gtt ctt aat aca get att cga aaa cca cac gag tec cca gcg 384 
Val Asn Val Leu Asn Thr Ala He Arg Lys Pro His Glu Ser Pro Ala 
115 120 125 



get aaa gac aac gat tat etc gac aac cgt gee ata ttg gee atg att 432 
Ala Lys Asp Asn Asp Tyr Leu Asp Asn Arg Ala He Leu Ala Met He 
130 135 140 



aca gaa tac ate cat cac gta act tea cgt acg ccc teg ggg ate cca 480 
Thr Glu Tyr He His His Val Thr Ser Arg Thr Pro Ser Gly He Pro 
145 150 155 160 



ccg aca cca cca atg ggt ate age cat eta ccg tgc gta gag caa att 528 
Pro Thr Pro Pro Met Gly He Ser His Leu Pro Cys Val Glu Gin He 
165 170 175 



tta cac gaa acc cac egg caa tac tgg aac eta acc etc ccg gag teg 576 

Leu His Glu Thr His Arg Gin Tyr Trp Asn Leu Thr Leu Pro Glu Ser 
180 185 190 

eta ttt ate gac ate ggc gaa gtc gcg tct ccg ctt cag acg tgg eta 624 

Leu Phe He Asp He Gly Glu Val Ala Ser Pro Leu Gin Thr Trp Leu 
195 200 205 



237 



ate eta teg tat tgt aaa aaa ctg caa ctg gca cca ccg ccc eta ttc 
lie Leu Ser Tyr Cys Lys Lys Leu Gin Leu Ala Pro Pro Pro Leu Phe 
210 215 220 



cca cct gtc gac gaa ctg gec cgt cgt ctg gtt acc ggc cac cac gaa 
Pro Pro Val Asp Glu Leu Ala Arg Arg Leu Val Thr Gly His His Glu 
225 230 235 240 



ttg ttt gtc ccc eta teg acg tea ctg gaa acg tac ate acc atg cca 
Leu Phe Val Pro Leu Ser Thr Ser Leu Glu Thr Tyr lie Thr Met Pro 
245 250 255 



gta tea aaa cag cga gcg ttt gag ata tac age gtc ttt gec aaa tea 
Val Ser Lys Gin Arg Ala Phe Glu lie Tyr Ser Val Phe Ala Lys Ser 
260 265 270 



aaa aac ata gtt gac ggc aca cca att tta gcg ttc acc gac aca gaa 
Lys Asn lie Val Asp Gly Thr Pro He Leu Ala Phe Thr Asp Thr Glu 
275 280 285 



ctt aca acg ttc acc ccc gaa ctg tta ttc ctg tac gac ttc gta ate 
Leu Thr Thr Phe Thr Pro Glu Leu Leu Phe Leu Tyr Asp Phe Val He 
290 295 300 



gag tec ctg tgc aaa aac cag gca tac ggg tgt teg cgc aac gca att 
Glu Ser Leu Cys Lys Asn Gin Ala Tyr Gly Cys Ser Arg Asn Ala He 
305 310 315 320 



gag cat ttt att aag aaa ggt ate gat ttt atg gcg gag ttg ggg gcg 
Glu His Phe He Lys Lys Gly He Asp Phe Met Ala Glu Leu Gly Ala 
325 330 335 



ttc att gaa aaa acg tgt ggc tat cga tea acc gta age ctg tec aac 
Phe He Glu Lys Thr Cys Gly Tyr Arg Ser Thr Val Ser Leu Ser Asn 
340 345 350 



gtg aga gec gtt aag gee agg etc gcg age tgt ggc eta tec aaa gag 
Val Arg Ala Val Lys Ala Arg Leu Ala Ser Cys Gly Leu Ser Lys Glu 
355 360 365 



gcg tgt gaa gat ttc cgc gca atg ata ctg atg aca ccg cac gag acg 
Ala Cys Glu Asp Phe Arg Ala Met He Leu Met Thr Pro His Glu Thr 
370 375 380 



acg cca aag tgg gaa aat ttt acg gac ttt tta gaa atg gta aac caa 
Thr Pro Lys Trp Glu Asn Phe Thr Asp Phe Leu Glu Met Val Asn Gin 
385 390 395 400 



tta aca eta tac ggg ttc tat ttt tac gag tgt ctt aac cag tac age 
Leu Thr Leu Tyr Gly Phe Tyr Phe Tyr Glu Cys Leu Asn Gin Tyr Ser 
405 410 415 



ccc act age ata teg ctg gec aaa att caa aat ate tta aac cga gtc 

Pro Thr Ser He Ser Leu Ala Lys He Gin Asn He Leu Asn Arg Val 
420 425 430 

gat gee gaa cag age gat cgc gec ctg tgg cgc act cca tta ate ggg 

Asp Ala Glu Gin Ser Asp Arg Ala Leu Trp Arg Thr Pro Leu He Gly 

435 440 445 



238 
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tct ttc ccg ttc ccc tgg aaa ttg aac aac gtg ctg gcg ttt ttt aag 
Ser Phe Pro Phe Pro Trp Lys Leu Asn Asn Val Leu Ala Phe Phe Lys 
450 455 460 



1392 



cct age acc ccc gtg gca acg eta caa aaa ate tac aag gca ata ccg 
Pro Ser Thr Pro Val Ala Thr Leu Gin Lys lie Tyr Lys Ala lie Pro 
465 470 475 480 



1440 



teg tac eta atg agg tct ttg ttc gaa ata gcg gec aac aaa teg tgg 
Ser Tyr Leu Met Arg Ser Leu Phe Glu lie Ala Ala Asn Lys Ser Trp 
485 490 495 



1488 



ggt aac ate gcg etc gca gag age gee ccg eta acc gac ata cag acc 
Gly Asn lie Ala Leu Ala Glu Ser Ala Pro Leu Thr Asp lie Gin Thr 
500 505 510 



1536 



gec gaa cca gac cag ggt ccc gtg tec gee cag gtt ate gca aaa tat 
Ala Glu Pro Asp Gin Gly Pro Val Ser Ala Gin Val lie Ala Lys Tyr 
515 520 525 



1584 



tgc age cgt etc caa ata age gcg aca gat tat gac gca gec ate gtc 
Cys Ser Arg Leu Gin lie Ser Ala Thr Asp Tyr Asp Ala Ala lie Val 
530 535 540 



1632 



tec age ccg gga ttc gec gec gag ttc att aaa aca aaa ctg tat ccg 
Ser Ser Pro Gly Phe Ala Ala Glu Phe lie Lys Thr Lys Leu Tyr Pro 
545 550 555 560 



1680 



ate ctt age gag gtg etc cga aac acg tec aaa aaa aat cga teg eta 
lie Leu Ser Glu Val Leu Arg Asn Thr Ser Lys Lys Asn Arg Ser Leu 
565 570 575 



1728 



ttc caa att aga tgg etc ata gtc ttc gcg gca gag gac gec aga gac 
Phe Gin lie Arg Trp Leu lie Val Phe Ala Ala Glu Asp Ala Arg Asp 
580 585 590 



1776 



eta gec cct ate aga cgc teg ctg , gec ctg gcg tat ttt caa ata atg 
Leu Ala Pro lie Arg Arg Ser Leu Ala Leu Ala Tyr Phe Gin He Met 
595 600 605 



1824 



gac att ttg gaa gaa aag cat tct ccg gag teg ttt tac aac ctg ctg 
Asp He Leu Glu Glu Lys His Ser Pro Glu Ser Phe Tyr Asn Leu Leu 
610 615 620 



1872 



gac tat ctt cag gaa aca ttt agg tgc ata cga cag gtg ata ccg gaa 
Asp Tyr Leu Gin Glu Thr Phe Arg Cys He Arg Gin Val He Pro Glu 
625 630 635 640 



1920 



gec acg tgc cca caa gaa ttt eta caa tac ttg ttt act ttt caa aac 
Ala Thr Cys Pro Gin Glu Phe Leu Gin Tyr Leu Phe Thr Phe Gin Asn 
645 650 655 



1968 



att cca ata gca gcg age ttc att caa aca tec atg acc ttt gta gac 
He Pro He Ala Ala Ser Phe He Gin Thr Ser Met Thr Phe Val Asp 
660 665 670 



2016 



gac ctg aaa aac ggc ate ccc ggt ata ctg gac ctt gtc tec tta ggt 
Asp Leu Lys Asn Gly He Pro Gly He Leu Asp Leu Val Ser Leu Gly 
675 680 685 



2064 



gec gcg ttt tat aac atg aaa eta ctg tac gat tea acg eta gac acc 

239 
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Ala Ala Phe Tyr Asn Met Lys Leu Leu Tyr Asp Ser Thr Leu Asp Thr 

690 695 700 

gtc gag att cca acg gaa gaa ggg caa ccc ate gtc gtg age atg ttc 2160 

Val Glu He Pro Thr Glu Glu Gly Gin Pro He Val Val Ser Met Phe 

705 710 715 720 

gta ttc aaa tec acg att cgc gtc etc gag aag etc tta cag gaa gec 2208 

Val Phe Lys Ser Thr lie Arg Val Leu Glu Lys Leu Leu Gin Glu Ala 

725 730 735 

gtt ate gcg tta act caa acg tec gaa ccg atg tac gca gcg cac ate 2256 

Val He Ala Leu Thr Gin Thr Ser Glu Pro Met Tyr Ala Ala His He 

740 745 750 

cgt ctg atg caa cac etc acg tac atg caa aag ate gee gga cac gaa 2304 

Arg Leu Met Gin His Leu Thr Tyr Met Gin Lys He Ala Gly His Glu 

755 760 765 

ata atg acc aca caa ctt cca tec gtg ttt cac gaa ata cac gag gga 2352 

He Met Thr Thr Gin Leu Pro Ser Val Phe His Glu He His Glu Gly 

770 775 780 

tat ttg cag tgt ttt aag cgt ttt aaa cgt etc atg tta cac gtt acg 2400 

Tyr Leu Gin Cys Phe Lys Arg Phe Lys Arg Leu Met Leu His Val Thr 

785 790 795 800 

gga age tgc tgc tac tea ctg acg cga tac ttt gga ttc eta tat caa 2448 

Gly Ser Cys Cys Tyr Ser Leu Thr Arg Tyr Phe Gly Phe Leu Tyr Gin 

805 810 815 

ccc ccc eta ata ccc gat acc ate gta caa aaa' att tta aac ttt aac 2496 

Pro Pro Leu He Pro Asp Thr He Val Gin Lys He Leu Asn Phe Asn 

820 825 830 

gac aaa acg gac aca acc gac gac ate tta aag age ctg tea cag ccc 2544 

Asp Lys Thr Asp Thr Thr Asp Asp He Leu Lys Ser Leu Ser Gin Pro 

835 840 845 

gtg aga caa gga cct eta teg get gaa aac gaa agt age agt cga etc 2592 

Val Arg Gin Gly Pro Leu Ser Ala Glu Asn Glu Ser Ser Ser Arg Leu 

850 855 860 

tea aaa aac aac gtt gag ctg ctt caa aaa ctg tac gac gac ttt egg 2640 

Ser Lys Asn Asn Val Glu Leu Leu Gin Lys Leu Tyr Asp Asp Phe Arg 

865 870 875 880 

acc gee tec aca aac aat aac ccc acc tct att aaa ctt gaa tat teg 2688 

Thr Ala Ser Thr Asn Asn Asn Pro Thr Ser He Lys Leu Glu Tyr Ser 

885 890 895 

ggt aat tat' aac gaa aca caa gtg tec gta gat tgg age aca tat aac 2736 

Gly Asn Tyr Asn Glu Thr Gin Val Ser Val Asp Trp Ser Thr Tyr Asn 

900 905 910 

ctg gtg aca tac acc gca ccc gac gat acg tta aaa ttc acc ccg gtt 2784 

Leu Val Thr Tyr Thr Ala Pro Asp Asp Thr Leu Lys Phe Thr Pro Val 

915 920 925 

aac acg gag gca eta gat cgc atg ttt gca gaa taa 2820 

Asn Thr Glu Ala Leu Asp Arg Met Phe Ala Glu 



240 



930 935 940 



<210> 141 
<211> 939 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 141 

Met Ala Ser Ser lie Pro Ala Ala Arg Ala Asp Asn Gly Asp Glu Asn 

15 10 15 

Thr Gly Gly Leu Tyr Lys Leu Thr Asp Asn Leu Leu Thr Cys Thr Gly 

20 25 30 

Ser Leu Gin Gin Leu Lys Leu Leu Met Glu Phe Gin Leu Lys Pro Leu 

35 40 45 

Pro Thr Ala His Leu Leu Ser Met Pro Thr Val Thr Arg Phe Leu Asn 

50 55 60 

Thr Ala Phe Lys lie Asp Asn Pro Leu Val Ser Phe He Gin Lys His 
65 70 75 80 

Pro Val Phe Phe Leu Met Arg Val Ala Arg Leu Pro Glu Pro Val He 

85 90 95 

Thr Asp His Gin Ser Ala Glu Thr Ser Thr Gly He Leu Ser Glu Val 

100 105 110 

Val Asn Val Leu Asn Thr Ala He Arg Lys Pro His Glu Ser Pro Ala 

115 120 125 

Ala Lys Asp Asn Asp Tyr Leu Asp Asn Arg Ala He Leu Ala Met He 

130 135 140 

Thr Glu Tyr He His His Val Thr Ser Arg Thr Pro Ser Gly He Pro 
145 150 155 160 

Pro Thr Pro Pro Met Gly He Ser His Leu Pro Cys Val Glu Gin He 

165 170 175 

Leu His Glu Thr His Arg Gin Tyr Trp Asn Leu Thr Leu Pro Glu Ser 

180 185 190 

Leu Phe He Asp He Gly Glu Val Ala Ser Pro Leu Gin Thr Trp Leu 

195 200 205 

He Leu Ser Tyr Cys Lys Lys Leu Gin Leu Ala Pro Pro Pro Leu Phe 

210 215 220 

Pro Pro Val Asp Glu Leu Ala Arg Arg Leu Val Thr Gly His His Glu 
225 230 235 240 

Leu Phe Val Pro Leu Ser Thr Ser Leu Glu Thr Tyr He Thr Met Pro 

245 250 255 

Val Ser Lys Gin Arg Ala Phe Glu He Tyr Ser Val Phe Ala Lys Ser 

260 265 270 

Lys Asn He Val Asp Gly Thr Pro He Leu Ala Phe Thr Asp Thr Glu 

275 280 285 

Leu Thr Thr Phe Thr Pro Glu Leu Leu Phe Leu Tyr Asp Phe Val He 

290 295 300 

Glu Ser Leu Cys Lys Asn Gin Ala Tyr Gly Cys Ser Arg Asn Ala He 
305 310 315 320 

Glu His Phe He Lys Lys Gly He Asp Phe Met Ala Glu Leu Gly Ala 

325 330 335 

Phe He Glu Lys Thr Cys Gly Tyr Arg Ser Thr Val Ser Leu Ser Asn 

340 345 350 

Val Arg Ala Val Lys Ala Arg Leu Ala Ser Cys Gly Leu Ser Lys Glu 

355 360 365 

Ala Cys Glu Asp Phe Arg Ala Met He Leu Met Thr Pro His Glu Thr 

370 375 380 

Thr Pro Lys Trp Glu Asn Phe Thr Asp Phe Leu Glu Met Val Asn Gin 
385 390 395 400 

Leu Thr Leu Tyr Gly Phe Tyr Phe Tyr Glu Cys Leu Asn Gin Tyr Ser 
405 410 415 



241 



Pro Thr Ser He Ser Leu Ala Lys He Gin Asn He Leu Asn Arg Val 

420 425 430 

Asp Ala Glu Gin Ser Asp Arg Ala Leu Trp Arg Thr Pro Leu lie Gly 

435 440 445 

Ser Phe Pro Phe Pro Trp Lys Leu Asn Asn Val Leu Ala Phe Phe Lys 

450 455 460 

Pro Ser Thr Pro Val Ala Thr Leu Gin Lys He Tyr Lys Ala He Pro 
465 470 475 480 

Ser Tyr Leu Met Arg Ser Leu Phe Glu He Ala Ala Asn Lys Ser Trp 

485 490 495 

Gly Asn He Ala Leu Ala Glu Ser Ala Pro Leu Thr Asp He Gin Thr 

500 505 510 

Ala Glu Pro Asp Gin Gly Pro Val Ser Ala Gin Val He Ala Lys Tyr 

515 520 525 

Cys Ser Arg Leu Gin He Ser Ala Thr Asp Tyr Asp Ala Ala He Val 

530 535 540 

Ser Ser Pro Gly Phe Ala Ala Glu Phe lie Lys Thr Lys Leu Tyr Pro 
545 550 555 560 

He Leu Ser Glu Val Leu Arg Asn Thr Ser Lys Lys Asn Arg Ser Leu 

565 570 575 

Phe Gin He Arg Trp Leu He Val Phe Ala Ala Glu Asp Ala Arg Asp 

580 585 590 

Leu Ala Pro He Arg Arg Ser Leu Ala Leu Ala Tyr Phe Gin He Met 

595 600 605 

Asp He Leu Glu Glu Lys His Ser Pro Glu Ser Phe Tyr Asn Leu Leu 

610 615 620 

Asp Tyr Leu Gin Glu Thr Phe Arg Cys He Arg Gin Val He Pro Glu 
625 630 635 640 

Ala Thr Cys Pro Gin Glu Phe Leu Gin Tyr Leu Phe Thr Phe Gin Asn 

645 650 655 

He Pro He Ala Ala Ser Phe He Gin Thr Ser Met Thr Phe Val Asp 

660 665 670 

Asp Leu Lys Asn Gly He Pro Gly He Leu Asp Leu Val Ser Leu Gly 

675 680 685 

Ala Ala Phe Tyr Asn Met Lys Leu Leu Tyr Asp Ser Thr Leu Asp Thr 

690 695 700 

Val Glu He Pro Thr Glu Glu Gly Gin Pro He Val Val Ser Met Phe 
705 710 715 720 

Val Phe Lys Ser Thr He Arg Val Leu Glu Lys Leu Leu Gin Glu Ala 

725 730 735 

Val He Ala Leu Thr Gin Thr Ser Glu Pro Met Tyr Ala Ala His He 

740 745 750 

Arg Leu Met Gin His Leu Thr Tyr Met Gin Lys He Ala Gly His Glu 

755 760 765 

He Met Thr Thr Gin Leu Pro Ser Val Phe His Glu He His Glu Gly 

770 775 780 

Tyr Leu Gin Cys Phe Lys Arg Phe Lys Arg Leu Met Leu His Val Thr 
785 790 795 800 

Gly Ser Cys Cys Tyr Ser Leu Thr Arg Tyr Phe Gly Phe Leu Tyr Gin 

805 810 815 

Pro Pro Leu He Pro Asp Thr He Val Gin Lys He Leu Asn Phe Asn 

820 825 830 

Asp Lys Thr Asp Thr Thr Asp Asp He Leu Lys Ser Leu Ser Gin Pro 

835 840 845 

Val Arg Gin Gly Pro Leu Ser Ala Glu Asn Glu Ser Ser Ser Arg Leu 

850 855 860 

Ser Lys Asn Asn Val Glu Leu Leu Gin Lys Leu Tyr Asp Asp Phe Arg 
865 870 875 880 

Thr Ala Ser Thr Asn Asn Asn Pro Thr Ser He Lys Leu Glu Tyr Ser 

885 890 895 

Gly Asn Tyr Asn Glu Thr Gin Val Ser Val Asp Trp Ser Thr Tyr Asn 



242 



JL tJ .in 



900 905 910 

Leu Val Thr Tyr Thr Ala Pro Asp Asp Thr Leu Lys Phe Thr Pro Val 

915 920 925 

Asn Thr Glu Ala Leu Asp Arg Met Phe Ala Glu 
930 935 



<210> 142 
<211> 7647 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . - (7647) 

<400> 142 

atg gaa ttg ccg cca att ttt teg aaa ttt aaa ata gag ggc gtg gca 

Met Glu Leu Pro Pro lie Phe Ser Lys Phe Lys lie Glu Gly Val Ala 
15 10 15 

acg acg cac cag gec gac tgc agg ttc gga caa tac gec ggc teg cag 
Thr Thr His Gin Ala Asp Cys Arg Phe Gly Gin Tyr Ala Gly Ser Gin 
20 25 30 



gag 



48 



96 



tgc tta age aac tgc gtt ate tac etc gca caa agt tat ttc aat aga 144 
Cys Leu Ser Asn Cys Val lie Tyr Leu Ala Gin Ser Tyr Phe Asn Arg 
35 40 45 



tec ccc gtg acg gac act aac gac etc gac gac gtt tta egg caa 192 
Glu Ser Pro Val Thr Asp Thr Asn Asp Leu Asp Asp Val Leu Arg Gin 
50 55 60 



999 gcg acg ttg gac ttt ata ctg aga egg tec ggg acg etc ggc tat 240 

Gly Ala Thr Leu Asp Phe lie Leu Arg Arg Ser Gly Thr Leu Gly Tyr 

65 70 75 80 

aac caa tat gee caa ctg cac cac ata ccc age ttt ate aag ace aac 288 

Asn Gin Tyr Ala Gin Leu His His He Pro Ser Phe He Lys Thr Asn 
85 90 95 

gag tgg ace gcg gee ate ttt caa tct cag gag tac ttt gga eta att 336 

Glu Trp Thr Ala Ala He Phe Gin Ser Gin Glu Tyr Phe Gly Leu He 
100 105 110 

gga ctt gac gcg gec ate cgc gaa cct ttc ate gaa tec eta aaa teg 384 

Gly Leu Asp Ala Ala He Arg Glu Pro Phe He Glu Ser Leu Lys Ser 

115 120 125 

ate eta acg cga aat tac gee ggc acc gtt caa tac ttt ctg ttt ata 

He Leu Thr Arg Asn Tyr Ala Gly Thr Val Gin Tyr Phe Leu Phe He 

130 135 140 

tgc ggc gac aaa gec ggg gee gta att ate aaa aac aaa acg ttt tac 

Cys Gly Asp Lys Ala Gly Ala Val He He Lys Asn Lys Thr Phe Tyr 

145 150 155 160 

ctg ttc gat ccc cat tgc gta ccc cac gta cca aat agt ccg gca cac 528 

Leu Phe Asp Pro His Cys Val Pro His Val Pro Asn Ser Pro Ala His 
165 170 175 



432 



480 
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gta ate agt teg tea gac ccc acc gec ata eta gag tac gtg tea ccg 

Val lie Ser Ser Ser Asp Pro Thr Ala lie Leu Glu Tyr Val Ser Pro 
180 185 190 

cca gac aga gaa tac act ggc agt ttt eta tac att atg ccc age gaa 

Pro Asp Arg Glu Tyr Thr Gly Ser Phe Leu Tyr He Met Pro Ser Glu 

195 200 205 

tat gtc aat cca gag cac tac ate acc aac cac tat aga act ata acg 

Tyr Val Asn Pro Glu His Tyr He Thr Asn His Tyr Arg Thr He Thr 
210 215 220 

ttc gec aaa gta cac ggc ccg cac ata gat ata tec acc ggc ata gag 

Phe Ala Lys Val His Gly Pro His He Asp He Ser Thr Gly He Glu 

225 230 235 240 



576 



624 



672 



720 



768 



816 



864 



ccg tgc acc ate gaa gac ate cca age ccg ccg cga teg ccg gat gtg 
Pro Cys Thr He Glu Asp He Pro Ser Pro Pro Arg Ser Pro Asp Val 
245 250 255 

acg tea aaa tea tec aac etc gca cgc gtg ccg aga acc acc acc gac 
Thr Ser Lys Ser Ser Asn Leu Ala Arg Val Pro Arg Thr Thr Thr Asp 
260 265 270 

acg teg age gec aaa ccc cca ccg gcg acg eta tec ggt tta egg ggc 
Thr Ser Ser Ala Lys Pro Pro Pro Ala Thr Leu Ser Gly Leu Arg Gly 
275 280 285 

gcg gaa cca ccg aca age tac cca gac ccg gca acc aac gac gcg gac 912 
Ala Glu Pro Pro Thr Ser Tyr Pro Asp Pro Ala Thr Asn Asp Ala Asp 
290 295 300 

aca aaa etc tta act ccc get cca gcg caa acc gec gtc gac cac ccc 960 
Thr Lys Leu Leu Thr Pro Ala Pro Ala Gin Thr Ala Val Asp His Pro 
305 310 315 320 

gaa ttc caa aca aca cct gga gec acg eta ctt ctt tea gaa ctg teg 1008 
Glu Phe Gin Thr Thr Pro Gly Ala Thr Leu Leu Leu Ser Glu Leu Ser 
325 330 335 

gca tec agg ggt egg aaa cgc aag ctt tec age ctt cag cga tat teg 1056 
Ala Ser Arg Gly Arg Lys Arg Lys Leu Ser Ser Leu Gin Arg Tyr Ser 
340 345 350 

gat tec gac gaa gcg teg tct gac gac gaa ggg gec cca cgt agg cgc 1104 
Asp Ser Asp Glu Ala Ser Ser Asp Asp Glu Gly Ala Pro Arg Arg Arg 
355 360 365 

gta cac gac gat gcg ata tec gee gag gtg ate tgg atg gac gac gat 1152 
Val His Asp Asp Ala He Ser Ala Glu Val He Trp Met Asp Asp Asp 
370 375 380 

ata tct cct eta tat tea ccg teg gcg act cca teg ttt gac gac gtg 1200 
He Ser Pro Leu Tyr Ser Pro Ser Ala Thr Pro Ser Phe Asp Asp Val 
385 390 395 400 

ttc gac age ccc cca atg age ecg gag ttt aca tac gaa gac gcg aca 1248 
Phe Asp Ser Pro Pro Met Ser Pro Glu Phe Thr Tyr Glu Asp Ala Thr 
405 410 415 
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gag gac acg gat ggc gcg ttt eta gaa cag ate get cga gac gcg gaa 
Glu Asp Thr Asp Gly Ala Phe Leu Glu Gin He Ala Arg Asp Ala Glu 
420 425 430 



1296 



acg ccg ttc tct gcg ttc gac gac ctt ata acg gac cac gac ttt tct 
Thr Pro Phe Ser Ala Phe Asp Asp Leu He Thr Asp His Asp Phe Ser 
435 440 445 



1344 



tec ctt gat aaa aaa ata gaa cag tta ate aag tac gaa gcg ccc teg 
Ser Leu Asp Lys Lys He Glu Gin Leu He Lys Tyr Glu Ala Pro Ser 
450 455 460 



1392 



cag cac ctg cca aac ate teg gac aaa caa aac ggg cga gee gtc cga 
Gin His Leu Pro Asn He Ser Asp Lys Gin Asn Gly Arg Ala Val Arg 
465 470 475 480 



1440 



gaa gcg gcg gee etc cag gcg atg gac aaa att atg att aat ate ata 
Glu Ala Ala Ala Leu Gin Ala Met Asp Lys He Met He Asn He He 
485 490 495 



1488 



ctt gaa cac ggt eta att aca gac gcg cag gec egg gga ccg tec gcg 
Leu Glu His Gly Leu He Thr Asp Ala Gin Ala Arg Gly Pro Ser Ala 
500 505 510 



1536 



tgc aaa aac gtt ctt caa ttt ttc ate ctg tgg gga gaa aaa etc aac 
Cys Lys Asn Val Leu Gin Phe Phe He Leu Trp Gly Glu Lys Leu Asn 
515 520 525 



1584 



ata cca ata age gac gec aag cag gtc eta gaa etc gat ctg caa ctg 
He Pro He Ser Asp Ala Lys Gin Val Leu Glu Leu Asp Leu Gin Leu 
530 535 540 



1632 



ata ccg tta cat acg get ate age gaa gga aaa ttc aaa cag ggg gcg 
He Pro Leu His Thr Ala He Ser Glu Gly Lys Phe Lys Gin Gly Ala 
545 550 555 560 



1680 



ttc aaa aaa cac eta aca act aaa ate aac egg tgt ctg gcg tea atg 
Phe Lys Lys His Leu Thr Thr Lys He Asn Arg Cys Leu Ala Ser Met 
565 570 575 



1728 



agg gec acg cac gca gat gcg caa aaa aaa ctg gcg teg get ttc aac 
Arg Ala Thr His Ala Asp Ala Gin Lys Lys Leu Ala Ser Ala Phe Asn 
580 585 590 



1776 



gtc gaa ggc tea cag att teg tec age gaa gcg aaa ata tea gtc egg 
Val Glu Gly Ser Gin He Ser Ser Ser Glu Ala Lys He Ser Val Arg 
595 600 605 



1824 



gcg ctg aag gaa cag ate gec aac cac ctg agt cca ggc ttt tta gcg 
Ala Leu Lys Glu Gin He Ala Asn His Leu Ser Pro Gly Phe Leu Ala 
610 615 - 620 



1872 



gtc tac tec gcg gac gag gta aaa cat eta egg gat aaa att cag gac 
Val Tyr Ser Ala Asp Glu Val Lys His Leu Arg Asp Lys He Gin Asp 
625 630 635 640 



1920 



eta aaa aca ggc ate gag cag cgc aac aaa gaa ate caa cag gaa gaa 
Leu Lys Thr Gly lie Glu Gin Arg Asn Lys Glu He Gin Gin Glu Glu 
645 650 655 



1968 



ctg ttt ttt gat gee atg etc aca gec ctg gac acg ttc caa ccc cct 

245 



2016 



Leu Phe Phe Asp Ala Met Leu Thr Ala Leu Asp Thr Phe Gin Pro Pro 
660 665 670 



ccg aaa acg gca ttt cca atg gag ate ttt ccg cac cgt aaa acc gaa 2064 
Pro Lys Thr Ala Phe Pro Met Glu lie Phe Pro His Arg Lys Thr Glu 
675 680 685 

gtt atg etc gac cac ctg gcg tec ata acc acc agg tta acc gag gac 2112 
Val Met Leu Asp His Leu Ala Ser lie Thr Thr Arg Leu Thr Glu Asp 
690 695 700 

gec acc gaa gec etc aac aat tac ctg gag acc ccg ccc gac caa gga 2160 
Ala Thr Glu Ala Leu Asn Asn Tyr Leu Glu Thr Pro Pro Asp Gin Gly 
705 710 715 720 

acg cac att acc aac att cca aac ttt tea tec ate gtg gca aat ate 2208 
Thr His lie Thr Asn lie Pro Asn Phe Ser Ser lie Val Ala Asn lie 
725 730 735 

ata tec acg tta aaa ate eta acg tac gca gaa aac gac atg caa tta 2256 
lie Ser Thr Leu Lys lie Leu Thr Tyr Ala Glu Asn Asp Met Gin Leu 
740 745 750 

aac gta acg ccc atg gca acg tac agg cgt cag ctg ttg tac etc gga 2304 
Asn Val Thr Pro Met Ala Thr Tyr Arg Arg Gin Leu Leu Tyr Leu Gly 
755 760 765 

ggc gag tta gca acc ate ttt aat tta gag tgg cca tac gaa acc gtg 2352 
Gly Glu Leu Ala Thr lie Phe Asn Leu Glu Trp Pro Tyr Glu Thr Val 
770 775 780 

cca ccg gtt caa gaa ctg ccc etc gtg gcg egg gcg aaa gca aaa atg 2400 
Pro Pro Val Gin Glu Leu Pro Leu Val Ala Arg Ala Lys Ala Lys Met 
785 790 795 800 

gaa teg gta aca aaa atg gaa aag aac caa cag get etc gac caa ata 2448 
Glu Ser Val Thr Lys Met Glu Lys Asn Gin Gin Ala Leu Asp Gin lie 
805 810 815 

ctg gga gac gec gaa acg tta ctt gac aca ata acc gca aca tec gga 2496 
Leu Gly Asp Ala Glu Thr Leu Leu Asp Thr lie Thr Ala Thr Ser Gly 
820 825 830 

gat gag aac ccg gtc cgc gee atg tec ata ccg ata ctg gag acc tac 2544 
Asp Glu Asn Pro Val Arg Ala Met Ser lie Pro lie Leu Glu Thr Tyr 
835 840 845 

att aca aac gca ggc gee ctg ata ggc agt tct cga aac cag egg ttc 2592 
lie Thr Asn Ala Gly Ala Leu lie Gly Ser Ser Arg Asn Gin Arg Phe 
850 855 860 

gaa aaa etc aag gec gec ate cac gac ctg gca tea teg gag teg ttc 2640 
Glu Lys Leu Lys Ala Ala lie His -Asp Leu Ala Ser Ser Glu Ser Phe 
865 870 875 880 

ata ata atg ctg eta aac aac acg egg etc gat aac ata tea gac aat 2688 
lie lie Met Leu Leu Asn Asn Thr Arg Leu Asp Asn lie Ser Asp Asn 
885 890 895 

ct g gec aag ate gac ggc ate ctg acc aac aac aca cgt ttt ctt tea 2736 
Leu Ala Lys He Asp Gly He Leu Thr Asn Asn Thr Arg Phe Leu Ser 
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900 905 910 

aac gcc act gtt age aaa acg etc caa acg ctg gga ggc age eta ata 2784 

Asn Ala Thr Val Ser Lys Thr Leu Gin Thr Leu Gly Gly Ser Leu He 

915 920 925 

cgc gaa tgc gta gaa gcg eta aat aaa agg age ccc tct tec etc aac 2832 

Arg Glu Cys Val Glu Ala Leu Asn Lys Arg Ser Pro Ser Ser Leu Asn 

930 935 940 

aac gcg cgt etc etc gcg gtt caa acc ata ctg ggg cac gcg tec gtt 

Asn Ala Arg Leu Leu Ala Val Gin Thr He Leu Gly His Ala Ser Val 

945 950 955 960 

cca gat cac gag acg ctg acg cga ate gtt tec ggc gtc gcc age gca 

Pro Asp His Glu Thr Leu Thr Arg He Val Ser Gly Val Ala Ser Ala 

965 970 975 

caa aag gaa tec get ggc gat gat cca gat agg tgg acg cga gta acc 2 976 

Gin Lys Glu Ser Ala Gly Asp Asp Pro Asp Arg Trp Thr Arg Val Thr 

980 985 990 



ggt cac eta aac gag ctg aag etc gta act acc caa teg cgt gtc gac 
Gly His Leu Asn Glu Leu Lys Leu Val Thr Thr Gin Ser Arg Val Asp 
995 1000 1005 



2880 



2928 



3024 



aaa gcc acc agg cgc aaa ctg tta atg ata ata acc cgt gac etc aag 3072 

Lys Ala Thr Arg Arg Lys Leu Leu Met He He Thr Arg Asp Leu Lys 
1010 1015 1020 

gag gcg gag gtg tct cag gaa acg gtc ctg gaa aca egg tgg caa gaa 312 0 

Glu Ala Glu Val Ser Gin Glu Thr Val Leu Glu Thr Arg Trp Gin Glu 
1025 1030 1035 1040 

aac gtg eta aag ttt caa ccg teg acg tec aaa gaa ate gaa gac ttt 3168 

Asn Val Leu Lys Phe Gin Pro Ser Thr Ser Lys Glu He Glu Asp Phe 
1045 1050 1055 

tta cag teg gca ccg tea gca aag gcc cga aaa ttc gca gaa aaa cac 3216 

Leu Gin Ser Ala Pro Ser Ala Lys Ala Arg Lys Phe Ala Glu Lys His 
1060 1065 1070 

eta egg acg ctg ate acc caa ttc aac ggc cac gag cga ccg ccg tec 3264 

Leu Arg Thr Leu He Thr Gin Phe Asn Gly His Glu Arg Pro Pro Ser 

1075 1080 1085 

gag gcc acc gcc gtt ccc atg gac tac acg ccg acg ccc ata ccc acg 3312 

Glu Ala Thr Ala Val Pro Met Asp Tyr Thr Pro Thr Pro He Pro Thr 
1090 1095 1100 

cca cag gcc gtt tct acg get acc gcg gaa aag gga aag gcc gca tgg 3360 
Pro Gin Ala Val Ser Thr Ala Thr Ala Glu Lys Gly Lys Ala Ala Trp 
1105 1110 1H5 H20 

aat aaa att caa cag gcc ttt cag gat ttc aac ttt cac etc ate gac 

Asn Lys He Gin Gin Ala Phe Gin Asp Phe Asn Phe His Leu He Asp 
1125 1130 H35 

get teg gat tgg caa gag atg gca tea gaa tac tec aga cac ggc teg 3456 
Ala Ser Asp Trp Gin Glu Met Ala Ser Glu Tyr Ser Arg His Gly Ser 
1140 1145 1150 



3408 
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tec ctt cct ggt acg gtt gga cca aag ctg gtg cgc ttc atg gag age 3504 
Ser Leu Pro Gly Thr Val Gly Pro Lys Leu Val Arg Phe Met Glu Ser 
1155 1160 1165 

ate tea aac ace ctg gac gac ate etc acg cag aag ctg gca tct ctg 3552 
lie Ser Asn Thr Leu Asp Asp He Leu Thr Gin Lys Leu Ala Ser Leu 
1170 1175 H80 

ctt cca aac ggg ccc gcg ttc aga ccc cca gcg ttt gac tgg ate gcg 3600 
Leu Pro Asn Gly Pro Ala Phe Arg Pro Pro Ala Phe Asp Trp He Ala 
1185 H90 H95 1200 

ccc tat caa aca cgc gta aac gcg ttt eta aaa acc ata ggc ctg ccc 3648 
Pro Tyr Gin Thr Arg Val Asn Ala Phe Leu Lys Thr He Gly Leu Pro 
1205 1210 1215 

atg gtg cgc aac ctg gcg gac aag ate cat cac caa tgc caa act gtc 3696 
Met Val Arg Asn Leu Ala Asp Lys He His His Gin Cys Gin Thr Val 
1220 1225 1230 

agt cac gcg gtg caa tec gca gac ctt caa cag gec acg gtg gga aca 3744 
Ser His Ala Val Gin Ser Ala Asp Leu Gin Gin Ala Thr Val Gly Thr 
1235 1240 1245 



agt tta gaa cga ccc gcg gec gaa tac tgt cga ata etc tct gac atg 

Ser Leu Glu Arg Pro Ala Ala Glu Tyr Cys Arg He Leu Ser Asp Met 

1250 1255 1260 

caa gtc gcg ttc aac gac cac gga ate gec gta aga teg gag gec gcg 

Gin Val Ala Phe Asn Asp His Gly He Ala Val Arg Ser Glu Ala Ala 

12 65 1270 1275 1280 

gcg tac acg gac gca ate aac teg ccg gec aac gtc gtg act ccc ccg 

Ala Tyr Thr Asp Ala He Asn Ser Pro Ala Asn Val Val Thr Pro Pro 

1285 1290 1295 

aaa ccc aac eta gaa gec ccc aag aag eta ata acg gca act gac gec 

Lys Pro Asn Leu Glu Ala Pro Lys Lys Leu He Thr Ala Thr Asp Al« 

1300 1305 1310 



a 



3792 



3840 



3888 



3936 



3984 



eta acc gtc gag gac ttt cca gat ttc eta aaa acg tea ate ctt caa 
Leu Thr Val Glu Asp Phe Pro Asp Phe Leu Lys Thr Ser He Leu Gin 
1315 1320 1325 

cag gag cag cga etc att gcg etc cag aga gcg gaa ttt cag caa eta 4032 
Gin Glu Gin Arg Leu He Ala Leu Gin Arg Ala Glu Phe Gin Gin Leu 
1330 1335 1340 

gag gee age ate teg gcg gec gaa egg etc cgc caa tec acc cgt gac 4080 
Glu Ala Ser He Ser Ala Ala Glu Arg Leu Arg Gin Ser Thr Arg Asp 
1345 1350 1355 1360 

gag ate gca ggc aag atg gca acc get ate acg caa etc tta ccc cgc 4128 
Glu He Ala Gly Lys Met Ala Thr Ala He Thr Gin Leu Leu Pro Arg 
1365 1370 1375 



gee 



ccc gtc gca ata tea teg aga ccg ttg aac tta tea aaa cct ata 4176 
Ala Pro Val Ala He Ser Ser Arg Pro Leu Asn Leu Ser Lys Pro He 
1380 1385 1390 
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gac ttt ttg agt tea acg gta tac gac aaa ate ctg gac aag gag cct 
Asp Phe Leu Ser Ser Thr Val Tyr Asp Lys He Leu Asp Lys Glu Pro 
1395 1400 1405 



4224 



tac gag aca gee ata gcg gga ttc gcg tgg ctg gaa ate gcg aca aaa 
Tyr Glu Thr Ala He Ala Gly Phe Ala Trp Leu Glu He Ala Thr Lys 
1410 1415 1420 



4272 



tec gta atg gtc tac agt caa caa aac gaa acg caa cag tta aac gta 
Ser Val Met Val Tyr Ser Gin Gin Asn Glu Thr Gin Gin Leu Asn Val 
1425 1430 1435 1440 



4320 



ctg ctg age gag gta gaa aaa cag age acc gtc gcg cag cgt eta cac 

Leu Leu Ser Glu Val Glu Lys Gin Ser Thr Val Ala Gin Arg Leu His 
1445 1450 1455 

gat ttg gaa ctg teg gcg aaa aac acg gac gac gta aag gtg ctg aag 

Asp Leu Glu Leu Ser Ala Lys Asn Thr Asp Asp Val Lys Val Leu Lys 
1460 1465 1470 

cag gcg eta gac gaa etc gcg ccc etc agg gta aag ggc gga aaa acc 

Gin Ala Leu Asp Glu Leu Ala Pro Leu Arg Val Lys Gly Gly Lys Thr 
1475 1480 1485 



4368 



4416 



4464 



acc gta gac gcg tgg aaa caa aaa ctg gaa age ata gaa tec ctg ctt 
Thr Val Asp Ala Trp Lys Gin Lys Leu Glu Ser He Glu Ser Leu Leu 
1490 1495 1500 

cgc gee acg agg acg gca ggc gaa ata tea teg gag ctt gaa cgc ate 
Arg Ala Thr Arg Thr Ala Gly Glu He Ser Ser Glu Leu Glu Arg He 
1505 1510 1515 1520 

ggc aca cag gcg gtt ggc acc ate acc gtc cgc gat tta gga acg etc 
Gly Thr Gin Ala Val Gly Thr He Thr Val Arg Asp Leu Gly Thr Leu 
1525 1530 1535 



4512 



4560 



4608 



tec gat caa tgc egg gaa gee gca aat ttc etc aga cag gec agt eta 
Ser Asp Gin Cys Arg Glu Ala Ala Asn Phe Leu Arg Gin Ala Ser Leu 
1540 1545 1550 



4656 



ccc gaa ggc ttc teg gac ata ggc aca aaa etc age gag ctt cag gcg 
Pro Glu Gly Phe Ser Asp He Gly Thr Lys Leu Ser Glu Leu Gin Ala 
1555 1560 1565 



4704 



tac att aag tac aaa aaa cag ttt ctg gag cat ttt gaa aca acc cag 
Tyr He Lys Tyr Lys Lys Gin Phe Leu Glu His Phe Glu Thr Thr Gin 
1570 1575 1580 



4752 



cct aac gtc ttt caa cgc ttc ccg eta tec caa aac ata acc gaa aac 
Pro Asn Val Phe Gin Arg Phe Pro Leu Ser Gin Asn He Thr Glu Asn 
1585 1590 1595 1600 



4800 



gtt ccg gcg cgc ccg gcg atg gac teg gtg gee aga ctg acc aat cac 
Val Pro Ala Arg Pro Ala Met Asp Ser Val Ala Arg Leu Thr Asn His 
1605 1610 1615 



4848 



ctt cac gtg cgc ggc age gcg ccc cac ttt aca acg tgg ata gaa acg 
Leu His Val Arg Gly Ser Ala Pro His Phe Thr Thr Trp' He Glu Thr 
1620 1625 1630 



4896 



eta ccg acc gtc gat ccg gaa aaa cca act cac gtc ccg gcg cac gga 
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Leu Pro Thr Val Asp Pro Glu Lys Pro Thr His Val Pro Ala His Gly 

1635 1640 1645 

gga gcc cct ctg cac cgc cag ate acg tac tea aac gtc eta gag gcg 4992 

Gly Ala Pro Leu His Arg Gin lie Thr Tyr Ser Asn Val Leu Glu Ala 

1650 1655 1660 

ttg ttt tea tta tgt tec ace acg eta acc ccg gtt ccg acg gcc ccc 5040 
Leu Phe Ser Leu Cys Ser Thr Thr Leu Thr Pro Val Pro Thr Ala Pro 

1665 1670 1675 1680 

ggt ctg gaa ate gca acc agg gca cgc cgc ggg gca gag gcc gca acg 5 088 

Gly Leu Glu lie Ala Thr Arg Ala Arg Arg Gly Ala Glu Ala Ala Thr 

1685 1690 1695 

tgg atg gac aga cag tgg ccc gac ate get cag acg etc caa gac gtt 5136 

Trp Met Asp Arg Gin Trp Pro Asp lie Ala Gin Thr Leu Gin Asp Val 

1700 1705 1710 

etc gac acg tac gaa cac acc acc gcc cac gca aac egg gac gcg gca 5184 

Leu Asp Thr Tyr Glu His Thr Thr Ala His Ala Asn Arg Asp Ala Ala 

1715 1720 1725 

ttt aac aca ttc ttg gcg atg tgc gtt ttt acg caa ate ate agg ggc 5232 

Phe Asn Thr Phe Leu Ala Met Cys Val Phe Thr Gin lie lie Arg Gly 

1730 1735 1740 

get age aga gcc gtg acg etc ccg aag tta ccg age acc gcc gtc gat 5280 

Ala Ser Arg Ala Val Thr Leu Pro Lys Leu Pro Ser Thr Ala Val Asp 

1745 1750 1755 1760 

ttt cca gaa gag ate gtt eta aca ccc agg gaa tgc aca aca ctg gtc 5328 

Phe Pro Glu Glu lie Val Leu Thr Pro Arg Glu Cys Thr Thr Leu Val 

1765 1770 1775 

acc gcc atg tgg ccc acc ctg gcg gcc gca ate tta cga tta aaa tec 5376 

Thr Ala Met Trp Pro Thr Leu Ala Ala Ala lie Leu Arg Leu Lys Ser 

1780 1785 1790 

tac teg gaa gcc eta gga eta atg agt cgt ttc etc ccg eta atg ttc 5424 

Tyr Ser Glu Ala Leu Gly Leu Met Ser Arg Phe Leu Pro Leu Met Phe 

1795 1800 1805 

cag gcg ctg ccg cac eta acg eta gag gcc cag gtc aaa aac ggc cca 5472 

Gin Ala Leu Pro His Leu Thr Leu Glu Ala Gin Val Lys Asn Gly Pro 

1810 1815 1820 

cat aac acc ccg cct cag ttg aga tgc ttt gcc aaa aca gag gca att 5520 

His Asn Thr Pro Pro Gin Leu Arg Cys Phe Ala Lys Thr Glu Ala lie 

1825 1830 1835 1840 

ccg tat ttc ccg gcg caa tgg cag tea gcg aac eta gag cag age ctg 5568 

Pro Tyr Phe Pro Ala Gin Trp Gin Ser Ala Asn Leu Glu Gin Ser Leu 

1845 1850 1855 

tgg gga cag acg gac ttt ttg caa ate tgc gat aac aat caa cgc aag 5616 

Trp Gly Gin Thr Asp Phe Leu Gin lie Cys Asp Asn Asn Gin Arg Lys 

1860 1865 1870 

gcc agg gtg gcg gcc gtc acc tgg gcg etc acg acg ata gac ggc gtg 5664 

Ala Arg Val Ala Ala Val Thr Trp Ala Leu Thr Thr lie Asp Gly Val 
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1875 1880 1885 

gtt ttg gac caa ctg tgg tec aca ttt aaa ccc atg aca gec gcg tea 5712 

Val Leu Asp Gin Leu Trp Ser Thr Phe Lys Pro Met Thr Ala Ala Ser 

1890 1895 1900 

gac gac acg tac gtc gac eta gtc gag acc eta cac ctg ace ace ttt 5760 

Asp Asp Thr Tyr Val Asp Leu Val Glu Thr Leu His Leu Thr Thr Phe 

1905 1910 1915 1920 

ggc ccg cgc ggt cca acg ccg agg cga gaa acg acc acc gag cac ccg 5808 

Gly Pro Arg Gly Pro Thr Pro Arg Arg Glu Thr Thr Thr Glu His Pro 
1925 1930 1935 

ccg tac gag tac gga cag ccc acg ggc tac tgc ate teg ggt caa teg 5856 

Pro Tyr Glu Tyr Gly Gin Pro Thr Gly Tyr Cys lie Ser Gly Gin Ser 
1940 1945 1950 

acg acg ccg gtc cag get tea aac aca ccg gta tec get ttc gag gcg 5904 

Thr Thr Pro Val Gin Ala Ser Asn Thr Pro Val Ser Ala Phe Glu Ala 
1955 1960 1965 

gtg etc gga gca atg gtg ttt cac gta ccg ate aga ata ttt ttg gcg 5952 

Val Leu Gly Ala Met Val Phe His Val Pro lie Arg lie Phe Leu Ala 

1970 1975 1980 

gee acg ccc aag cgc ctt ggc cag gcg cgc ggc ggc atg ggg etc etc 6000 

Ala Thr Pro Lys Arg Leu Gly Gin Ala Arg Gly Gly Met Gly Leu Leu 

1985 1990 1995 2000 

aca ccc ate ctg gaa tgc gtc ccc gac gtc gag ccc ttc aaa age ctg 6048 

Thr Pro lie Leu Glu Cys Val Pro Asp Val Glu Pro Phe Lys Ser Leu 
2005 2010 2015 

tat aac gca ccc cgc aaa ccc gtg ccc att gaa acg eta ccc gca tec 6096 

Tyr Asn Ala Pro Arg Lys Pro Val Pro lie Glu Thr Leu Pro Ala Ser 
2020 2025 2030 

etc cac ccg cac gac gag cga cag gtc ttt ctg aga cag gca cag tgg 6144 

Leu His Pro His Asp Glu Arg Gin Val Phe Leu Arg Gin Ala Gin Trp 
2035 2040 2045 

eta tec tac cga ttc aca cca cac gaa gec gee egg teg teg act ccg 6192 

Leu Ser Tyr Arg Phe Thr Pro His Glu Ala Ala Arg Ser Ser Thr Pro 

2050 2055 2060 

ccg ctt ctg gtg gtc ata gac cct gaa aac etc gta acg gca acg tac 6240 

Pro Leu Leu Val Val lie Asp Pro Glu Asn Leu Val Thr Ala Thr Tyr 

2065 2070 2075 2080 

tec agt ggc ggg cct gca aat ttc gag age agg ccg ttt tac gtg atg 6288 

Ser Ser Gly Gly Pro Ala Asn Phe Glu Ser Arg Pro Phe Tyr Val Met 
2085 2090 2095 

ccc gga cca tac ccc cca gac tgg cca aaa acg ctg teg gta aca tea 6336 

Pro Gly Pro Tyr Pro Pro Asp Trp Pro Lys Thr Leu Ser Val Thr Ser 
2100 2105 2110 

aac acg tec gtg acg cac etc age cac gac gag ata tgt aac etc ttt 6384 

Asn Thr Ser Val Thr His Leu Ser His Asp Glu lie Cys Asn Leu Phe 
2115 2120 2125 
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act acg eta tec cga gaa cac ggg acc gtg caa ggc agg gat ate ttc 6432 

Thr Thr Leu Ser Arg Glu His Gly Thr Val Gin Gly Arg Asp lie Phe 

2130 2135 2140 

gca gcg get ccg aca aac gtc aca ccg gaa caa acc gee aat cct ccg 6480 

Ala Ala Ala Pro Thr Asn Val Thr Pro Glu Gin Thr Ala Asn Pro Pro 
2145 2150 2155 2160 

gca tgg gaa acg gat aac cga tta ata acg caa aca gaa acc gee aaa 6528 

Ala Trp Glu Thr Asp Asn Arg Leu He Thr Gin Thr Glu Thr Ala Lys 
2165 2170 2175 

aaa cct cat ata att cct gcg tct cct aaa gcg egg aca gat cca ccg 6576 

Lys Pro His He He Pro Ala Ser Pro Lys Ala Arg Thr Asp Pro Pro 
2180 2185 2190 

gtg gaa acc acg acc cac cat tea caa ggg caa gcg teg caa cac gca 6624 

Val Glu Thr Thr Thr His His Ser Gin Gly Gin Ala Ser Gin His Ala 

2195 2200 2205 

aac age aac gta aac cag ccc ggt caa att act tea cac gcg tea cgt 6672 

Asn Ser Asn Val Asn Gin Pro Gly Gin He Thr Ser His Ala Ser Arg 

2210 2215 2220 

aac aca ccg tea acc gca cct cag gec tea tct tea ccg gaa aaa ttc 6720 

Asn Thr Pro Ser Thr Ala Pro Gin Ala Ser Ser Ser Pro Glu Lys Phe 
2225 2230 2235 2240 



aac acg caa acg gtg cct cga eta ata tct caa acg teg gaa acg gec 
Asn Thr Gin Thr Val Pro Arg Leu He Ser Gin Thr Ser Glu Thr Ala 
2245 2250 2255 



ttt ggg acg tat aaa ccc cga gtg etc acc gaa ccc gec aaa ccc gca 
Phe Gly Thr Tyr Lys Pro Arg Val Leu Thr Glu Pro Ala Lys Pro Ala 
2275 2280 2285 



6768 



cat ata aac cag cca gec tec ggc cag gtc acc gaa cca aag gga ate 6816 
His He Asn Gin Pro Ala Ser Gly Gin Val Thr Glu Pro Lys Gly He 
2260 2265 2270 



6864 



aac gec ggc gta gec tct cgc caa cca gag gca acc acc acg gtc ccc 6912 

Asn Ala Gly Val Ala Ser Arg Gin Pro Glu Ala Thr Thr Thr Val Pro 
2290 2295 2300 

aag tta ccg att aat cca ccc acc get agg gtc ttt ata ggg acc gcg 6960 

Lys Leu Pro He Asn Pro Pro Thr Ala Arg Val Phe He Gly Thr Ala 
2305 2310 2315 2320 

tec aaa etc teg cca gec gtc gaa gag age cac ggc gec aca ccc gac 7008 

Ser Lys Leu Ser Pro Ala Val Glu Glu Ser His Gly Ala Thr Pro Asp 
2325 2330 2335 

gca cat cag teg aag ata gat egg gaa aaa tac gec gag agt egg cct 7056 

Ala His Gin Ser Lys He Asp Arg Glu Lys Tyr Ala Glu Ser Arg Pro 
2340 2345 2350 

cgc cgc acc cca cac etc gaa gag ggg cca egg gag cct cac gtc aac 7104 

Arg Arg Thr Pro His Leu Glu Glu Gly Pro Arg Glu Pro His Val Asn 
2355 2360 2365 
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act cca acc age gca cac ata aac gtc ccc tct age caa ggt caa aaa 7152 

Thr Pro Thr Ser Ala His lie Asn Val Pro Ser Ser Gin Gly Gin Lys 
2370 2375 2380 

aca gta cac ggg cgc gaa aat ccc ggc ctt caa aca gca act ccc age 7200 

Thr Val His Gly Arg Glu Asn Pro Gly Leu Gin Thr Ala Thr Pro Ser 

2385 2390 2395 2400 

gec ccc caa cca acc gca tea aac ccg cgc att caa tac acg etc ccc 7248 

Ala Pro Gin Pro Thr Ala Ser Asn Pro Arg lie Gin Tyr Thr Leu Pro 
2405 2410 2415 

aga acg gac ggc egg ttg ctt cac gac gaa teg gag gtg gaa teg acc 72 96 

Arg Thr Asp Gly Arg Leu Leu His Asp Glu Ser Glu Val Glu Ser Thr 
2420 2425 2430 

cca acc gag gag gta aaa cga teg cca aaa aca caa gat gtg tct cac 7344 

Pro Thr Glu Glu Val Lys Arg Ser Pro Lys Thr Gin Asp Val Ser His 
2435 2440 2445 



ggg ccc gaa ccg gac gac tec agg tgg acc gec ccg etc ggt cca acc 
Gly Pro Glu Pro Asp Asp Ser Arg Trp Thr Ala Pro Leu Gly Pro Thr 
2450 2455 2460 



7392 



ata gag att cat cga ctg gaa cac ccc caa att etc aaa aat ata aca 7440 

He Glu He His Arg Leu Glu His Pro Gin He Leu Lys Asn He Thr 

2465 2470 2475 2480 

tea etc acc gtc ccc act ccc aga gtc acc cca ate cct ccc act aac 7488 

Ser Leu Thr Val Pro Thr Pro Arg Val Thr Pro He Pro Pro Thr Asn 

2485 2490 2495 

ate tgg ata ccc eta tec cac gtc aac ate caa cac gaa gaa ate aca 7536 

He Trp He Pro Leu Ser His Val Asn He Gin His Glu Glu He Thr 

2500 2505 2510 

cga gec aag aat gtg tta atg cga ttt att caa aac gta cga aga aaa 7584 

Arg Ala Lys Asn Val Leu Met Arg Phe He Gin Asn Val Arg Arg Lys 

2515 2520 2525 

ctt caa gcg teg tct gac get eta tec gag get att gee aga ata aag 7632 

Leu Gin Ala Ser Ser Asp Ala Leu Ser Glu Ala He Ala Arg He Lys 

2530 2535 2540 

ttt tta tat ctg taa 7647 

Phe Leu Tyr Leu 

2545 



<210> 143 
<211> 2548 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 143 

Net Glu Leu Pro Pro He Phe Ser Lys Phe Lys He Glu Gly Val Ala 

15 10 15 

Thr Thr His Gin Ala Asp Cys Arg Phe Gly Gin Tyr Ala Gly Ser Gin 

20 25 30 

Cys Leu Ser Asn Cys Val He Tyr Leu Ala Gin Ser Tyr Phe Asn Arg 
35 40 45 
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175 




Val 


He 


Ser 


Ser 


Ser 


Asp 


Pro 


Thr 


Ala 


He 


Leu 


Glu 


Tyr 


Val 


Ser 


Pro 
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200 
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He 


Thr 


Asn 


His 
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He 
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He 
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He 


Ser 


Thr 


Gly 


He 
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He 
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Pro 
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Val 
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Ala 


Met 


Asp 


Lys 


He 


Met 


He 


Asn 


He 


He 










485 










490 










495 




Leu 


Glu 


His 


Gly 


Leu 


He 


Thr 


Asp 


Ala 


Gin 


Ala 


Arg 


Gly 


Pro 


Ser 


Ala 








500 










505 










510 






Cys 


Lys 


Asn 


Val 


Leu 


Gin 


Phe 


Phe 


He 


Leu 


Trp 


Gly 


Glu 


Lys 


Leu 


Asn 






515 










520 










525 








He 


Pro 


He 


Ser 


Asp 


Ala 


Lys 


Gin 


Val 


Leu 


Glu 


Leu 


Asp 


Leu 


Gin 


Leu 



530 535 540 

lie Pro Leu His Thr Ala He Ser Glu Gly Lys Phe Lys Gin Gly Ala 
545 550 555 560 

Phe Lys Lys His Leu Thr Thr Lys He Asn Arg Cys Leu Ala Ser Met 

565 570 575 

Arg Ala Thr His Ala Asp Ala Gin Lys Lys Leu Ala Ser Ala Phe Asn 

580 585 590 

Val Glu Gly Ser Gin lie Ser Ser Ser Glu Ala Lys He Ser Val Arg 

595 600 605 

Ala Leu Lys Glu Gin He Ala Asn His Leu Ser Pro Gly Phe Leu Ala 

610 615 620 

Val Tyr Ser Ala Asp Glu Val Lys His Leu Arg Asp Lys He Gin Asp 
625 630 635 640 

Leu Lys Thr Gly He Glu Gin Arg Asn Lys Glu He Gin Gin Glu Glu 

645 650 655 

Leu Phe Phe Asp Ala Met Leu Thr Ala Leu Asp Thr Phe Gin Pro Pro 

660 665 670 

Pro Lys Thr Ala Phe Pro Met Glu He Phe Pro His Arg Lys Thr Glu 

675 680 685 

Val Met Leu Asp His Leu Ala Ser He Thr Thr Arg Leu Thr Glu Asp 

690 695 700 

Ala Thr Glu Ala Leu Asn Asn Tyr Leu Glu Thr Pro Pro Asp Gin Gly 
705 710 715 720 

Thr His He Thr Asn He Pro Asn Phe Ser Ser He Val Ala Asn He 

725 730 735 

He Ser Thr Leu Lys He Leu Thr Tyr Ala Glu Asn Asp Met Gin Leu 

740 745 750 

Asn Val Thr Pro Met Ala Thr Tyr Arg Arg Gin Leu Leu Tyr Leu Gly 

755 760 765 

Gly Glu Leu Ala Thr He Phe Asn Leu Glu Trp Pro Tyr Glu Thr Val 

770 775 780 

Pro Pro Val Gin Glu Leu Pro Leu Val Ala Arg Ala Lys Ala Lys Met 
785 790 795 800 

Glu Ser Val Thr Lys Met Glu Lys Asn Gin Gin Ala Leu Asp Gin He 

805 810 815 

Leu Gly Asp Ala Glu Thr Leu Leu Asp Thr He Thr Ala Thr Ser Gly 

820 825 830 

Asp Glu Asn Pro Val Arg Ala Met Ser He Pro He Leu Glu Thr Tyr 

835 840 845 

He Thr Asn Ala Gly Ala Leu lie Gly Ser Ser Arg Asn Gin Arg Phe 

850 855 860 

Glu Lys Leu Lys Ala Ala lie His Asp Leu Ala Ser Ser Glu Ser Phe 
865 870 875 880 

He lie Met Leu Leu Asn Asn Thr Arg Leu Asp Asn lie Ser Asp Asn 

885 890 895 

Leu Ala Lys He Asp Gly He Leu Thr Asn Asn Thr Arg Phe Leu Ser 

900 905 910 

Asn Ala Thr Val Ser Lys Thr Leu Gin Thr Leu Gly Gly Ser Leu lie 

915 920 925 

Arg Glu Cys Val Glu Ala Leu Asn Lys Arg Ser Pro Ser Ser Leu Asn 

930 935 940 

Asn Ala Arg Leu Leu Ala Val Gin Thr He Leu Gly His Ala Ser Val 
945 950 955 960 

Pro Asp His Glu Thr Leu Thr Arg lie Val Ser Gly Val Ala Ser Ala 

965 970 975 

Gin Lys Glu Ser Ala Gly Asp Asp Pro Asp Arg Trp Thr Arg Val Thr 

980 985 990 

Gly His Leu Asn Glu Leu Lys Leu Val Thr Thr Gin Ser Arg Val Asp 

995 1000 1005 

Lys Ala Thr Arg Arg Lys Leu Leu Met He He Thr Arg Asp Leu Lys 
1010 1015 1020 



Glu Ala Glu Val Ser Gin Glu Thr Val Leu Glu Thr Arg Trp Gin Glu 
1025 1030 1035 1040 

Asn Val Leu Lys Phe Gin Pro Ser Thr Ser Lys Glu lie Glu Asp Phe 

1045 1050 1055 

Leu Gin Ser Ala Pro Ser Ala Lys Ala Arg Lys Phe Ala Glu Lys His 

1060 1065 1070 

Leu Arg Thr Leu lie Thr Gin Phe Asn Gly His Glu Arg Pro Pro Ser 

1075 1080 1085 

Glu Ala Thr Ala Val Pro Met Asp Tyr Thr Pro Thr Pro lie Pro Thr 

1090 1095 1100 

Pro Gin Ala Val Ser Thr Ala Thr Ala Glu Lys Gly Lys Ala Ala Trp 
1105 1110 1115 1120 

Asn Lys lie Gin Gin Ala Phe Gin Asp Phe Asn Phe His Leu lie Asp 

1125 1130 1135 

Ala Ser Asp Trp Gin Glu Met Ala Ser Glu Tyr Ser Arg His Gly Ser 

1140 1145 1150 

Ser Leu Pro Gly Thr Val Gly Pro Lys Leu Val Arg Phe Met Glu Ser 

1155 1160 1165 

lie Ser Asn Thr Leu Asp Asp lie Leu Thr Gin Lys Leu Ala Ser Leu 

1170 1175 1180 

Leu Pro Asn Gly Pro Ala Phe Arg Pro Pro Ala Phe Asp Trp lie Ala 
1185 1190 1195 1200 

Pro Tyr Gin Thr Arg Val Asn Ala Phe Leu Lys Thr lie Gly Leu Pro 

1205 1210 1215 

Met Val Arg Asn Leu Ala Asp Lys lie His His Gin Cys Gin Thr Val 

1220 1225 1230 

Ser His Ala Val Gin Ser Ala Asp Leu Gin Gin Ala Thr Val Gly Thr 

1235 1240 1245 

Ser Leu Glu Arg Pro Ala Ala Glu Tyr Cys Arg lie Leu Ser Asp Met 

1250 1255 1260 

Gin Val Ala Phe Asn Asp His Gly lie Ala Val Arg Ser Glu Ala Ala 
1265 1270 1275 1280 

Ala Tyr Thr Asp Ala lie Asn Ser Pro Ala Asn Val Val Thr Pro Pro 

1285 1290 1295 

Lys Pro Asn Leu Glu Ala Pro Lys Lys Leu lie Thr Ala Thr Asp Ala 

1300 1305 1310 

Leu Thr Val Glu Asp Phe Pro Asp Phe Leu Lys Thr Ser lie Leu Gin 

1315 1320 1325 

Gin Glu Gin Arg Leu lie Ala Leu Gin Arg Ala Glu Phe Gin Gin Leu 

1330 1335 1340 

Glu Ala Ser He Ser Ala Ala Glu Arg Leu Arg Gin Ser Thr Arg Asp 
1345 1350 1355 1360 

Glu He Ala Gly Lys Met Ala Thr Ala He Thr Gin Leu Leu Pro Arg 

1365 1370 1375 

Ala Pro Val Ala He Ser Ser Arg Pro Leu Asn Leu Ser Lys Pro He 

1380 1385 1390 

Asp Phe Leu Ser Ser Thr Val Tyr Asp Lys lie Leu Asp Lys Glu Pro 

1395 1400 1405 

Tyr Glu Thr Ala He Ala Gly Phe Ala Trp Leu Glu He Ala Thr Lys 

1410 1415 1420 

Ser Val Met Val Tyr Ser Gin Gin Asn Glu Thr Gin Gin Leu Asn Val 
1425 1430 1435 1440 

Leu Leu Ser Glu Val Glu Lys Gin Ser Thr Val Ala Gin Arg Leu His 

1445 1450 1455 

Asp Leu Glu Leu Ser Ala Lys Asn Thr Asp Asp Val Lys Val Leu Lys 

1460 1465 1470 

Gin Ala Leu Asp Glu Leu Ala Pro Leu Arg Val Lys Gly Gly Lys Thr 

1475 . 1480 1485 

Thr Val Asp Ala Trp Lys Gin Lys Leu Glu Ser He Glu Ser Leu Leu 

1490 1495 1500 

Arg Ala Thr Arg Thr Ala Gly Glu He Ser Ser Glu Leu Glu Arg He 



1505 1510 1515 1520 

Gly Thr Gin Ala Val Gly Thr He Thr Val Arg Asp Leu Gly Thr Leu 

1525 1530 1535 

Ser Asp Gin Cys Arg Glu Ala Ala Asn Phe Leu Arg Gin Ala Ser Leu 

1540 1545 1550 

Pro Glu Gly Phe Ser Asp He Gly Thr Lys Leu Ser Glu Leu Gin Ala 

1555 1560 1565 

Tyr He Lys Tyr Lys Lys Gin Phe Leu Glu His Phe Glu Thr Thr Gin 

1570 1575 1580 

Pro Asn Val Phe Gin Arg Phe Pro Leu Ser Gin Asn He Thr Glu Asn 
1585 1590 1595 1600 

Val Pro Ala Arg Pro Ala Met Asp Ser Val Ala Arg Leu Thr Asn His 

1605 1610 1615 

Leu His Val Arg Gly Ser Ala Pro His Phe Thr Thr Trp He Glu Thr 

1620 1625 1630 

Leu Pro Thr Val Asp Pro Glu Lys Pro Thr His Val Pro Ala His Gly 

1635 1640 1645 

Gly Ala Pro Leu His Arg Gin He Thr Tyr Ser Asn Val Leu Glu Ala 

1650 1655 1660 

Leu Phe Ser Leu Cys Ser Thr Thr Leu Thr Pro Val Pro Thr Ala Pro 
1665 1670 1675 1680 

Gly Leu Glu lie Ala Thr Arg Ala Arg Arg Gly Ala Glu Ala Ala Thr 

1685 1690 1695 

Trp Met Asp Arg Gin Trp Pro Asp He Ala Gin Thr Leu Gin Asp Val 

1700 1705 1710 

Leu Asp Thr Tyr Glu His Thr Thr Ala His Ala Asn Arg Asp Ala Ala 

1715 1720 1725 

Phe Asn Thr Phe Leu Ala Met Cys Val Phe Thr Gin He He Arg Gly 

1730 1735 1740 

Ala Ser Arg Ala Val Thr Leu Pro Lys Leu Pro Ser Thr Ala Val Asp 
1745 1750 1755 1760 

Phe Pro Glu Glu He Val Leu Thr Pro Arg Glu Cys Thr Thr Leu Val 

1765 1770 1775 

Thr Ala Met Trp Pro Thr Leu Ala Ala Ala He Leu Arg Leu Lys Ser 

1780 1785 1790 

Tyr Ser Glu Ala Leu Gly Leu Met Ser Arg Phe Leu Pro Leu Met Phe 

1795 1800 1805 

Gin Ala Leu Pro His Leu Thr Leu Glu Ala Gin Val Lys Asn Gly Pro 

1810 1815 1820 

His Asn Thr Pro Pro Gin Leu Arg Cys Phe Ala Lys Thr Glu Ala He 
1825 1830 1835 1840 

Pro Tyr Phe Pro Ala Gin Trp Gin Ser Ala Asn Leu Glu Gin Ser Leu 

1845 1850 1855 

Trp Gly Gin Thr Asp Phe Leu Gin He Cys Asp Asn Asn Gin Arg Lys 

1860 1865 1870 

Ala Arg Val Ala Ala Val Thr Trp Ala Leu Thr Thr He Asp Gly Val 

1875 1880 1885 

Val Leu Asp Gin Leu Trp Ser Thr Phe Lys Pro Met Thr Ala Ala Ser 

1890 1895 1900 

Asp Asp Thr Tyr Val Asp Leu Val Glu Thr Leu His Leu Thr Thr Phe 
1905 1910 1915 1920 

Gly Pro Arg Gly Pro Thr Pro Arg Arg Glu Thr Thr Thr Glu His Pro 

1925 1930 1935 

Pro Tyr Glu Tyr Gly Gin Pro Thr Gly Tyr Cys He Ser Gly Gin Ser 

1940 1945 1950 

Thr Thr Pro Val Gin Ala Ser Asn Thr Pro Val Ser Ala Phe Glu Ala 

1955 1960 1965 

Val Leu Gly Ala Met Val Phe His Val Pro He Arg He Phe Leu Ala 

1970 1975 1980 

Ala Thr Pro Lys Arg Leu Gly Gin Ala Arg Gly Gly Met Gly Leu Leu 
1985 1990 1995 2000 



Thr Pro lie Leu Glu Cys Val Pro Asp Val Glu Pro Phe Lys Ser Leu 

2005 2010 2015 

Tyr Asn Ala Pro Arg Lys Pro Val Pro lie Glu Thr Leu Pro Ala Ser 

2020 2025 2030 

Leu His Pro His Asp Glu Arg Gin Val Phe Leu Arg Gin Ala Gin Trp 

2035 2040 2045 

Leu Ser Tyr Arg Phe Thr Pro His Glu Ala Ala Arg Ser Ser Thr Pro 

2050 2055 2060 

Pro Leu Leu Val Val He Asp Pro Glu Asn Leu Val Thr Ala Thr Tyr 
2065 2070 2075 2080 

Ser Ser Gly Gly Pro Ala Asn Phe Glu Ser Arg Pro Phe Tyr Val Met 

2085 2090 2095 

Pro Gly Pro Tyr Pro Pro Asp Trp Pro Lys Thr Leu Ser Val Thr Ser 

2100 2105 2110 

Asn Thr Ser Val Thr His Leu Ser His Asp Glu He Cys Asn Leu Phe 

2115 2120 2125 

Thr Thr Leu Ser Arg Glu His Gly Thr Val Gin Gly Arg Asp He Phe 

2130 2135 2140 

Ala Ala Ala Pro Thr Asn Val Thr Pro Glu Gin Thr Ala Asn Pro Pro 
2145 2150 2155 2160 

Ala Trp Glu Thr Asp Asn Arg Leu He Thr Gin Thr Glu Thr Ala Lys 

2165 2170 2175 

Lys Pro His He He Pro Ala Ser Pro Lys Ala Arg Thr Asp Pro Pro 

2180 2185 2190 

Val Glu Thr Thr Thr His His Ser Gin Gly Gin Ala Ser Gin His Ala 

2195 2200 2205 

Asn Ser Asn Val Asn Gin Pro Gly Gin He Thr Ser His Ala Ser Arg 

2210 2215 2220 

Asn Thr Pro Ser Thr Ala Pro Gin Ala Ser Ser Ser Pro Glu Lys Phe 
2225 2230 2235 2240 

Asn Thr Gin Thr Val Pro Arg Leu He Ser Gin Thr Ser Glu Thr Ala 

2245 2250 2255 

His He Asn Gin Pro Ala Ser Gly Gin Val Thr Glu Pro Lys Gly He 

2260 2265 2270 

Phe Gly Thr Tyr Lys Pro Arg Val Leu Thr Glu Pro Ala Lys Pro Ala 

2275 2280 2285 

Asn Ala Gly Val Ala Ser Arg Gin Pro Glu Ala Thr Thr Thr Val Pro 

2290 2295 2300 

Lys Leu Pro He Asn Pro Pro Thr Ala Arg Val Phe He Gly Thr Ala 
2305 2310 2315 2320 

Ser Lys Leu Ser Pro Ala Val Glu Glu Ser His Gly Ala Thr Pro Asp 

2325 2330 2335 

Ala His Gin Ser Lys He Asp Arg Glu Lys Tyr Ala Glu Ser Arg Pro 

2340 2345 2350 

Arg Arg Thr Pro His Leu Glu Glu Gly Pro Arg Glu Pro His Val Asn 

2355 2360 2365 

Thr Pro Thr Ser Ala His He Asn Val Pro Ser Ser Gin Gly Gin Lys 

2370 2375 2380 

Thr Val His Gly Arg Glu Asn Pro Gly Leu Gin Thr Ala Thr Pro Ser 
2385 2390 2395 2400 

Ala Pro Gin Pro Thr Ala Ser Asn Pro Arg He Gin Tyr Thr Leu Pro 

2405 2410 2415 

Arg Thr Asp Gly Arg Leu Leu His Asp Glu Ser Glu Val Glu Ser Thr 

2420 2425 2430 

Pro Thr Glu Glu Val Lys Arg Ser Pro Lys Thr Gin Asp Val Ser His 

2435 2440 2445 

Gly Pro Glu Pro Asp Asp Ser Arg Trp Thr Ala Pro Leu Gly Pro Thr 

2450 2455 2460 

He Glu He His Arg Leu Glu His Pro Gin He Leu Lys Asn He Thr 
2465 2470 2475 2480 

Ser Leu Thr Val Pro Thr Pro Arg Val Thr Pro He Pro Pro Thr Asn 
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2485 2490 2495 

He Trp He Pro Leu Ser His Val Asn He Gin His Glu Glu He Thr 

2500 2505 2510 

Arg Ala Lys Asn Val Leu Met Arg Phe He Gin Asn Val Arg Arg Lys 

2515 2520 2525 

Leu Gin Ala Ser Ser Asp Ala Leu Ser Glu Ala He Ala Arg He Lys 

2530 2535 2540 

Phe Leu Tyr Leu 
2545 



<210> 144 
<211> 510 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 
<221> CDS 
<222> (1) . - (510) 

<400> 144 

atg teg tec ttg egg gtt aag gag cca ate gtt cag gga cgc ctt gag 

Met Ser Ser Leu Arg Val Lys Glu Pro He Val Gin Gly Arg Leu Glu 

1 5 10 15 

cac gat tac cca aat cac ccg ctg gtt gec gag atg aac aac ctt ccc 
His Asp Tyr Pro Asn His Pro Leu Val Ala Glu Met Asn Asn Leu Pro 
20 25 30 

cag ggt gac atg tct ccc gec cag tat gec ate gcg aaa cgt aat tat 144 
Gin Gly Asp Met Ser Pro Ala Gin Tyr Ala He Ala Lys Arg Asn Tyr 
35 40 45 

ctg gtg ttt tta acg gec aaa cat cat tac gac atg tat atg caa aag 192 
Leu Val Phe Leu Thr Ala Lys His His Tyr Asp Met Tyr Met Gin Lys 
50 55 60 



aag aat gga att ctg cgc aaa gac cac etc egg ggc ctt cgc. ggc aaa 
Lys Asn Gly He Leu Arg Lys Asp His Leu Arg Gly Leu Arg Gly Lys 

70 75 80 



65 



ggg 



tec act teg tta tea cct age gac gca act act ctg gac aca aga 
Gly Ser Thr Ser Leu Ser Pro Ser Asp Ala Thr Thr Leu Asp Thr Arg 
145 150 155 160 



48 



96 



240 



288 



aag gac get agt tct agt ate teg ggc gtt ttg tec ggg tec ggc teg 

Lys Asp Ala Ser Ser Ser He Ser Gly Val Leu Ser Gly Ser Gly Ser 
85 90 95 

gec gec ccg age gtt gec ccg gtg gec tec acc etc ggg tea aat age 336 

Ala Ala Pro Ser Val Ala Pro Val Ala Ser Thr Leu Gly Ser Asn Ser 
100 105 HO 

ttt act acg ate tec age ggg ccc cat tea ttg ata ggc teg atg ggc 384 

Phe Thr Thr He Ser Ser Gly Pro His Ser Leu lie Gly Ser Met Gly 
115 120 125 

ccc gcg ccc ggt ggc ggc gga ccc ggg age gtg gcg tct tct ggc ata 432 

Pro Ala Pro Gly Gly Gly Gly Pro Gly Ser Val Ala Ser Ser Gly He 

130 135 140 



480 
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cgc teg tct caa aat aaa aaa age aag tga 510 
Arg Ser Ser Gin Asn Lys Lys Ser Lys 

165 170 



<210> 145 
<211> 169 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 145 



Met 


Ser 


Ser 


Leu 


Arg 


Val 


Lys 


Glu 


Pro 


He 


Val 


Gin 


Gly 


Arg -Leu 


Glu 


1 








5 










10 










15 




His 


Asp 


Tyr 


Pro 


Asn 


His 


Pro 


Leu 


Val 


Ala 


Glu 


Met 


Asn 


Asn 


Leu 


Pro 




20 










25 










30 






Gin 


Gly 


Asp 


Met 


Ser 


Pro 


Ala 


Gin 


Tyr 


Ala 


He 


Ala 


Lys 


Arg 


Asn 


Tyr 




35 










40 










45 








Leu 


Val 
50 


Phe 


Leu 


Thr 


Ala 


Lys 
55 


His 


His 


Tyr 


Asp 


Met 
60 


Tyr 


Met 


Gin 


Lys 


Lys 


Asn 


Gly 


He 


Leu 


Arg 


Lys 


Asp 


His 


Leu 


Arg 


Gly 


Leu 


Arg 


Gly 


Lys 


65 










70 










75 










80 


Lys 


Asp 


Ala 


Ser 


Ser 


Ser 


He 


Ser 


Gly 


Val 


Leu 


Ser 


Gly 


Ser 


Gly 


Ser 






85 










90 










95 




Ala 


Ala 


Pro 


Ser 
100 


Val 


Ala 


Pro 


Val 


Ala 
105 


Ser 


Thr 


Leu 


Gly 


Ser 
110 


Asn 


Ser 


Phe 


Thr 


Thr 
115 


He 


Ser 


Ser 


Gly 


Pro 
120 


His 


Ser 


Leu 


He 


Gly 
125 


Ser 


Met 


Gly 


Pro 


Ala 
130 


Pro 


Gly 


Gly 


Gly 


Gly 
135 


Pro 


Gly 


Ser 


Val 


Ala 
140 


Ser 


Ser 


Gly 


He 


Gly 


Ser 


Thr 


Ser 


Leu 


Ser 


Pro 


Ser 


Asp 


Ala 


Thr 


Thr 


Leu 


Asp 


Thr 


Arg 


145 










150 










155 










160 


Arg 


Ser 


Ser 


Gin 


Asn 


Lys 


Lys 


Ser 


Lys 
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<210> 146 

<211> 1347 

<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 
<221> CDS 
<222> (1) . - (1347) 

<400> 146 

atg gec teg ggc cgc ttg cct aac ctg get gaa gac gaa get gec tgt 

Met Ala Ser Gly Arg Leu Pro Asn Leu Ala Glu Asp Glu Ala Ala Cys 
15 10 15 

cat ggg cgc ggt tct tat cct gec cat cgt tgg ctg gat ggt tct egg 
His Gly Arg Gly Ser Tyr Pro Ala His Arg Trp Leu Asp Gly Ser Arg 
20 25 30 

ctg ggc tta gat etc gcg gec tct ata cgc tea ate gga eta tgc ccc 144 
Leu Gly Leu Asp Leu Ala Ala Ser He Arg Ser He Gly Leu Cys Pro 
35 40 45 

gaa tgc tac gtg tgt ttt gtg acg tac ggg etc ggt gec tgg gac gga 192 
Glu Cys Tyr Val Cys Phe Val Thr Tyr Gly Leu Gly Ala Trp Asp Gly 



48 



96 
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50 55 60 

cga ccc cca aaa tgg get tgc acg tta att tct gec cca tec ttt cag 240 

Arg Pro Pro Lys Trp Ala Cys Thr Leu He Ser Ala Pro Ser Phe Gin 
65 70 75 80 

acc gec ctg aac gag att get ace ggc tgg agg ccg gat aac cca cct 288 

Thr Ala Leu Asn Glu He Ala Thr Gly Trp Arg Pro Asp Asn Pro Pro 
85 90 95 

aaa aac ggc gac gtg aga teg egg ctg cac gat ate ggc egg teg ttg 3 36 

Lys Asn Gly Asp Val Arg Ser Arg Leu His Asp lie Gly Arg Ser Leu 

100 105 110 

tta gag gcg tac gcg tgg gtg ctg agg tgt ate tgc acc ggc gtg ggc 3 84 

Leu Glu Ala Tyr Ala Trp Val Leu Arg Cys He Cys Thr Gly Val Gly 

115 120 125 

tgt ccc age gac gaa ggt tta agt tta acg gec gtt ccc agg age gcg 43 2 

Cys Pro Ser Asp Glu Gly Leu Ser Leu Thr Ala Val Pro Arg Ser Ala 

130 135 140 

tgg tec agg tat ctt gtt gtg teg ttt cag agg gcg tgt tgt etc gtg 480 

Trp Ser Arg Tyr Leu Val Val Ser Phe Gin Arg Ala Cys Cys Leu Val 
145 150 155 160 

tgc aaa acc ctt aac tgt cgc cag egg ttt ccc ctg gtt acc tgt ttg 528 

Cys Lys Thr Leu Asn Cys Arg Gin Arg Phe Pro Leu Val Thr Cys Leu 
165 170 175 

cca cag cac gcg ttg gat ctg ccc gtt ttg egg aag aaa tgg aac ggg 576 

Pro Gin His Ala Leu Asp Leu Pro Val Leu Arg Lys Lys Trp Asn Gly 

180 185 190 

ggc ggt tgt 9 tc teg atg cag ttg aac gtt ccg tec att tec aga cgt 624 

Gly Gly Cys Val Ser Met Gin Leu Asn Val Pro Ser He Ser Arg Arg 

195 200 205 

ctg gga get aac ctg aat gag agt gtc ccg ggc cct teg gac gcg gga 672 

Leu Gly Ala Asn Leu Asn Glu Ser Val Pro Gly Pro Ser Asp Ala Gly 

210 215 220 

ctt ctg gee teg ctg cga gaa ctg gcg ccg acc gtc ccg tgc ggc aac 720 

Leu Leu Ala Ser Leu Arg Glu Leu Ala Pro Thr Val Pro Cys Gly Asn 
225 230 235 240 

ccg ttt aac gee ctt ctg agg age ctg acg ttt agg gec ctg ctg age 768 

Pro Phe Asn Ala Leu Leu Arg Ser Leu Thr Phe Arg Ala Leu Leu Ser 
245 250 255 

atg tec cgt gtt gtg ttg ccc ata ggg gag teg acc gag acg gag ata 816 

Met Ser Arg Val Val Leu Pro He Gly Glu Ser Thr Glu Thr Glu He 

260 265 270 

tec cgc gac ctg ggt caa aaa gta ttg gcg tat aac gta ctg ttt ccg 864 

Ser Arg Asp Leu Gly Gin Lys Val Leu Ala Tyr Asn Val Leu Phe Pro 

275 280 285 

tgt ate tct eta cct gtt tgg tec cag gtg gtg gee agg age gta ctg 912 

Cys He Ser Leu Pro Val Trp Ser Gin Val Val Ala Arg Ser Val Leu 

290 295 300 
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gag aag acc gtg cca get ccg agg gtg gtg gtg tgc etc gag tgc ggt 960 

Glu Lys Thr Val Pro Ala Pro Arg Val Val Val Cys Leu Glu Cys Gly 

305 310 315 320 

tac tgt ttg aat ttc gga egg ggt aaa ttt gaa acc gta aat ttc cca 1008 

Tyr Cys Leu Asn Phe Gly Arg Gly Lys Phe Glu Thr Val Asn Phe Pro 
325 330 335 

ccc acc aac gtg ttt ttc agt cgc gac caa aaa gag aag cag ctt teg 1056 

Pro Thr Asn Val Phe Phe Ser Arg Asp Gin Lys Glu Lys Gin Leu Ser 

340 345 350 

ate tgt get acc acc ggg cgc gta tat tgc tct tat tgt ggc gga teg 1104 

He Cys Ala Thr Thr Gly Arg Val Tyr Cys Ser Tyr Cys Gly Gly Ser 

355 360 365 

cac atg egg gtt ata tct eta ttc gag ate acg tgt gtt gga gat ccc 1152 

His Met Arg Val He Ser Leu Phe Glu He Thr Cys Val Gly Asp Pro 

370 375 380 

tat tta egg tgc gtg etc gec aac aac gcg gca cat gee ata cga gac 1200 

Tyr Leu Arg Cys Val Leu Ala Asn Asn Ala Ala His Ala He Arg Asp 

385 390 395 400 

gcg aac tec ctg gtt age gtc gtc gtg ccc tgt ttg gcg teg ccg gac 1248 

Ala Asn Ser Leu Val Ser Val Val Val Pro Cys Leu Ala Ser Pro Asp 
405 410 415 

tgc gcg acc ggc eta tta aag cat ttg cgt gtg gec gag ctg ttt tat 1296 

Cys Ala Thr Gly Leu Leu Lys His Leu Arg Val Ala Glu Leu Phe Tyr 

420 " 425 430 

ttg acc teg tct ata teg tec ctg tec tgt gga aag tgt aat aga agt 1344 

Leu Thr Ser Ser He Ser Ser Leu Ser Cys Gly Lys Cys Asn Arg Ser 

435 440 445 



taa 



1347 



<210> 147 
<211> 448 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 



<400> 147 



Met 


Ala 


Ser 


Gly 


Arg 


Leu 


Pro 


Asn 


Leu 


Ala 


Glu 


Asp 


Glu 


Ala 


Ala 


Cys 


1 








5 










10 










15 




His 


Gly 


Arg 


Gly 
20 


Ser 


Tyr 


Pro 


Ala 


His 
25 


Arg 


Trp 


Leu 


Asp 


Gly 
30 


Ser 


Arg 


Leu 


Gly 


Leu 
35 


Asp 


Leu 


Ala 


Ala 


Ser 
40 


He 


Arg 


Ser 


He 


Gly 
45 


Leu 


Cys 


Pro 


Glu 


Cys 
50 


Tyr 


Val 


Cys 


Phe 


Val 
55 


Thr 


Tyr 


Gly 


Leu 


Gly 
60 


Ala 


Trp 


Asp 


Gly 


Arg 


Pro 


Pro 


Lys 


Trp 


Ala 


Cys 


Thr 


Leu 


lie 


Ser 


Ala 


Pro 


Ser 


Phe 


Gin 


65 










70 










75 










80 


Thr 


Ala 


Leu 


Asn 


Glu 
85 


He 


Ala 


Thr 


Gly 


Trp 
90 


Arg 


Pro 


Asp 


Asn 


Pro 
95 


Pro 


Lys 


Asn 


Gly 


Asp 
100 


Val 


Arg 


Ser 


Arg 


Leu 
105 


His 


Asp 


He 


Gly 


Arg 
110 


Ser 


Leu 
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Leu Glu Ala Tyr Ala Trp Val Leu Arg Cys He Cys Thr Gly Val Gly 

115 120 125 

Cys Pro Ser Asp Glu Gly Leu Ser Leu Thr Ala Val Pro Arg Ser Ala 

130 135 140 

Trp Ser Arg Tyr Leu Val Val Ser Phe Gin Arg Ala Cys Cys Leu Val 
145 150 155 160 

Cys Lys Thr Leu Asn Cys Arg Gin Arg Phe Pro Leu Val Thr Cys Leu 

165 170 175 

Pro Gin His Ala Leu Asp Leu Pro Val Leu Arg Lys Lys Trp Asn Gly 

180 185 190 

Gly Gly Cys Val Ser Met Gin Leu Asn Val Pro Ser He Ser Arg Arg 

195 200 205 

Leu Gly Ala Asn Leu Asn Glu Ser Val Pro Gly Pro Ser Asp Ala Gly 

210 215 220 

Leu Leu Ala Ser Leu Arg Glu Leu Ala Pro Thr Val Pro Cys Gly Asn 
225 230 235 240 

Pro Phe Asn Ala 'Leu Leu Arg Ser Leu Thr Phe Arg Ala Leu Leu Ser 

245 250 255 

Met Ser Arg Val Val Leu Pro He Gly Glu Ser Thr Glu Thr Glu He 

260 265 270 

Ser Arg Asp Leu Gly Gin Lys Val Leu Ala Tyr Asn Val Leu Phe Pro 

275 280 285 

Cy-s He Ser Leu Pro Val Trp Ser Gin Val Val Ala Arg Ser Val Leu 

290 295 300 

Glu Lys Thr Val Pro Ala Pro Arg Val Val Val Cys Leu Glu Cys Gly 
305 310 315 320 

Tyr Cys Leu Asn Phe Gly Arg Gly Lys Phe Glu Thr Val Asn Phe Pro 

325 330 335 

Pro Thr Asn Val Phe Phe Ser Arg Asp Gin Lys Glu Lys Gin Leu Ser 

340 345 350 

He Cys Ala Thr Thr Gly Arg Val Tyr Cys Ser Tyr Cys Gly Gly Ser 

355 360 365 

His Met Arg Val He Ser Leu Phe Glu He Thr Cys Val Gly Asp Pro 

370 375 380 

Tyr Leu Arg Cys Val Leu Ala Asn Asn Ala Ala His Ala He Arg Asp 
385 390 395 400 

Ala Asn Ser Leu Val Ser Val Val Val Pro Cys Leu Ala Ser Pro Asp 

405 410 415 

Cys Ala Thr Gly Leu . Leu Lys His Leu Arg Val Ala Glu Leu Phe Tyr 

420 425 430 

Leu Thr Ser Ser He Ser Ser Leu Ser Cys Gly Lys Cys Asn Arg Ser 
435 440 445 



<210> 148 
<211> 675 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (675) 

<400> 148 

atg agt tec gga aaa cgt tta gtt gac gag ctg tgc gat tta gtt gtc 

Met Ser Ser Gly Lys Arg Leu Val Asp Glu Leu Cys Asp Leu Val Val 
15 10 15 

tct tac ctg gga ccc tct gga ate tec ctg gac ttg gag agg tgt cag 
Ser Tyr Leu Gly Pro Ser Gly He Ser Leu Asp Leu Glu Arg Cys Gin 



20 



25 



30 



gac ggg get ccc gta tac get aaa ggc ggg gcg gtc ccc gtg tgc ace 
Asp Gly Ala Pro Val Tyr Ala Lys Gly Gly Ala Val Pro Val Cys Thr 
35 40 45 

gtg cgc ctg cag cac ggc tgc gtc tat cat etc gag ttt gtg tat aag 
Val Arg Leu Gin His Gly Cys Val Tyr His Leu Glu Phe Val Tyr Lys 
50 55 60 

ttt tgg etc cac aaa eta gag aga ctg gec tac ccg ttt gec ccg tgt 
Phe Trp Leu His Lys Leu Glu Arg Leu Ala Tyr Pro Phe Ala Pro Cys 
65 70 75 80 

ttt gta att ate aac aac ggt ttg gee ace acg ctg aaa tgt ttt ttg 
Phe Val He He Asn Asn Gly Leu Ala Thr Thr Leu Lys Cys Phe Leu 
85 90 95 

tgt aag cca cgt gac gec gat gec cag ttt gga aaa aac ctg cct ata 
Cys Lys Pro Arg Asp Ala Asp Ala Gin Phe Gly Lys Asn Leu Pro He 
100 105 HO 

aat teg gac gtg tat ctt gag agg aac teg tec gtg tec ctg ggc cag 
Asn Ser Asp Val Tyr Leu Glu Arg Asn Ser Ser Val Ser Leu Gly Gin 
115 120 125 

gac gat ttt atg aaa ttt aag gca cgt ctg gtt ttc tec gga gac eta 
Asp Asp Phe Met Lys Phe Lys Ala Arg Leu Val Phe Ser Gly Asp Leu 



130 



135 140 



aac gtt tac age tec atg gtc ata tgc cgc ace tac ttt acg gag cac 
Asn Val Tyr Ser Ser Met Val He Cys Arg Thr Tyr Phe Thr Glu His 
145 150 155 160 



144 



192 



240 



288 



336 



384 



432 



480 



528 



cga cag gtt tta cag ttt ttg gtc gtg act cca aag age get aaa egg 
Arg Gin Val Leu Gin Phe Leu Val Val Thr Pro Lye Ser Ala Lys Arg 
165 170 175 

tta aaa acc ctt ctt aga acg gtt ttt gec ctg acg ggt cac tec gac 576 
Leu Lys Thr Leu Leu Arg Thr Val Phe Ala Leu Thr Gly His Ser Asp 
180 185 190 

ggc etc ggt gcg ttg agg cga acg ggc tec gtg gec cgc cct teg ggg 624 
Gly Leu Gly Ala Leu Arg Arg Thr Gly Ser Val Ala Arg Pro Ser Gly 
195 200 205 

teg gag ttg aag gat att ggg cgc gga gag cgt gcg gcg atg acc aat 672 
Ser Glu Leu Lys Asp He Gly Arg Gly Glu Arg Ala Ala Met Thr Asn 
210 215 220 



taa 



675 



225 



<210> 149 
<211> 224 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 149 



264 
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<210> 150 
<211> 1374 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 
<221> CDS 
<222> (1) . . (1374) 

<400> 150 

atg ttc gtc ccg tgg caa ctg gag acc etc atg aga cac tgg ccc tct 

Met Phe Val Pro Trp Gin Leu Glu Thr Leu Met Arg His Trp Pro Ser 
1 5 10 15 

ctg egg gga etc gta gaa caa tec ttc etc ccc ggt acc ccg gac gga 
Leu Arg Gly Leu Val Glu Gin Ser Phe Leu Pro Gly Thr Pro Asp Gly 
20 25 30 

get ttt aac age ccg gta tta ate cac act cag gac tct eta caa ccc 
Ala Phe Asn Ser Pro Val Leu lie His Thr Gin Asp Ser Leu Gin Pro 
35 40 45 

gec tea teg tgc agg gtg tgt age etc ctg ttc act ctg gtc egg aca 192 
Ala Ser Ser Cys Arg Val Cys Ser Leu Leu Phe Thr Leu Val Arg Thr 
50 55 60 



ttc cca ccc ccc gac tct ttc ttc gaa gac tac ggc tgg ttg tgc etc 
Phe Pro Pro Pro Asp Ser Phe Phe Glu Asp Tyr Gly Trp Leu Cys Leu 
65 70 75 80 



48 



96 



144 



240 



265 
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acc tgc eta tac gec ccc cga tea tgg acg get acc etc atg gtg get 
Thr Cys Leu Tyr Ala Pro Arg Ser Trp Thr Ala Thr Leu Met Val Ala 
85 90 95 



288 



gec gac ctt ttg gaa eta acg cac gtg tac ttc ccg caa tgc gtg aaa 336 
Ala Asp Leu Leu Glu Leu Thr His Val Tyr Phe Pro Gin Cys Val Lys 
100 105 110 



gat ggg cca gta tac acc gec caa age ate etc gga ate gac gtc cag 
Asp Gly Pro Val Tyr Thr Ala Gin Ser He Leu Gly He Asp Val Gin 
115 120 125 



ggc atg tta gaa ggc acg att ccg gga teg ttc tgt ttt aaa acg tec 
Gly Met Leu Glu Gly Thr He Pro Gly Ser Phe Cys Phe Lys Thr Ser 
165 170 175 



ctg cgt cgc gac att ctg gaa aac atg gaa aac aac gtt aag etc gtc 
Leu Arg Arg Asp He Leu Glu Asn Met Glu Asn Asn Val Lys Leu Val 
305 310 315 320 

aat cgc ata teg tac ate eta aac gat ccg gac tea ctg tea cac gtg 

266 



384 



ctg cac ttc ttc gca acc cgc tgc ttc cga ccc ate gac aga gaa caa 432 

Leu His Phe Phe Ala Thr Arg Cys Phe Arg Pro He Asp Arg Glu Gin 
130 135 140 

ata etc cac aca tct cat tta aat ttt tta caa acc gag ttt att agg 480 

He Leu His Thr Ser His Leu Asn Phe Leu Gin Thr Glu Phe He Arg 

145 150 155 160 



528 



tgg ccg cgc aca gaa aag gac gac caa caa cct acc gtt gcg tgt tgt 576 

Trp Pro Arg Thr Glu Lys Asp Asp Gin Gin Pro Thr Val Ala Cys Cys 
180 185 190 

tec gtt ggc cgc gga agt cac acc aac egg gat aac cgc eta ccc gag 624 

Ser Val Gly Arg Gly Ser His Thr Asn Arg Asp Asn Arg Leu Pro Glu 
195 200 205 

gac ctg gaa gag gcg ttc aac tec acg aac gec gag gaa aag ccc age 672 

Asp Leu Glu Glu Ala Phe Asn Ser Thr Asn Ala Glu Glu Lys Pro Ser 
210 215 220 

etc etc ggc gtc ttt teg gca acg tgg gca gaa tec cag ctt ctt ggc 720 

Leu Leu Gly Val Phe Ser Ala Thr Trp Ala Glu Ser Gin Leu Leu Gly 

225 230 235 240 

tec gac aca caa cag gca gat acc cat tta caa ccc tec gec ttc cca 

Ser Asp Thr Gin Gin Ala Asp Thr His Leu Gin Pro Ser Ala Phe Pro 
245 250 255 

acc cca gaa gat get gac caa tea cag ggc ccc tgc ctg atg cac cca 

Thr Pro Glu Asp Ala Asp Gin Ser Gin Gly Pro Cys Leu Met His Pro 
260 265 270 

acg etc aac eta aaa aca aaa aac cac acc gca tec ata tgc gtt eta 864 

Thr Leu Asn Leu Lys Thr Lys Asn His Thr Ala Ser He Cys Val Leu 
275 280 285 

tgc gag tgt ctg gec gca cac ccg gac gec ggt ccg gtt ctg aaa gat 912 

Cys Glu Cys Leu Ala Ala His Pro Asp Ala Gly Pro Val Leu Lys Asp 
290 295 300 



768 



816 



960 



100 



Asn Arg lie Ser Tyr lie Leu Asn Asp Pro Asp Ser Leu Ser His Val 

325 330 335 

cgc gac gaa cat ctg cgc ggc eta att aaa egg tgc teg gca caa gaa 1056 

Arg Asp Glu His Leu Arg Gly Leu lie Lys Arg Cys Ser Ala Gin Glu 

340 345 350 

ate cac aag cat ttt ttt tgc gac ccg gtg tgc gtc ctg aac acg tac 1104 

lie His Lys His Phe Phe Cys Asp Pro Val Cys Val Leu Asn Thr Tyr 

355 360 365 

teg cac tgt ccc gcg gtt tta ttt aaa tgc cca cct ccc gaa aag tat 1152 

Ser His Cys Pro Ala Val Leu Phe Lys Cys Pro Pro Pro Glu Lys Tyr 

370 375 380 

aag aag etc aaa get cgt ctg gca acc gga gag ttc eta gac tgc aac 12 00 

Lys Lys Leu Lys Ala Arg Leu Ala Thr Gly Glu Phe Leu Asp Cys Asn 

385 390 395 400 

aga ata ttt gac tgc gag acc tta cag acc ctg gec gtc etc ttt aag 1248 

Arg lie Phe Asp Cys Glu Thr Leu Gin Thr Leu Ala Val Leu Phe Lys 

405 410 415 

ggg tct caa ctg gec aaa ate ggc aaa acc acg teg etc gag ata ate 1296 

Gly Ser Gin Leu Ala Lys lie Gly Lys Thr Thr Ser Leu Glu lie lie 

420 425 430 



cgt gaa etc gga ttt caa ctg cgt cga cac aac att caa ate acc cac 
Arg Glu Leu Gly Phe Gin Leu Arg Arg His Asn lie Gin lie Thr His 
435 440 445 

ccg ttt caa acc tec aac eta tac att taa 
Pro Phe Gin Thr Ser Asn Leu Tyr lie 
450 455 



<210> 151 
<211> 457 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 151 

Met Phe Val Pro Trp Gin Leu Glu Thr Leu Met Arg His Trp Pro Ser 

1 5 10 15 

Leu Arg Gly Leu Val Glu Gin Ser Phe Leu Pro Gly Thr Pro Asp Gly 

20 25 30 

Ala Phe Asn Ser Pro Val Leu lie His Thr Gin Asp Ser Leu Gin Pro 

35 40 45 

Ala Ser Ser Cys Arg Val Cys Ser Leu Leu Phe Thr Leu Val Arg Thr 

50 55 60 

Phe Pro Pro Pro Asp Ser Phe Phe Glu Asp Tyr Gly Trp Leu Cys Leu 
65 70 75 80 

Thr Cys Leu Tyr Ala Pro Arg Ser Trp Thr Ala Thr Leu Met Val Ala 

85 90 95 

Ala Asp Leu Leu Glu Leu Thr His Val Tyr Phe Pro Gin Cys Val Lys 

100 105 110 

Asp Gly Pro Val Tyr Thr Ala Gin Ser lie Leu Gly lie Asp Val Gin 

115 120 125 

Leu His Phe Phe Ala Thr Arg Cys Phe Arg Pro He Asp Arg Glu Gin 

130 135 140 

He Leu His Thr Ser His Leu Asn Phe Leu Gin Thr Glu Phe He Arg 



1344 



1374 



267 



145 150 155 160 

Gly Met Leu Glu Gly Thr lie Pro Gly Ser Phe Cys Phe Lys Thr Ser 

165 170 175 

Trp Pro Arg Thr Glu Lys Asp Asp Gin Gin Pro Thr Val Ala Cys Cys 

180 185 190 

Ser Val Gly Arg Gly Ser His Thr Asn Arg Asp Asn Arg Leu Pro Glu 

195 200 205 

Asp Leu Glu Glu Ala Phe Asn Ser Thr Asn Ala Glu Glu Lys Pro Ser 

210 215 220 

Leu Leu Gly Val Phe Ser Ala Thr Trp Ala Glu Ser Gin Leu Leu Gly 
225 230 235 240 

Ser Asp Thr Gin Gin Ala Asp Thr His Leu Gin Pro Ser Ala Phe Pro 

245 250 255 

Thr Pro Glu Asp Ala Asp Gin Ser Gin Gly Pro Cys Leu Met His Pro 

260 265 270 

Thr Leu Asn Leu Lys Thr Lys Asn His Thr Ala Ser lie Cys Val Leu 

275 280 285 

Cys Glu Cys Leu Ala Ala His Pro Asp Ala Gly Pro Val Leu Lys Asp 

290 295 300 

Leu Arg Arg Asp lie Leu Glu Asn Met Glu Asn Asn Val Lys Leu Val 
305 310 315 320 

Asn Arg He Ser Tyr He Leu Asn Asp Pro Asp Ser Leu Ser His Val 

325 330 335 

Arg Asp Glu His Leu Arg Gly Leu He Lys Arg Cys Ser Ala Gin Glu 

340 345 350 

He His Lys His Phe Phe Cys Asp Pro Val Cys Val Leu Asn Thr Tyr 

355 360 365 

Ser His Cys Pro Ala Val Leu Phe Lys Cys Pro Pro Pro Glu Lys Tyr 

370 375 380 

Lys Lys Leu Lys Ala Arg Leu Ala Thr Gly Glu Phe Leu Asp Cys Asn 
385 390 395 400 

Arg He Phe Asp Cys Glu Thr Leu Gin Thr Leu Ala Val Leu Phe Lys 

405 410 415 

Gly Ser Gin Leu Ala Lys He Gly Lys Thr Thr Ser Leu Glu He He 

420 425 430 

Arg Glu Leu Gly Phe Gin Leu Arg Arg His Asn He Gin He Thr His 

435 440 445 

Pro Phe Gin Thr Ser Asn Leu Tyr He 
450 455 



<210> 152 
<211> 894 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (894) 

<400> 152 

atg cca aaa cag ccc aga agt cga ttg gcg tct cga gcg ccg tac gca 

Met Pro Lys Gin Pro Arg Ser Arg Leu Ala Ser Arg Ala Pro Tyr Ala 
15 10 15 

cct age gtc agg cga ccg gac ggg ccc cag tec acg cga ccg gca tec 
Pro Ser Val Arg Arg Pro Asp Gly Pro Gin Ser Thr Arg Pro Ala Ser 
20 25 30 



agg cac ggc age tgc aaa age gaa ate atg cag tgg aaa aag tta gtt 

268 
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Arg His Gly Ser Cys Lys Ser Glu lie Met Gin Trp Lys Lys Leu Val 
35 40 45 

tea gac acg cag ttt ttt tct gec eta acg cgc cgc cac gag ctg ggg 192 

Ser Asp Thr Gin Phe Phe Ser Ala Leu Thr Arg Arg His Glu Leu Gly 
50 55 60 



gtg gac ttt tta aga gaa atg ggg acc ccg ata tgc acc tea aag tec 
Val Asp Phe Leu Arg Glu Met Gly Thr Pro He Cys Thr Ser Lys Ser 

70 75 80 



65 



ctg cga etc att atg gac aac ctt cca aat tat gac ata acg gtg gac 

Leu Arg Leu He Met Asp Asn Leu Pro Asn Tyr Asp He Thr Val Asp 

225 230 235 240 

tgc ate aag caa acg tac ata atg aag ttt aca ccc teg cga ccg gac 

Cys He Lys Gin Thr Tyr He Met Lys Phe Thr Pro Ser Arg Pro Asp 

245 250 255 

aac gca acc gtg acg gtt cct gtc aac age att tgc gag gec gtg gec 

Asn Ala Thr Val Thr Val Pro Val Asn Ser He Cys Glu Ala Val Ala 

260 265 270 

acc eta gac tgc acc gac gag ttt cga gaa gaa att caa agg ggc acg 

Thr Leu Asp Cys Thr Asp Glu Phe Arg Glu Glu He Gin Arg Gly Thr 



240 



288 



336 



384 



gtt atg ttg ccg tta aac eta aaa acc ate gec ccg ggt egg tgc gtc 

Val Met Leu Pro Leu Asn Leu Lys Thr He Ala Pro Gly Arg Cys Val 

85 90 95 

tct etc tea tea ttc gga cac teg tea aac atg ggg ttc aac tgt teg 

Ser Leu Ser Ser Phe Gly His Ser Ser Asn Met Gly Phe Asn Cys Ser 

100 105 HO 

teg tgc acg cca act gac agg tea gcg gtg tct ctg gac gca aac gcg 

Ser Cys Thr Pro Thr Asp Arg Ser Ala Val Ser Leu Asp Ala Asn Ala 

115 120 125 

etc ggc gaa gat tec gec agg aaa aac age gag ctg tgt tea gtg gcg 432 

Leu Gly Glu Asp Ser Ala Arg Lys Asn Ser Glu Leu Cys Ser Val Ala 

130 135 140 

tta acc ttt tac cac cac gec gaa aag gtc gtg cag cac aag ggc ttt 480 

Leu Thr Phe Tyr His His Ala Glu Lys Val Val Gin His Lys Gly Phe 

145 150 155 160 

tac ctg tct ctg etc age cac tec atg gaa gtc gtt agg aaa age ttc 528 

Tyr Leu Ser Leu Leu Ser His Ser Met Glu Val Val Arg Lys Ser Phe 

165 170 175 

acg caa ccc ggg ttg etc tac gec cac eta gtc eta aaa acc ttt ggc 576 

Thr Gin Pro Gly Leu Leu Tyr Ala His Leu Val Leu Lys Thr Phe Gly 

180 185 190 

cac gat cct tta cct att ttt aca gtc gat gec gat gag aga etc gca 624 

His Asp Pro Leu Pro He Phe Thr Val Asp Ala Asp Glu Arg Leu Ala 

195 200 205 

etc tgg gcg gtg ttc cac act aga gac eta cac ctg ggg gaa acc agt 672 

Leu Trp Ala Val Phe His Thr Arg Asp Leu His Leu Gly Glu Thr Ser 

210 215 220 



720 



768 



816 



864 



269 




275 280 285 

gcc ate ata aac tec cag ggg eta ttg taa 894 

Ala lie lie Asn Ser Gin Gly Leu Leu 

290 295 



<210> 153 
<211> 297 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 153 

Met Pro Lys Gin Pro Arg Ser Arg Leu Ala Ser Arg Ala Pro Tyr Ala 

1 5 10 15 

Pro Ser Val Arg Arg Pro Asp Gly Pro Gin Ser Thr Arg Pro Ala Ser 

20 25 30 

Arg His Gly Ser Cys Lys Ser Glu He Met Gin Trp Lys Lys Leu Val 

35 40 45 

Ser Asp Thr Gin Phe Phe Ser Ala Leu Thr Arg Arg His Glu Leu Gly 

50 55 60 

Val Asp Phe Leu Arg Glu Met Gly Thr Pro He Cys Thr Ser Lys Ser 
65 70 75 80 

Val Met Leu Pro Leu Asn Leu Lys Thr He Ala Pro Gly Arg Cys Val 

85 90 95 

Ser Leu Ser Ser Phe Gly His Ser Ser Asn Met Gly Phe Asn Cys Ser 

100 105 HO 

Ser Cys Thr Pro Thr Asp Arg Ser Ala Val Ser Leu Asp Ala Asn Ala 

115 120 125 

Leu Gly Glu Asp Ser Ala Arg Lys Asn Ser Glu Leu Cys Ser Val Ala 

130 135 140 

Leu Thr Phe Tyr His His Ala Glu Lys Val Val Gin His Lys Gly Phe 
145 150 155 160 

Tyr Leu Ser Leu Leu Ser His Ser Met Glu Val Val Arg Lys Ser Phe 

165 170 175 

Thr Gin Pro Gly Leu Leu Tyr Ala His Leu Val Leu Lys Thr Phe Gly 

180 185 190 

His Asp Pro Leu Pro He Phe Thr Val Asp Ala Asp Glu Arg Leu Ala 

195 200 205 

Leu Trp Ala Val Phe His Thr Arg Asp Leu His Leu Gly Glu Thr Ser 

210 215 220 

Leu Arg Leu He Met Asp Asn Leu Pro Asn Tyr Asp He Thr Val Asp 
225 230 235 240 

Cys He Lys Gin Thr Tyr He Met Lys Phe Thr Pro Ser Arg Pro Asp 

245 250 255 

Asn Ala Thr Val Thr Val Pro Val Asn Ser He Cys Glu Ala Val Ala 

260 265 270 

Thr Leu Asp Cys Thr Asp Glu Phe Arg Glu Glu He Gin Arg Gly Thr 

275 280 285 

Ala He He Asn Ser Gin Gly Leu Leu 
290 295 



<210> 154 
<211> 525 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 
<221> CDS 



270 
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<222> (1) 



(525) 



<400> 154 

atg ttc ccg cat aag egg tta gtg gac ttc ggt cgc cac eta gag gca 

Met Phe Pro His Lys Arg Leu Val Asp Phe Gly Arg His Leu Glu Ala 

1 5 10 15 



48 



gac gac aga gag gee gtg ctg tgg ctt ttt gac egg ccg gcg tec gac 
Asp Asp Arg Glu Ala Val Leu Trp Leu Phe Asp Arg Pro Ala Ser Asp 
20 25 30 



96 



gat acg ccc gag ggg ttt gca aac ggg ctg tgc ccc tea act gga gaa 
Asp Thr Pro Glu Gly Phe Ala Asn Gly Leu Cys Pro Ser Thr Gly Glu 
35 40 45 



144 



ccc ggt att ccc etc ccg gtg ttg ctg gag gec gtg ttt etc gtt ggg 
Pro Gly He Pro Leu Pro Val Leu Leu Glu Ala Val Phe Leu Val Gly 
50 55 60 



192 



cga ttg gac ctg gtc tec acc ttt ttt tta eta gac gtg gga ttt att 
Arg Leu Asp Leu Val Ser Thr Phe Phe Leu Leu Asp Val Gly Phe He 
65 70 75 80 



240 



ate gag agg etc egg tec age ccc agt tac ttt agt cca tac aaa cac 
He Glu Arg Leu Arg Ser Ser Pro Ser Tyr Phe Ser Pro Tyr Lys His 
85 90 95 



288 



ctg atg etc tec att gac cgc cag etc tea gaa agg gac gtg aaa aat 
Leu Met Leu Ser He Asp Arg Gin Leu Ser Glu Arg Asp Val Lys Asn 
100 105 HO 



336 



tta gtt ttt eta acg ggc gac cag ctt ggt cgc agg cgc aac cag tea 
Leu Val Phe Leu Thr Gly Asp Gin Leu Gly Arg Arg Arg Asn Gin Ser 
115 120 125 

ccc acc ttt ttt egg tgg etc teg caa atg gaa aag gcg gee ctg gtc 
Pro Thr Phe Phe Arg Trp Leu Ser Gin Met Glu Lys Ala Ala Leu Val 
130 135 140 



384 



432 



age ccc tea aac tac atg gtt tta agt gac ctg ctg cag gec gtc tec 
Ser Pro Ser Asn Tyr Met Val Leu Ser Asp Leu Leu Gin Ala Val Ser 
145 150 155 160 

agg cgc gac gtg gee aag gtc gtg gec gec aac gca ccc ggt taa 
Arg Arg Asp Val Ala Lys Val Val Ala Ala Asn Ala Pro Gly 

165 170 175 



480 



525 



<210> 155 
<211> 174 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 



<400> 155 

Met Phe Pro His Lys Arg Leu Val Asp Phe Gly Arg His Leu Glu Ala 

15 10 15 

Asp Asp Arg Glu Ala Val Leu Trp Leu Phe Asp Arg Pro Ala Ser Asp 

20 25 30 

Asp Thr Pro Glu Gly Phe Ala Asn Gly Leu Cys Pro Ser Thr Gly Glu 

3 5 4 0 4 5 

Pro Gly He Pro Leu Pro Val Leu Leu Glu Ala Val Phe Leu Val Gly 

271 
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50 

Arg Leu Asp Leu 
65 

lie Glu Arg Leu 

Leu Met Leu Ser 
100 

Leu Val Phe Leu 
115 

Pro Thr Phe Phe 
130 

Ser Pro Ser Asn 
145 

Arg Arg Asp Val 



55 

Val Ser Thr Phe 
70 

Arg Ser Ser Pro 
85 

lie Asp Arg Gin 

Thr Gly Asp Gin 
120 

Arg Trp Leu Ser 
135 

Tyr Met Val Leu 
150 

Ala Lys Val Val 
165 



60 

Phe Leu Leu Asp 

75 

Ser Tyr Phe Ser 
90 

Leu Ser Glu Arg 
105 

Leu Gly Arg Arg 

Gin Met Glu Lys 
140 

Ser Asp Leu Leu 
155 

Ala Ala Asn Ala 
170 



Val Gly Phe lie 
80 

Pro Tyr Lys His 
95 

Asp Val Lys Asn 
110 

Arg Asn Gin Ser 
125 

Ala Ala Leu Val 

Gin Ala Val Ser 
160 

Pro Gly 



<210> 156 
<211> 765 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . - (765) 

<400> 156 

atg get tct gtt ggc cca gtg ccc acc ggg acc att gac ccc gtc ctg 

Met Ala Ser Val Gly Pro Val Pro Thr Gly Thr lie Asp Pro Val Leu 
15 10 15 

tac cag gac egg gec etc tct aac etc ctg gec cac gaa gcg tec ttt 
Tyr Gin Asp Arg Ala Leu Ser Asn Leu Leu Ala His Glu Ala Ser Phe 
20 25 30 



ttc aag gec gec ttt tta tgc ttt tgc gcg gcg gag gat ttt tec gtg 
Phe Lys Ala Ala Phe Leu Cys Phe Cys Ala Ala Glu Asp Phe Ser Val 
115 120 125 



48 



96 



gta aca tec acg gec tgc tat ggg acg ata caa acg gag gtg acg gtg 144 
Val Thr Ser Thr Ala Cys Tyr Gly Thr lie Gin Thr Glu Val Thr Val 
35 40 45 



ggc atg cga gtt ate ttg ggg acg tgg atg egg tct gtt gec agg gec 192 
Gly Met Arg Val lie Leu Gly Thr Trp Met Arg Ser Val Ala Arg Ala 
50 55 60 



cac cag gcg gat get age gtt ttc cca ctg gec gtt agt att ctg gat 240 

His Gin Ala Asp Ala Ser Val Phe Pro Leu Ala Val Ser lie Leu Asp 

65 70 75 80 

cgc tat eta gag tgt cgc age ate ccc cgt egg egg ttt cag agg ctg 28 8 

Arg Tyr Leu Glu Cys Arg Ser lie Pro Arg Arg Arg Phe Gin Arg Leu 

85 90 95 

ggg gee gec tgt eta ttt tta gec ggc aag att aga gac ctg aat cct 336 

Gly Ala Ala Cys Leu Phe Leu Ala Gly Lys lie Arg Asp Leu Asn Pro 

100 105 HO 



384 



gcg gat tta ctg aaa cag gaa aaa tea ate ctt aag get tta agg tgg 432 

272 



Ala Asp Leu Leu Lys Gin Glu Lys Ser lie Leu Lys Ala Leu Arg Trp 

130 135 140 

aaa eta gag gec gtc tta cca acg gac gec ate ggg ccc acc ctg ttt 480 

Lys Leu Glu Ala Val Leu Pro Thr Asp Ala lie Gly Pro Thr Leu Phe 

145 150 155 160 



aaa age ggc ttt aca aaa gag cag etc ttt gec ctt cac age cag gtg 
Lys Ser Gly Phe Thr Lys Glu Gin Leu Phe Ala Leu His Ser Gin Val 
165 170 175 



get gcg ccg cct ccg gec aga ttg gcg gag gee gtc ggc gtt teg gec 

Ala Ala Pro Pro Pro Ala Arg Leu Ala Glu Ala Val Gly Val Ser Ala 

210 215 220 

gca acc etc gcg gec gec gec gag teg gtt gee acc acc ttg egg gaa 

Ala Thr Leu Ala Ala Ala Ala Glu Ser Val Ala Thr Thr Leu Arg Glu 

225 230 235 240 

ttt gac gaa gac cac att tta agt aac gec cgc ggt teg teg tga 

Phe Asp Glu Asp His lie Leu Ser Asn Ala Arg Gly Ser Ser 

245 250 255 



<210> 157 
<211> 254 

<212> PRT " * " 

<213> Macaca mulatta rhadinovirus 17577 

<400> 157 

Met Ala Ser Val Gly Pro Val Pro Thr Gly Thr lie Asp Pro Val Leu 

1 5 10 15 

Tyr Gin Asp Arg Ala Leu Ser Asn Leu Leu Ala His Glu Ala Ser Phe 

20 25 30 

Val Thr Ser Thr Ala Cys Tyr Gly Thr lie Gin Thr Glu Val Thr Val 

35 40 45 

Gly Met Arg Val lie Leu Gly Thr Trp Met Arg Ser Val Ala Arg Ala 

50 55 60 

His Gin Ala Asp Ala Ser Val Phe Pro Leu Ala Val Ser He Leu Asp 
65 70 75 80 

Arg Tyr Leu Glu Cys Arg Ser He Pro Arg Arg Arg Phe Gin Arg Leu 

85 90 95 

Gly Ala Ala Cys Leu Phe Leu Ala Gly Lys He Arg Asp Leu Asn Pro 

100 105 HO 

Phe Lys Ala Ala Phe Leu Cys Phe Cys Ala Ala Glu Asp Phe Ser Val 

115 120 125 

Ala Asp Leu Leu Lys Gin Glu Lys Ser He Leu Lys Ala Leu Arg Trp 

130 135 140 

Lys Leu Glu Ala Val Leu Pro Thr Asp Ala He Gly Pro Thr Leu Phe 
145 150 155 160 

Lys Ser Gly Phe Thr Lys Glu Gin Leu Phe Ala Leu His Ser Gin Val 

165 170 175 

Val Glu Ser Val His Lys Ala He Val Asn Pro Ala Thr Gly Gly Leu 



528 



gtg gag tct gtt cac aaa gec ate gtg aac ccg gec acc ggc ggt ctg 576 

Val Glu Ser Val His Lys Ala He Val Asn Pro Ala Thr Gly Gly Leu 
180 185 190 

ccc ccg tec ctg gtg gcg gec gec tgc gcg ctg ttt age etc ggt gec 624 

Pro Pro Ser Leu Val Ala Ala Ala Cys Ala Leu Phe Ser Leu Gly Ala 

195 200 205 



672 



720 



765 



273 
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180 

Pro Pro Ser Leu Val 
195 

Ala Ala Pro Pro Pro 
210 

Ala Thr Leu Ala Ala 
225 

Phe Asp Glu Asp His 
245 



185 

Ala Ala Ala Cys Ala 
200 

Ala Arg Leu Ala Glu 
215 

Ala Ala Glu Ser Val 
230 

lie Leu Ser Asn Ala 
250 



190 

Leu Phe Ser Leu Gly Ala 
205 

Ala Val Gly Val Ser Ala 
220 

Ala Thr Thr Leu Arg Glu 
235 240 
Arg Gly Ser Ser 



<210> 158 
<211> 1347 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) - - (1347) 

<400> 158 

atg tgg ggc age egg caa cat cgt tec ggc ata gta teg gga cac ggg 48 

Met Trp Gly Ser Arg Gin His Arg Ser Gly He Val Ser Gly His Gly 

1 5 10 15 

ttg cga agt tec tgc agg gga cat tgc gga aga cgc ggc ggc act aga 96 
Leu Arg Ser Ser Cys Arg Gly His Cys Gly Arg Arg Gly Gly Thr Arg 
20 25 30 

gaa cag gee ggg agg cgc ggc agg ggg cgt ggc acg gee gcg cca gec 144 
Glu Gin Ala Gly Arg Arg Gly Arg Gly Arg Gly Thr Ala Ala Pro Ala 
35 40 45 

gec gca ccc gcg ccc ccc get cca aca aca tea ggg ccg caa gtg egg 192 
Ala Ala Pro Ala Pro Pro Ala Pro Thr Thr Ser Gly Pro Gin Val Arg 
50 55 60 

gcg gtt gca gaa caa ggg cac gga teg gac acc gaa acg gee acc gaa 
Ala Val Ala Glu Gin Gly His Gly Ser Asp Thr Glu Thr Ala Thr Glu 
65 70 75 80 

tec agg cac ggg tea tec caa ggg teg cca tea ggg tea ggg tct gag 
Ser Arg His Gly Ser Ser Gin Gly Ser Pro Ser Gly Ser Gly Ser Glu 
85 90 95 

tec gtc ate gtg ttg gga tct cca acg ccg teg cca age ggg tct gcg 336 
Ser Val He Val Leu Gly Ser Pro Thr Pro Ser Pro Ser Gly Ser Ala 
100 105 HO 



ccg gtg tta get tea ggc ctg tea cct caa aac acg tea ggg teg tec 

Pro Val Leu Ala Ser Gly Leu Ser Pro Gin Asn Thr Ser Gly Ser Ser 

115 120 125 

ccg gca tec ccc gca teg cat teg ccc cca cca tea ccc ccg tea cac 

Pro Ala Ser Pro Ala Ser His Ser Pro Pro Pro Ser Pro Pro Ser His 

130 135 140 

ccc ggc ccc cac tea cct gca ccg ccg teg teg cat aat cca teg ccc 

Pro Gly Pro His Ser Pro Ala Pro Pro Ser Ser His Asn Pro Ser Pro 

145 150 155 160 



240 



288 



384 



432 



480 



274 
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aat cag cag cct teg teg ttt ttg caa cca teg cac cac gac tea cct 
Asn Gin Gin Pro Ser Ser Phe Leu Gin Pro Ser His His Asp Ser Pro 
165 170 175 



528 



gaa ccg cct gag cct ccg acg tec ttg ccc cct ccc gac tea cct gga 
Glu Pro Pro Glu Pro Pro Thr Ser Leu Pro Pro Pro Asp Ser Pro Gly 
180 185 190 



576 



ccg cca caa teg cca acg ccg acg tec teg ccc cct ccg caa tec ccg 
Pro Pro Gin Ser Pro Thr Pro Thr Ser Ser Pro Pro Pro Gin Ser Pro 
195 200 205 



624 



ccc gac tea ccg gga cca ccc caa teg cca acg cct caa cag gec cca 
Pro Asp Ser Pro Gly Pro Pro Gin Ser Pro Thr Pro Gin Gin Ala Pro 
210 215 220 



672 



tec cct aac acg caa cag gcg gtg age cac aca gac cac ccc acc ggg 
Ser Pro Asn Thr Gin Gin Ala Val Ser His Thr Asp His Pro Thr Gly 
225 230 235 240 



720 



ccg tct egg cca gga ccc ccg ttt cct ggc cat aca tec cac teg tac 
Pro Ser Arg Pro Gly Pro Pro Phe Pro Gly His Thr Ser His Ser Tyr 
245 250 255 



768 



acc gta ggg ggt tgg ggt cca ccc acg cgt gee gga gga gtt cct tgt 
Thr Val Gly Gly Trp Gly Pro Pro Thr Arg Ala Gly Gly Val Pro Cys 
260 265 270 



816 



tta cgt etc aga tgc acg agt cac aac tec cac gag gac gaa get cct 
Leu Arg Leu Arg Cys Thr Ser His Asn Ser His Glu Asp Glu Ala Pro 
275 280 28,5 



864 



gaa aga caa cag gag caa gag ggg gag gag egg cag cag caa ccg gcg 
Glu Arg Gin Gin Glu Gin Glu Gly Glu Glu Arg Gin Gin Gin Pro Ala 
290 295 300 



912 



cgc ccc ccg cgc ccc ccg cgc ccg ccg ega tac ccc att ccc ata ccg 
Arg Pro Pro Arg Pro Pro Arg Pro Pro Arg Tyr Pro He Pro He Pro 
305 310 315 320 



960 



tat ccc tea tea gag gaa gag gta ccc aga aaa tac cgc cca cag aga 
Tyr Pro Ser Ser Glu Glu Glu Val Pro Arg Lys Tyr Arg Pro Gin Arg 
325 330 335 



1008 



aga ttt tat cgt cag gta ttg gga ccc agg ate gac ccc cct egg ccg 
Arg Phe Tyr Arg Gin Val Leu Gly Pro Arg He Asp Pro Pro Arg Pro 
340 345 350 



1056 



ggc ccg tgg tgt cac gga gtc ate ttt tgt aac tea gac ccc tat tea 
Gly Pro Trp Cys His Gly Val lie Phe Cys Asn Ser Asp Pro Tyr Ser 
355 360 365 



1104 



ctg tat cgc ctg gca agg tgc eta cag ttt ccc ggc att aga gca tec 
Leu Tyr Arg Leu Ala Arg Cys Leu Gin Phe Pro Gly He Arg Ala Ser 
370 375 380 



1152 



age gtc cgc gtt ttg ccg gac gca cct ggc agt ccc gtc ate ccg get 
Ser Val Arg Val Leu Pro Asp Ala Pro Gly Ser Pro Val He Pro Ala 
385 390 395 400 



1200 



275 
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ttt tgc att acg gtg ttc tgt caa age cgc gga acc gec aag gec gtc 1248 

Phe Cys lie Thr Val Phe Cys Gin Ser Arg Gly Thr Ala Lys Ala Val 

405 410 415 

aaa aag gec cgt cgc egg tgg gaa cga cat cac cca tec gec ccc cac 1296 

Lys Lys Ala Arg Arg Arg Trp Glu Arg His His Pro Ser Ala Pro His 

420 425 430 

ttc cag gec age ata gtg cgc atg gac aga gga ctg cca att cag cac 1344 

Phe Gin Ala Ser lie Val Arg Met Asp Arg Gly Leu Pro lie Gin His 

435 440 445 

taa 1347 



<210> 159 
<211> 448 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 159 

Met Trp Gly Ser Arg Gin His Arg Ser Gly He Val Ser Gly His Gly 

15 10 15 

Leu Arg Ser Ser Cys Arg Gly His Cys Gly Arg Arg Gly Gly Thr Arg 

20 25 30 

Glu Gin Ala Gly Arg Arg Gly Arg Gly Arg Gly Thr Ala Ala Pro Ala 

35 40 45 

Ala Ala Pro Ala Pro Pro Ala Pro Thr Thr Ser Gly Pro Gin Val Arg 

50 55 60 

Ala Val Ala Glu Gin Gly His Gly Ser Asp Thr Glu Thr Ala Thr Glu 
65 70 75 80 

Ser Arg His Gly Ser Ser Gin Gly Ser Pro Ser Gly Ser Gly Ser Glu 

85 90 95 

Ser Val He Val Leu Gly Ser Pro Thr Pro Ser Pro Ser Gly Ser Ala 

100 105 110 

Pro Val Leu Ala Ser Gly Leu Ser Pro Gin Asn Thr Ser Gly Ser Ser 

115 120 125 

Pro Ala Ser Pro Ala Ser His Ser Pro Pro Pro Ser Pro Pro Ser His 

130 135 140 

Pro Gly Pro His Ser Pro Ala Pro Pro Ser Ser His Asn Pro Ser Pro 
145 150 155 160 

Asn Gin Gin Pro Ser Ser Phe Leu Gin Pro Ser His His Asp Ser Pro 

165 170 175 

Glu Pro Pro Glu Pro Pro Thr Ser Leu Pro Pro Pro Asp Ser Pro Gly 

180 185 190 

Pro Pro Gin Ser Pro Thr Pro Thr Ser Ser Pro Pro Pro Gin Ser Pro 

195 200 205 

Pro Asp Ser Pro Gly Pro Pro Gin Ser Pro Thr Pro Gin Gin Ala Pro 

210 215 220 

Ser Pro Asn Thr Gin Gin Ala Val Ser His Thr Asp His Pro Thr Gly 
225 230 235 240 

Pro Ser Arg Pro Gly Pro Pro Phe Pro Gly His Thr Ser His Ser Tyr 

245 250 255 

Thr Val Gly Gly Trp Gly Pro Pro Thr Arg Ala Gly Gly Val Pro Cys 

260 265 270 

Leu Arg Leu Arg Cys Thr Ser His Asn Ser His Glu Asp Glu Ala Pro 

275 280 285. 

Glu Arg Gin Gin Glu Gin Glu Gly Glu Glu Arg Gin Gin Gin Pro Ala 

290 295 300 

Arg Pro Pro Arg Pro Pro Arg Pro Pro Arg Tyr Pro He Pro He Pro 



276 
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305 310 315 320 

Tyr Pro Ser Ser Glu Glu Glu Val Pro Arg Lys Tyr Arg Pro Gin Arg 

325 330 335 

Arg Phe Tyr Arg Gin Val Leu Gly Pro Arg lie Asp Pro Pro Arg Pro 

340 345 350 

Gly Pro Trp Cys His Gly Val lie Phe Cys Asn Ser Asp Pro Tyr Ser 

355 360 365 

Leu Tyr Arg Leu Ala Arg Cys Leu Gin Phe Pro Gly lie Arg Ala Ser 

370 375 380 

Ser Val Arg Val Leu Pro Asp Ala Pro Gly Ser Pro Val lie Pro Ala 
385 390 395 400 

Phe Cys lie Thr Val Phe Cys Gin Ser Arg Gly Thr Ala Lys Ala Val 

405 410 415 

Lys Lys Ala Arg Arg Arg Trp Glu Arg His His Pro Ser Ala Pro His 

420 425 430 

Phe Gin Ala Ser lie Val Arg Met Asp Arg Gly Leu Pro lie Gin His 
435 440 445 



<210> 160 
<211> 762 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . - (762) 

<400> 160 

atg teg gga gga att aca tta acg ctg ctg ctg gcg acg ctt gec acg 48 

Met Ser Gly Gly lie Thr Leu Thr Leu Leu Leu Ala Thr Leu Ala Thr 
1 5 10 15 

gtt egg tgc get ctt caa acg cac tat gcg gcg gtc ccc gtg cac tct 96 
Val Arg Cys Ala Leu Gin Thr His Tyr Ala Ala Val Pro Val His Ser 
20 25 30 

acc gcg tec ctg ggg tgc gtg tta acg aca ccc cac gac gtt ctt ate 144 
Thr Ala Ser Leu Gly Cys Val Leu Thr Thr Pro His Asp Val Leu lie 
35 40 45 

gtt acc tgg caa aaa cag gaa teg cct agt ccc gtt aac gtg gec aca 192 
Val Thr Trp Gin Lys Gin Glu Ser Pro Ser Pro Val Asn Val Ala Thr 
50 55 60 

tat agt tec gaa gcg ggc acg gtg gtt cag ccc ccg ttc gec ggt agg 240 
Tyr Ser Ser Glu Ala Gly Thr Val Val Gin Pro Pro Phe Ala Gly Arg 
65 70 75 80 

gtt gac att ccc gaa cac aag ttg acc aga acg acc ctg aag ttt ttt 288 
Val Asp lie Pro Glu His Lys Leu Thr Arg Thr Thr Leu Lys Phe Phe 
85 90 95 

aat gec acc ctg gag gac gag ggg tgc tac ctg tgt ate ttt aac gcg 336 
Asn Ala Thr Leu Glu Asp Glu Gly Cys Tyr Leu Cys lie Phe Asn Ala 
100 105 110 

ttt gga gtg gga aag ctg teg gga acc gec tgc ttg acg gtt tac gtc 384 
Phe Gly Val Gly Lys Leu Ser Gly Thr Ala Cys Leu Thr Val Tyr Val 
115 120 125 



277 
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ccc ctg tec atg tec gtc acg ttt tac ccc ccg att aac ccg acg cag 432 

Pro Leu Ser Met Ser Val Thr Phe Tyr Pro Pro lie Asn Pro Thr Gin 
130 135 .140 

etc gtc tgt egg gee gag gee agt ccc gca ccg teg gtc aac tgg ace 480 

Leu Val Cys Arg Ala Glu Ala Ser Pro Ala Pro Ser Val Asn Trp Thr 

145 150 155 160 

ggc gtg ccg ccc gag ctg tgc age gaa cct gaa gtg ttt ccc egg ccc 528 

Gly Val Pro Pro Glu Leu Cys Ser Glu Pro Glu Val Phe Pro Arg Pro 

165 170 175 

aac gga aca ace ctg gtt gtg ggt cgc tgc aac gta acg teg gtg gac 576 

Asn Gly Thr Thr Leu Val Val Gly Arg Cys Asn Val Thr Ser Val Asp 

180 185 190 

ccc gaa gac ctt gag aac gee acg tgc ctg gtc act cat ata ggc ggt 624 

Pro Glu Asp Leu Glu Asn Ala Thr Cys Leu Val Thr His lie Gly Gly 

195 200 205 

ttg gec gcg gcg egg ccc ctg gac ccc gtg ttt teg gat ccc ctg gaa 672 

Leu Ala Ala Ala Arg Pro Leu Asp Pro Val Phe Ser Asp Pro Leu Glu 
210 215 220 

ggg acg age cac tac gtg gtg ggt gtg gtg gca gcg gec gee gtt tta 720 

Gly Thr Ser His Tyr Val Val Gly Val Val Ala Ala Ala Ala Val Leu 

225 230 235 240 

ggc att ttt tta acg ggt gtt ttt ttg tat agg tct atg tga 762 

Gly lie Phe Leu Thr Gly Val Phe Leu Tyr Arg Ser Met 

245 250 



<210> 161 

<211> 253 

<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 



<400> 161 

Met Ser Gly Gly lie Thr Leu Thr Leu Leu Leu Ala Thr Leu Ala Thr 

15 10 15 

Val Arg Cys Ala Leu Gin Thr His Tyr Ala Ala Val Pro Val His Ser 

20 25 30 

Thr Ala Ser Leu Gly Cys Val Leu Thr Thr Pro His Asp Val Leu He 

35 40 45 

Val Thr Trp Gin Lys Gin Glu Ser Pro Ser Pro Val Asn Val Ala Thr 

50 55 60 

Tyr Ser Ser Glu Ala Gly Thr Val Val Gin Pro Pro Phe Ala Gly Arg 
65 70 75 80 

Val Asp He Pro Glu His Lys Leu Thr Arg Thr Thr Leu Lys Phe Phe 

85 90 95 

Asn Ala Thr Leu Glu Asp Glu Gly Cys Tyr Leu Cys He Phe Asn Ala 

100 105 110 

Phe Gly Val Gly Lys Leu Ser Gly Thr Ala Cys Leu Thr Val Tyr Val 

115 120 125 

Pro Leu Ser Met Ser Val Thr Phe Tyr Pro Pro He Asn Pro Thr Gin 

130 135 140 

Leu Val Cys Arg Ala Glu Ala Ser Pro Ala Pro Ser Val Asn Trp Thr 
145 150 ' 155 160 

Gly Val Pro Pro Glu Leu Cys Ser Glu Pro Glu Val Phe Pro Arg Pro 

278 
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Asn Gly Thr Thr 
180 

Pro Glu Asp Leu 
195 

Leu Ala Ala Ala 
210 

Gly Thr Ser His 
225 

Gly lie Phe Leu 



165 

Leu Val Val Gly 

Glu Asn Ala Thr 
200 

Arg Pro Leu Asp 
215 

Tyr Val Val Gly 
230 

Thr Gly Val Phe 
245 



170 

Arg Cys Asn Val 
185 

Cys Leu Val Thr 

Pro Val Phe Ser 
220 

Val Val Ala Ala 
235 

Leu Tyr Arg Ser 
250 



175 

Thr Ser Val Asp 
190 

His lie Gly Gly 
205 

Asp Pro Leu Glu 

Ala Ala Val Leu 
240 

Met 



<210> 162 
<211> 1029 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (1029) 

<400> 162 

atg gac gcc ttg aac aat aac ctt aac ctg ctg atg gat ttt ctg tct 48 

Met Asp Ala Leu Asn Asn Asn Leu Asn Leu Leu Met Asp Phe Leu Ser 

15 10 15 

aac tat teg aat age tac agt agt tat gac gac aat atg tct tac acc 96 
Asn Tyr Ser Asn Ser Tyr Ser Ser Tyr Asp Asp Asn Met Ser Tyr Thr 
20 25 30 

tta gac acg gaa tec acg ctg tgt egg ctg acg gtg gtt ttc cca cct 144 
Leu Asp Thr Glu Ser Thr Leu Cys Arg Leu Thr Val Val Phe Pro Pro 
35 40 45 

acc gtt tat gcg att ata tgc ttt ttt att ttt tgc att acg ctg ttt 192 
Thr Val Tyr Ala lie He Cys Phe Phe He Phe Cys He Thr Leu Phe 
50 55 60 

ggg aac gcg ttg gtg eta tat att ttt ttt aaa ttt aaa gcg etc gcc 240 
Gly Asn Ala Leu Val Leu Tyr He Phe Phe Lys Phe Lys Ala Leu Ala 
65 70 75 80 

aac tct gtg gat gta ctg atg get ggg ttg tgt tgt aac tec ctg ttt 288 
Asn Ser Val Asp Val Leu Met Ala Gly Leu Cys Cys Asn Ser Leu Phe 
85 90 95 

ctg tgc gcg teg ttt ttg ttc age tgg ctg ctg tac gtc gcg cca cag 336 
Leu Cys Ala Ser Phe Leu Phe Ser Trp Leu Leu Tyr Val Ala Pro Gin 
100 105 110 

atg etc acg tec gcg acg tgc aag gtg gaa ate ttt ttc ttt tac ctg 384 
Met Leu Thr Ser Ala Thr Cys Lys Val Glu He Phe Phe Phe Tyr Leu 
115 120 125 

tac acg tac ttt ggc gtg tac att gtg gtg tgt ate age ctt ate agg 432 
Tyr Thr Tyr Phe Gly Val Tyr He Val Val Cys He Ser Leu He Arg 
130 135 140 



tgc ctg tta gtt gtg ttt tec cgc cgc ccg tgg gtc aag cac ggg gcc 



480 



279 



Cys Leu Leu Val Val Phe Ser Arg Arg Pro Trp Val Lys His Gly Ala 
145 150 155 160 



tec ggc ttt etc tgc gtg tgt gtg tct tta ate gtg gcg ctg gcg ctg 

Ser Gly Phe Leu Cys Val Cys Val Ser Leu lie Val Ala Leu Ala Leu 

165 170 175 

tct gec aac gcg age etc tat agg acg gee ctg cgt cac cca gag acc 

Ser Ala Asn Ala Ser Leu Tyr Arg Thr Ala Leu Arg His Pro Glu Thr 

180 185 190 

age gag tgg ata tgc tac gaa gat gec ggg gaa gat acc gtc aac tgg 

Ser Glu Trp lie Cys Tyr Glu Asp Ala Gly Glu Asp Thr Val Asn Trp 

195 200 205 

aag ctg aga ate aga acc acc age gcg ate tgc ggg ttt ttg gtt ccg 

Lys Leu Arg lie Arg Thr Thr Ser Ala lie Cys Gly Phe Leu Val. Pro 

210 215 220 

ttt ggg ctg atg gtg etc ttt tac gga ctt acg tgg tgt atg gtt aaa 

Phe Gly Leu Met Val Leu Phe Tyr Gly Leu Thr Trp Cys Met Val Lys 

225 230 235 240 

age acg aag ctg gec aga aag gga gec gtt agg ggt gta att gtg acg 

Ser Thr Lys Leu Ala Arg Lys Gly Ala Val Arg Gly Val lie Val Thr 

245 250 255 

gtg gtg gtg ctg ttt tta att ttt tgc ctg ccc tat cac ctg tgc aac 

Val Val Val Leu Phe Leu lie Phe Cys Leu Pro Tyr His Leu Cys Asn 

260 265 270 

ttt ttt gac acc ctg ttg agg acc ggt ttt ctg gec gaa acg tgc tac 

Phe Phe Asp Thr Leu Leu Arg Thr Gly Phe Leu Ala Glu Thr Cys Tyr 

275 280 285 

etc agg gac gtg ate age gtg gee atg cac ata tgc tec ctg eta cag 

Leu Arg Asp Val lie Ser Val Ala Met His lie Cys Ser Leu Leu Gin 

290 295 300 

age atg tat age gcg ttc gtg cca gtc gtg tat tct ggt ctt ggg tct 

Ser Met Tyr Ser Ala Phe Val Pro Val Val Tyr Ser Gly Leu Gly Ser 

305 310 315 320 

ctg ttt agg aga agg gtt agg gat acc tgg tec gtg ttt agg tgt ttt 

Leu Phe Arg Arg Arg Val Arg Asp Thr Trp Ser Val Phe Arg Cys Phe 

325 330 335 



tec act tea ggt agt tta tga 
Ser Thr Ser Gly Ser Leu 
340 



<210> 163 
<211> 342 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 163 

Met Asp Ala Leu Asn Asn Asn Leu Asn Leu Leu Met Asp Phe Leu Ser 

1 5 10 15 

Asn Tyr Ser Asn Ser Tyr Ser Ser Tyr Asp Asp Asn Met Ser Tyr Thr 



280 



O 2s& .™k J}„ HUTU ,,■« -O •* 









20 










25 










30 






Leu 


Asp 


Thr 


Glu 


Ser 


Thr 


Leu 


Cys 


Arg 


Leu 


Thr 


Val 


Val 


Phe 


Pro 


Pro 






35 










40 










45 








Thr 


Val 


Tyr 


Ala 


He 


He 


Cys 


Phe 


Phe 


He 


Phe 


Cys 


He 


Thr 


Leu 


Phe 




50 










55 










60 










Gly 


Asn 


Ala 


Leu 


Val 


Leu 


Tyr 


He 


Phe 


Phe 


Lys 


Phe 


Lys 


Ala 


Leu 


Ala 


65 










70 










75 










80 


Asn 


Ser 


Val 


Asp 


Val 


Leu 


Met 


Ala 


Gly 


Leu 


Cys 


Cys 


Asn 


Ser 


Leu 


Phe 










85 










90 










95 




Leu 


Cys 


Ala 


Ser 


Phe 


Leu 


Phe 


Ser 


Trp 


Leu 


Leu 


Tyr 


Val 


Ala 


Pro 


Gin 








100 










105 










110 






Met 


Leu 


Thr 


Ser 


Ala 


Thr 


Cys 


Lys 


Val 


Glu 


He 


Phe 


Phe 


Phe 


Tyr 


Leu 






115 










120 










125 








Tyr 


Thr 


Tyr 


Phe 


Gly 


Val 


Tyr 


He 


Val 


Val 


Cys 


He 


Ser 


Leu 


He 


Arg 




130 










135 










140 










Cys 


Leu 


Leu 


Val 


Val 


Phe 


Ser 


Arg 


Arg 


Pro 


Trp 


Val 


Lys 


His 


Gly Ala 


145 










150 










155 










160 


Ser 


Gly 


Phe 


Leu 


Cys 


Val 


Cys 


Val 


Ser 


Leu 


He 


Val 


Ala 


Leu 


Ala 


Leu 










165 










170 










175 




Ser 


Ala 


Asn 


Ala 


Ser 


Leu 


Tyr 


Arg 


Thr 


Ala 


Leu 


Arg 


His 


Pro 


Glu 


Thr 








180 










185 










190 






Ser 


Glu 


Trp 


He 


Cys 


Tyr 


Glu 


Asp 


Ala 


Gly 


Glu 


Asp 


Thr 


Val 


Asn 


Trp 






195 










200 










205 








Lys 


Leu 


Arg 


He 


Arg 


Thr 


Thr 


Ser 


Ala 


He 


Cys 


Gly 


Phe 


Leu 


Val 


Pro 




210 










215 










220 










Phe 


Gly 


Leu 


Met 


Val 


Leu 


Phe 


Tyr 


Gly 


Leu 


Thr 


Trp 


Cys 


Met 


Val 


Lys 


225 










230 










235 










240 


Ser 


Thr 


Lys 


Leu 


Ala 


Arg 


Lys 


Gly 


Ala 


Val 


Arg 


Gly 


Val 


He 


Val 


Thr 










245 










250 










255 




Val 


Val 


Val 


Leu 


Phe 


Leu 


He 


Phe 


Cys 


Leu 


Pro 


Tyr 


His 


Leu 


Cys 


Asn 








260 










265 










270 






Phe 


Phe 


Asp 


Thr 


Leu 


Leu 


Arg 


Thr 


Gly 


Phe 


Leu 


Ala 


Glu 


Thr 


Cys 


Tyr 






275 










280 










285 








Leu 


Arg 


Asp 


Val 


He 


Ser 


Val 


Ala 


Met 


His 


He 


Cys 


Ser 


Leu 


Leu 


Gin 




290 










295 










300 










Ser 


Met 


Tyr 


Ser 


Ala 


Phe 


Val 


Pro 


Val 


Val 


Tyr 


Ser 


Gly 


Leu 


Gly 


Ser 


305 










310 










315 










320 


Leu 


Phe 


Arg 


Arg 


Arg 


Val 


Arg 


Asp 


Thr 


Trp 


Ser 


Val 


Phe 


Arg 


Cys 


Phe 










325 










330 










335 




Ser 


Thr 


Ser 




Ser 


Leu 























340 



<210> 164 

<211> 3897 

<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

<220> 

<221> CDS 

<222> (1) . . (3897) 

<400> 164 

atg gcc caa cgc acg aac cct cgc tgg gcc get gca gec ctt agt cca 48 

Met Ala Gin Arg Thr Asn Pro Arg Trp Ala Ala Ala Ala Leu Ser Pro 

1 5 10 15 

gaa gag gag get ttt att cat gac aat teg gac gca gaa agt gtt ttg 96 
Glu Glu Glu Ala Phe He His Asp Asn Ser Asp Ala Glu Ser Val Leu 
20 25 30 



281 
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gcg ctt gtg cca gaa cag tgc ttt agt gaa ttt tta ctt tgg ctt gta 144 
Ala Leu Val Pro Glu Gin Cys Phe Ser Glu Phe Leu Leu Trp Leu Val 
3 5 4 0 4 5 



aca cgc ccc tct gat aac ttt gac aac gac gat gac gac cca gcg ctg 

Thr Arg Pro Ser Asp Asn Phe Asp Asn Asp Asp Asp Asp Pro Ala Leu 

50 • 55 60 

ggc gtt ate tgg cat ctt ctg gcg cct ctg gtt aat tat gca cct ctg 

Gly Val lie Trp His Leu Leu Ala Pro Leu Val Asn Tyr Ala Pro Leu 

65 70 75 80 

gaa act egg teg gcg cac etc cag ggc cat cat act ata tec ctg ccc 

Glu Thr Arg Ser Ala His Leu Gin Gly His His Thr lie Ser Leu Pro 

85 90 95 



gtg cag tgc ctt aga gac age ggc etc gat aga acg ttg egg tta gag 
Val Gin Cys Leu Arg Asp Ser Gly Leu Asp Arg Thr Leu Arg Leu Glu 
115 120 125 



etc ccg tta caa egg aac ata tta ccg ctg gec gtg cag gec tec aac 
Leu Pro Leu Gin Arg Asn lie Leu Pro Leu Ala Val Gin Ala Ser Asn 
195 200 205 



tac age tgc gtt ttc cag cgc etc ccg tgc gaa aac gta acc cac ata 

Tyr Ser Cys Val Phe Gin Arg Leu Pro Cys Glu Asn Val Thr His lie 

225 230 235 240 

get gac tea ttt aca cac ctg cac age gee att cag aca ggt gca ggt 

Ala Asp Ser Phe Thr His Leu His Ser Ala He Gin Thr Gly Ala Gly 

245 250 255 

gcg ctg caa aac att ctg ttc cat gee acg ctg ctg ccc ggg ggc gaa 

Ala Leu Gin Asn He Leu Phe His Ala Thr Leu Leu Pro Gly Gly Glu 
260 265 270 

282 



192 



240 



288 



tat ggc cca gac ctg atg cgc caa cct acc aca aga tct age gaa ata 336 
Tyr Gly Pro Asp Leu Net Arg Gin Pro Thr Thr Arg Ser Ser Glu He 
100 105 HO 



384 



gtg ggc aga cat ctg age tgc cag acg aga egg ttt gtc gec gat egg 43 2 

Val Gly Arg His Leu Ser Cys Gin Thr Arg Arg Phe Val Ala Asp Arg 

130 135 140 

gta ccc ccg ggc acc ttg gec gee ctg aca ctt ggc aca eta gta gaa 480 

Val Pro Pro Gly Thr Leu Ala Ala Leu Thr Leu Gly Thr Leu Val Glu 

145 150 155 160 

tat gat gtg cgc gtg cag cgc cag etc ccg gtg aca ttg caa tec acc 528 

Tyr Asp Val Arg Val Gin Arg Gin Leu Pro Val Thr Leu Gin Ser Thr 
165 170 175 

gee tgg aga ccg ttg ccc gag aga gac cca ata tgc gee gcg gtg atg 576 

Ala Trp Arg Pro Leu Pro Glu Arg Asp Pro He Cys Ala Ala Val Met 
180 185 ~ 190 



624 



ggc aac age tat acg gtg tec aga tac gec gtc atg gee cgc agg age 672 
Gly Asn Ser Tyr Thr Val Ser Arg Tyr Ala Val Met Ala Arg Arg Ser 
210 215 220 



720 



768 



816 
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ate aga teg gec ctg tgt gga ttt tac gec act acg ccg tea gtg ggc 864 

lie Arg Ser Ala Leu Cys Gly Phe Tyr Ala Thr Thr Pro Ser Val Gly 

275 280 285 

gca ttt tct cgc gca cgc cac aga get att aac aca aca gcg aca etc 912 

Ala Phe Ser Arg Ala Arg His Arg Ala He Asn Thr Thr Ala Thr Leu 

290 295 300 



cac tgc cag cag ctg gcg cgc acc ggc acg cct gtc etc ggt ggc ttt 
His Cys Gin Gin Leu Ala Arg Thr Gly Thr Pro Val Leu Gly Gly Phe 

310 315 320 



305 



ggt att acc cgc ate acc caa acg ttg teg ctg get tac gag att ttg 

Gly He Thr Arg He Thr Gin Thr Leu Ser Leu Ala Tyr Glu He Leu 

385 390 395 400 

gaa ggg ccc eta ttt acc age att aat cgc gec cat gaa ccc gec tct 

Glu Gly Pro Leu Phe Thr Ser He Asn Arg Ala His Glu Pro Ala Ser 
405 410 415 



960 



1008 



ctt aaa acc gtc cac age gee acc acc age gag gcg aac gtt att acc 

Leu Lys Thr Val His Ser Ala Thr Thr Ser Glu Ala Asn Val He Thr 

325 330 335 

acc aca teg ctg tta teg tgc gtg cct caa gca tac aca ttc etc agg 

Thr Thr Ser Leu Leu Ser Cys Val Pro Gin Ala Tyr Thr Phe Leu Arg 
340 345 350 

agg tct tta ttc agt cag cct ate ate tgt ctt ggg tct ttt gaa ccc 

Arg Ser Leu Phe Ser Gin Pro He He Cys Leu Gly Ser Phe Glu Pro 
355 360 365 

gtt gac ggc gat ggc aac cag cgc teg ctt tac ctg ggg age gee gca 1152 

Val Asp Gly Asp Gly Asn Gin Arg Ser Leu Tyr Leu Gly Ser Ala Ala 
370 375 380 



1056 



1104 



1200 



1248 



gtc ate ggc cac ctg gga gee ctg gtc teg egg ggc ggc ctg cgc etc 1296 

Val He Gly His Leu Gly Ala Leu Val Ser Arg Gly Gly Leu Arg Leu 
420 425 430 

ttt gtc tct cag ctt cca cca acc att ctg age caa ctg acc gec acg 1344 

Phe Val Ser Gin Leu Pro Pro Thr He Leu Ser Gin Leu Thr Ala Thr 
435 440 445 

cca gac ate tea egg gaa acc gtg aac gac ate eta gtt aac aag ttt 13 92 

Pro Asp He Ser Arg Glu Thr Val Asn Asp He Leu Val Asn Lys Phe 

450 455 460 

etc aac gtg tct gec tgc gtg gtc ttt gec gtc etc ccg cgc gac acg 144 0 

Leu Asn Val Ser Ala Cys Val Val Phe Ala Val Leu Pro Arg Asp Thr 

465 470 475 480 

gag ccg gaa ccg ggc ccg ttg gat gec ate agg agg gec gca cgc ate 1488 

Glu Pro Glu Pro Gly Pro Leu Asp Ala He Arg Arg Ala Ala Arg He 
485 490 495 

tgc gga tgc cct ttc gec gtc gtt ggg gaa acc tgc gaa gag ctt gga 1536 
Cys Gly Cys Pro Phe Ala Val Val Gly Glu Thr Cys Glu Glu Leu Gly 
500 505 510 

att cag ttc gtg aac gac ctg gag ctg tgg aac ccg gga gcg tgg ccg 1584 

283 



c* o « 'O t!„i! o - n;3 M ,k * 



He Gin Phe Val Asn Asp Leu Glu Leu Trp Asn Pro Gly Ala Trp Pro 
515 520 525 



ate aga cag cca aca teg gec gaa gtc ate gca act ttt ggg ttc gac 

He Arg Gin Pro Thr Ser Ala Glu Val He Ala Thr Phe Gly Phe Asp 

530 535 540 

gag cag ccc gtt tec tec aac tgg ctg gtg cgc cca gaa gaa cct gag 

Glu Gin Pro Val Ser Ser Asn Trp Leu Val Arg Pro Glu Glu Pro Glu 

545 550 555 560 



cac att gtc aac cac acg teg gtg ttt acg gac aga atg gcg egg gtg 
His He Val Asn His Thr Ser Val Phe Thr Asp Arg Met Ala Arg Val 
625 630 635 640 



705 



710 715 720 



ttc gec ate acg teg gec ggc age acc ccg tgc ctg teg gaa gaa ctg 
Phe Ala lie Thr Ser Ala Gly Ser Thr Pro Cys Leu Ser Glu Glu Leu 



1632 



1680 



1728 



gat ggt ggc gaa cag gca ccc teg ccg acc gac tgg ggc eta ttc cgc 

Asp Gly Gly Glu Gin Ala Pro Ser Pro Thr Asp Trp Gly Leu Phe Arg 

565 570 575 

ctg gec tec gtg gtc gat cag ctt ctg cga tgt ccg acg gtt ggc age 1776 

Leu Ala Ser Val Val Asp Gin Leu Leu Arg Cys Pro Thr Val Gly Ser 

580 585 590 

aaa gag ttt gtc acg cga cac gtg gac aga tgc tec aac gga etc gtg 

Lys Glu Phe Val Thr Arg His Val Asp Arg Cys Ser Asn Gly Leu Val 

595 600 605 

get cag cag tgc gaa gtg gga ccc ctg ggc egg ccg ctg tea gat tac 1872 

Ala Gin Gin Cys Glu Val Gly Pro Leu Gly Arg Pro Leu Ser Asp Tyr 

610 615 620 



1824 



1920 



1968 



2016 



ccc ata tat cgc ccc cag ccg ate acc agg cag gac gcg acg gaa cgc 

Pro He Tyr Arg Pro Gin Pro He Thr Arg Gin Asp Ala Thr Glu Arg 
645 650 655 

ctg gtt age cca gaa acc tgg gtc acc cag ggc agg ggc agg aac egg 

Leu Val Ser Pro Glu Thr Trp Val Thr Gin Gly Arg Gly Arg Asn Arg 
660 665 670 

tgg gtc gga cag tgc gtg get tat gga gaa cag gca tac aag atg ggc 2 064 

Trp Val Gly Gin Cys Val Ala Tyr Gly Glu Gin Ala Tyr Lys Met Gly 
675 680 685 

ate aac gcg gca gtg ggc gcg aga tac gee ate tgc gag gcg gtc acc 2112 

He Asn Ala Ala Val Gly Ala Arg Tyr Ala He Cys Glu Ala Val Thr 
690 695 700 

aac ate atg eta gcg cac gtg egg cgt eta age gac ate acg ctg acg 2160 

Asn He Met Leu Ala His Val Arg Arg Leu Ser Asp He Thr Leu Thr 



2208 



gcg teg gtc ggt tgg aac ccg gag gac gac cag gee tgg etc ctg cag 

Ala Ser Val Gly Trp Asn Pro Glu Asp Asp Gin Ala Trp Leu Leu Gin 

725 730 735 

cac aca ctg ttt gee tgc aag gaa eta tgc agg gac ctg age ate aac 2256 

His Thr Leu Phe Ala Cys Lys Glu Leu Cys Arg Asp Leu Ser He Asn 

740 745 750 



2304 
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755 760 765 

ate age gec acc caa cag cat cag acg gta gec ccc gtg ccg ttt aac 2352 

lie Ser Ala Thr Gin Gin His Gin Thr Val Ala Pro Val Pro Phe Asn 

770 775 780 



gca gtc gtc ate acg gec acc gec gag gtg aag teg tec agg caa cgc 
Ala Val Val He Thr Ala Thr Ala Glu Val Lys Ser Ser Arg Gin Arg 

790 795 800 



785 



tea gaa aga ate gtc ate tea cac aac aac gat gaa gtc tat tea cag 
Ser Glu Arg He Val He Ser His Asn Asn Asp Glu Val Tyr Ser Gin 



945 



950 955 960 



aag gac tac ggg tgc aac caa cac aac ctg ggc cac etc gec gag gtt 
Lys Asp Tyr Gly Cys Asn Gin His Asn Leu Gly His Leu Ala Glu Val 
995 1000 1005 



2400 



gtc aca cct gac ctg aag gec acc ggt aac etc ate gtg ctg gtg tec 2448 
Val Thr Pro Asp Leu Lys Ala Thr Gly Asn Leu He Val Leu Val Ser 
805 810 815 

ttc ccc gtg ccg cac ctg acc cag gga teg acc ttt gag cac ctg tgt 2496 
Phe Pro Val Pro His Leu Thr Gin Gly Ser Thr Phe Glu His Leu Cys 
820 825 830 

etc ttg ccg agt ccc acc eta ccg gac gtt cag gcg aca cac ctg gec 2544 
Leu Leu Pro Ser Pro Thr Leu Pro Asp Val Gin Ala Thr His Leu Ala 
835 840 845 

aac etc ttc atg eta aca gag gee ctg etc tec egg ggt ctg gtg gtg 2592 
Asn Leu Phe Met Leu Thr Glu Ala Leu Leu Ser Arg Gly Leu Val Val 
850 855 860 

tec ggc cac gac gtc age gac gga ggc atg gtg gtt acc get ate gag 2640 
Ser Gly His Asp Val Ser Asp Gly Gly Met Val Val Thr Ala He Glu 
865 870 875 880 

atg gec ctg gec ggt aac cga ggc eta cag att cgc ate ccg tea gag 2688 
Met Ala Leu Ala Gly Asn Arg Gly Leu Gin He Arg He Pro Ser Glu 
885 890 895 

gaa acg ccc ctg cag tgg ctg gtg tec gaa acc ccg ggc gtc ate ttc 2736 
Glu Thr Pro Leu Gin Trp Leu Val Ser Glu Thr Pro Gly Val He Phe 
900 905 910 

gaa ate cag ccc cag cac gtc gac gag gtg cgc cag gcg tgc caa aac 2784 
Glu He Gin Pro Gin His Val Asp Glu Val Arg Gin Ala Cys Gin Asn 
915 920 925 

ttt gac tgc egg gec acc gtg tgc ggc acc gtg ggt cag gag ggt ctg 2832 
Phe Asp Cys Arg Ala Thr Val Cys Gly Thr Val Gly Gin Glu Gly Leu 
930 935 940 



2880 



acc etc act teg gtg gcg gca aac tgg acc teg ttc teg gac gaa cag 2928 

Thr Leu Thr Ser Val Ala Ala Asn Trp Thr Ser Phe Ser Asp Glu Gin 
965 970 975 

tgg tac tea tgg ggg ccc age ttc acg ccc gcg cag gaa etc tac aga 2976 

Trp Tyr Ser Trp Gly Pro Ser Phe Thr Pro Ala Gin Glu Leu Tyr Arg 
980 985 990 



3024 



285 
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tgc aga aac age gaa ctg act ctg ttt gca acg ccg teg agg ccg cca 3072 
Cys Arg Asn Ser Glu Leu Thr Leu Phe Ala Thr Pro Ser Arg Pro Pro 
1010 1015 1020 

gca gtg gec gec ttg gtc acc ccc ggt gec ccg ctt ccc agg gcg ctg 3120 
Ala Val Ala Ala Leu Val Thr Pro Gly Ala Pro Leu Pro Arg Ala Leu 
1025 1030 1035 1040 

atg gec gec ttc acc aac gtc ggg ttc gac gtc gcg gec gtc tec acc 3168 
Met Ala Ala Phe Thr Asn Val Gly Phe Asp Val Ala Ala Val Ser Thr 
1045 1050 1055 

gac gac ctt aga gga ggg aac ate etc egg gga ttc tec ggt ctg acc 3216 
Asp Asp Leu Arg Gly Gly Asn Tie Leu Arg Gly Phe Ser Gly Leu Thr 
1060 1065 1070 

ate ggt gga acc gtg ggc ate gaa gac age tac gtg ggg gee cga tgc 3264 
lie Gly Gly Thr Val Gly He Glu Asp Ser Tyr Val Gly Ala Arg Cys 
1075 1080 1085 

gcg ate atg ggt ctt etc aac gac cct ggg tgc tat ggg ggc ctt atg 3312 
Ala He Met Gly Leu Leu Asn Asp Pro Gly Cys Tyr Gly Gly Leu Met 
1090 1095 HOO 

gee ttc ttc cgc cga gcg gat aca ttt tea eta tgc tgc ggg gag ttt 3360 
Ala Phe Phe Arg Arg Ala Asp Thr Phe Ser Leu Cys Cys Gly Glu Phe 
1105 1110 H15 H20 

ggg ttc cag etc ctg ggg gcg etc ggt ctt ctt aga gag acc cct cac 3408 
Gly Phe Gin Leu Leu Gly Ala Leu Gly Leu Leu Arg Glu Thr Pro His 
1125 1130 H35 

gac acg ccc gga ccc aaa acc ccc gac cag tgg gac ate cac ctg gag 
Asp Thr Pro Gly Pro Lys Thr Pro Asp Gin Trp Asp He His Leu Glu 
1140 1145 1150 

gag aac gca tea ggg aac cac gag tgc etc tgg eta aac etc cac ate 
Glu Asn Ala Ser Gly Asn His Glu Cys Leu Trp Leu Asn Leu His lie 
1155 1160 H65 

ccc cag acc acc ate age ate atg ttt agg gtt ctt egg ggc etc gtc 3552 
Pro Gin Thr Thr lie Ser He Met Phe Arg Val Leu Arg Gly Leu Val 
1170 1175 H80 



etc cca ggg tgg gee aac ggc egg tac ctg ggc gtg aga tac ccg cgc 
Leu Pro Gly Trp Ala Asn Gly Arg Tyr Leu Gly Val Arg Tyr Pro Arg 
1185 H90 H95 1200 



3456 



3504 



3600 



3648 



3696 



gac gee atg gag tac cat etc aac cag cag cag cgc ate gec ctg aac 

Asp Ala Met Glu Tyr His Leu Asn Gin Gin Gin Arg He Ala Leu Asn 

1205 1210 1215 

tac cac acg ggc aac gec gac ccg agg atg ttc gcg cag cac tac cca 

Tyr His Thr Gly Asn Ala Asp Pro Arg Met Phe Ala Gin His Tyr Pro 

1220 1225 1230 

cgc aac ccg tct gee aac teg gec gtg gee gee ate acg tea ccg gac 3744 

Arg Asn Pro Ser Ala Asn Ser Ala Val Ala Ala He Thr Ser Pro Asp 

1235 1240 1245 
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ggg cgc cac ctg gcc tec ctg gtc gac ccc gec gtg acg ttc cac ccc 3792 
Gly Arg His Leu Ala Ser Leu Val Asp Pro Ala Val Thr Phe His Pro 
1250 1255 1260 

tgg cag tgg gcg tac gtg ccg ccc gag ctt gcg gac atg aca gtt tec 3840 
Trp Gin Trp Ala Tyr Val Pro Pro Glu Leu Ala Asp Met Thr Val Ser 
1265 1270 1275 1280 

ccg tgg gcg ctg gcc ttc cag tec ctg ttc etc tgg tgc ata aga aac 3888 
Pro Trp Ala Leu Ala Phe Gin Ser Leu Phe Leu Trp Cys lie Arg Asn 
1285 1290 1295 

aga cag taa 3897 
Arg Gin 



<210> 165 
<211> 1298 
<212> PRT 

<213> Macaca mulatta rhadinovirus 17577 
<400> 165 

Met Ala Gin Arg Thr Asn Pro Arg Trp Ala Ala Ala Ala Leu Ser Pro 

15 10 15 

Glu Glu Glu Ala Phe lie His Asp Asn Ser Asp Ala Glu Ser Val Leu 

20 25 30 

Ala Leu Val Pro Glu Gin Cys Phe Ser Glu Phe Leu Leu Trp Leu Val 

35 40 45 

Thr Arg Pro Ser Asp Asn Phe Asp Asn Asp Asp Asp Asp Pro Ala Leu 

50 55 60 

Gly Val lie Trp His Leu Leu Ala Pro Leu Val Asn Tyr Ala Pro Leu 
65 70 75 80 

Glu Thr Arg Ser Ala His Leu Gin Gly His His Thr lie Ser Leu Pro 

85 90 95 

Tyr Gly Pro Asp Leu Met Arg Gin Pro Thr Thr Arg Ser Ser Glu He 

100 105 110 

Val Gin Cys Leu Arg Asp Ser Gly Leu Asp Arg Thr Leu Arg Leu Glu 

115 120 125 

Val Gly Arg His Leu Ser Cys Gin Thr Arg Arg Phe Val Ala Asp Arg 

130 135 140 

Val Pro Pro Gly Thr Leu Ala Ala Leu Thr Leu Gly Thr Leu Val Glu 
145 150 155 160 

Tyr Asp Val Arg Val Gin Arg Gin Leu Pro Val Thr Leu Gin Ser Thr 

165 170 175 

Ala Trp Arg Pro Leu Pro Glu Arg Asp Pro lie Cys Ala Ala Val Met 

180 185 190 

Leu Pro Leu Gin Arg Asn He Leu Pro Leu Ala Val Gin Ala Ser Asn 

195 200 205 

Gly Asn Ser Tyr Thr Val Ser Arg Tyr Ala Val Met Ala Arg Arg Ser 

210 215 220 

Tyr Ser Cys Val Phe Gin Arg Leu Pro Cys Glu Asn Val Thr His He 
225 230 235 240 

Ala Asp Ser Phe Thr His Leu His Ser Ala He Gin Thr Gly Ala Gly 

245 250 255 

Ala Leu Gin Asn He Leu Phe His Ala Thr Leu Leu Pro Gly Gly Glu 

260 265 270 

He Arg Ser Ala Leu Cys Gly Phe Tyr Ala Thr Thr Pro Ser Val Gly 

275 280 285 

Ala Phe Ser Arg Ala Arg His Arg Ala He Asn Thr Thr Ala Thr Leu 

290 295 300 

His Cys Gin Gin Leu Ala Arg Thr Gly Thr Pro Val Leu Gly Gly Phe 
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310 










315 










320 


Leu 


Lys 


Thr 


Val 


His 


Ser 


Ala 


Thr 


Thr 


Ser 


Glu 


Ala 


Asn 


Val 


He 


Thr 










325 
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335 




Thr 


Thr 


Ser 


Leu 


Leu 


Ser 


Cys 


Val 


Pro 


Gin 


Ala 


Tvr 


Thr 


Phe 


Leu 


Arcr 
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345 










350 






Arg 


Ser 


Leu 


Phe 


Ser 


Gin 


Pro 


He 


He 


Cvs 


Leu 


Glv 


Ser 


Phe 


Glu 


Pro 
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360 










365 








Val 


Asp 


Glv 


Asp 


Glv 


Asn 


Gin 


Aro 
> 


Ser 


Leu 


Tyr 


Leu 


Glv 


Ser 


Ala 


Ala 
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375 










380 










Gly 


He 


Thr 


Arg 


lie 


Thr 


Gin 


Thr 


Leu 


Ser 


Leu 


Ala 


Tvr 


Glu 


He 


Leu 


3 85 










3 90 










3 95 










4 00 


Glu 


Glv 


Pro 


Leu 


Phe 


Thr 


Ser 


He 


Asn 


Aro 


Ala 


His 


Glu 


Pro 


Ala 


Ser 
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410 










415 




Val 


He 


Gly 


His 


Leu 


Gly 


Ala 


Leu 


Val 


Ser 


Arg 


Glv 


Glv 


Leu 


Arg 


Leu 








420 










425 
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Phe 


Val 


Ser 


Gin 


Leu 


Pro 


Pro 


Thr 


He 


Leu 


Ser 


Gin 


Leu 


Thr 


Ala 


Thr 
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Pro 


Asp 


He 


Ser 


Arg 


Glu 


Thr 


Val 


Asn 


Asp 


He 


Leu 


Val 


Asn 


ys 


Phe 




4 5 0 
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Asn 


Val 


Ser 


Ala 


Cys 


Val 


Val 


Phe 


Ala 


Val 


Leu 


o 
ro 




sp 


Thr 
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Glu 


Pro 


Glu 


Pro 


Gly 


Pro 


Leu 


sp 


Ala 


He 


Arg 


Arg 


Ala 


Ala 


Arg 


He 
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ys 
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Pro 


Phe 


Ala 


Val 


Val 


Gly 


Glu 


Thr 


Cys 


Glu 


Glu 


Leu 


Gly 
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Tie 


Gin 


Phe 


Val 


Asn 


Asp 


Leu 


Glu 






Asn 


Pro 


Gly 


Ala 


Trn 
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Pro 






C -1 IT 

D13 










c; o n 










c; o 








lie 


Arg 


Gin 


Pro 


Thr 


Ser 


Ala 


Glu 


Val 


He 


Ala 


Thr 


Phe 


Gly 


Phe 


Asp 
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Glu 


Gin 


Pro 


Val 


Ser 


Ser 




± L p 


Leu 
eu 


Val 


Arg 


Pro 


Glu 


Glu 


Pro 


Glu 


^ d. 










55 0 










555 










56 0 


Asp 


Gly 


Gly 


Glu 


Gin 


Ala 


Pro 


Ser 


Pro 


Thr 


Asp 


■ LI F 


Gly 


Leu 


Phe 


.f-i..L y 










565 










570 










5 75 




Leu 


Ala 


Ser 


Val 


Val 


Asp 


Gin 


Leu 


Leu 


Arg 


Cys 


Pro 


Thr 


Val 


Gly 


Ser 








58 0 










5 8 5 










5 90 






Lys 


Glu 


Phe 


Val 


Thr 


ArQ 


His 


Val 


Asp 


Arg 


Cys 


Ser 


Asn 


Glv 


Leu 


Val 






5 95 










6 0 0 










605 








Ala 


Gin 


Gin 


Cys 


Glu 


Val 


Gly 


Pro 


Leu 


Gly Arg 


Pro 


Leu 


Ser 


Asp 


Tvr 




610 










615 










620 










His 


He 


Val 


Asn 


His 


Thr 


Ser 


Val 


Phe 


Thr 


Asp 


Arg 


Met 


Ala 


Arg 


Val 


625 










63 0 










635 










64 0 


Pro 


He 


Tvr 


Arg 


Pro 


Gin 


Pro 


He 


Thr 


Arg 


Gin 


Asp 


Ala 


Thr 


Glu 


Arg 










645 










650 










655 




Leu 


Val 


Ser 


Pro 


Glu 


Thr 


Trp 


Val 


Thr 


Gin 


Gly Arg 


Glv 


Arg 


Asn 


Arg 








660 










665 










670 






Trp 


Val 


Gly 


Gin 


Cys 


Val 


Ala 


Tyr 


Gly 


Glu 


Gin 


Ala 


Tvr 


Lvs 


Met 


Gly 






675 










680 










685 








lie 


Asn 


Ala 


Ala 


Val 


Gly 


Ala 


Arg 


Tyr 


Ala 


lie 


Cys 


Glu 


Ala 


Val 


Thr 




690 










695 










700 










Asn 


He 


Met 


Leu 


Ala 


His 


Val 


Arg 


Arg 


Leu 


Ser 


Asp 


He 


Thr 


Leu 


Thr 


705 










710 










715 










720 


Ala 


Ser 


Val 


Gly 


Trp 


Asn 


Pro 


Glu 


Asp 


Asp 


Gin 


Ala 


Trp 


Leu 


Leu 


Gin 










725 










730 










735 




His 


Thr 


Leu 


Phe 


Ala 


Cys 


Lys 


Glu 


Leu 


Cys 


Arg 


Asp 


Leu 


Ser 


He 


Asn 








740 










745 










750 






Phe 


Ala 


He 


Thr 


Ser 


Ala 


Gly 


Ser 


Thr 


Pro 


Cys 


Leu 


Ser 


Glu 


Glu 


Leu 






755 










760 










765 








He 


Ser 


Ala 


Thr 


Gin 


Gin 


His 


Gin 


Thr 


Val 


Ala 


Pro 


Val 


Pro 


Phe 


Asn 




770 










775 










780 










Ala 


Val 


Val 


He 


Thr 


Ala 


Thr 


Ala 


Glu 


Val 


Lys 


Ser 


Ser 


Arg 


Gin 


Arg 



785 790 795 800 
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Val Thr 


Pro 


Asp 


Leu 


Lys 


Ala 


Thr 


Gly 


Asn 


Leu 


He 


Val 


Leu 


Val 


Ser 








805 










810 










815 




Phe Pro 


Val 


Pro 


His 


Leu 


Thr 


Gin 


Gly 


Ser 


Thr 


Phe 


Glu 


His 


Leu 


Cys 






820 










825 










830 






Leu Leu 


Pro 


Ser 


Pro 


Thr 


Leu 


Pro 


Asp 


Val 


Gin 


Ala 


Thr 


His 


Leu 


Ala 




835 










840 










845 








Asn Leu 


Phe 


Met 


Leu 


Thr 


Glu 


Ala 


Leu 


Leu 


Ser 


Arg 


Gly 


Leu 


Val 


Val 


850 










855 










860 










Ser Gly 


His 


Asp 


Val 


Ser 


Asp 


Gly 


Gly 


Met 


Val 


Val 


Thr 


Ala 


He 


Glu 


865 








870 










875 










880 


Met Ala 


Leu 


Ala 


Gly 


Asn 


Arg 


Gly 


Leu 


Gin 


He 


Arg 


He 


Pro 


Ser 


Glu 








885 










890 










895 




Glu Thr 


Pro 


Leu 


Gin 


Trp 


Leu 


Val 


Ser 


Glu 


Thr 


Pro 


Gly 


Val 


lie 


Phe 






900 










905 










910 






Glu He 


Gin 


Pro 


Gin 


His 


Val 


Asp 


Glu 


Val 


Arg 


Gin 


Ala 


Cys 


Gin 


Asn 




915 










920 










925 








Phe Asp 


Cys 


Arg 


Ala 


Thr 


Val 


Cys 


Gly 


Thr 


Val 


Gly 


Gin 


Glu 


Gly 


Leu 


930 










935 










940 










Ser Glu 


Arg 


He 


Val 


He 


Ser 


His 


Asn 


Asn 


Asp 


Glu 


Val 


Tyr 


Ser 


Gin 


945 






950 










955 










960 


Thr Leu 


Thr 


Ser 


Val 


Ala 


Ala 


Asn 


Trp 


Thr 


Ser 


Phe 


Ser 


Asp 


Glu 


Gin 








965 










970 










975 




Trp Tyr 


Ser 


Trp 


Gly 


Pro 


Ser 


Phe 


Thr 


Pro 


Ala 


Gin 


Glu 


Leu 


Tyr 


Arg 




980 










985 










990 






Lys Asp 


Tyr 


Gly 


Cys 


Asn 


Gin 


His 


Asn 


Leu 


Gly 


His 


Leu 


Ala 


Glu 


Val 


995 








1000 








1005 








Cys Arg 


Asn 


Ser 


Glu 


Leu 


Thr 


Leu 


Phe 


Ala 


Thr 


Pro 


Ser 


Arg 


Pro 


Pro 


1010 








1015 








1020 










Ala Val 


Ala 


Ala 


Leu 


Val 


Thr 


Pro 


Gly 


Ala 


Pro 


Leu 


Pro 


Arg 


Ala 


Leu 


1025 






1030 








1035 








1040 


Met Ala 


Ala 


Phe 


Thr 


Asn 


Val 


Gly 


Phe 


Asp 


Val 


Ala 


Ala 


Val 


Ser 


Thr 






1045 








1050 








1055 




Asp Asp 


Leu 


Arg 


Gly 


Gly 


Asn 


He 


Leu 


Arg 


Gly 


Phe 


Ser 


Gly 


Leu 


Thr 


1060 








1065 








1070 






He Gly 


Gly 


Thr 


Val 


Gly 


He 


Glu 


Asp 


Ser 


Tyr 


Val 


Gly 


Ala 


Arg 


Cys 


L075 








1080 








1085 








Ala He 


Met 


Gly 


Leu 


Leu 


Asn 


Asp 


Pro 


Gly 


Cys 


Tyr 


Gly 


Gly 


Leu 


Met 


1090 








1095 








1100 










Ala Phe 


Phe 


Arg 


Arg 


Ala 


Asp 


Thr 


Phe 


Ser 


Leu 


Cys 


Cys 


Gly 


Glu 


Phe 


1105 






1110 








1115 








1120 


Gly Phe 


Gin 


Leu 


Leu 


Gly 


Ala 


Leu 


Gly 


Leu 


Leu 


Arg 


Glu 


Thr 


Pro 


His 




1125 








1130 








1135 




Asp Thr 


Pro 


Gly 


Pro 


Lys 


Thr 


Pro 


Asp 


Gin 


Trp 


Asp 


He 


His 


Leu 


Glu 


1140 








1145 








1150 






Glu Asn 


Ala 


Ser 


Gly 


Asn 


His 


Glu 


Cys 


Leu 


Trp 


Leu 


Asn 


Leu 


His 


He 


1155 








1160 








1165 








Pro Gin 


Thr 


Thr 


He 


Ser 


He 


Met 


Phe 


Arg 


Val 


Leu 


Arg 


Gly 


Leu 


Val 


1170 








1175 








1180 










Leu Pro 


Gly Trp 


Ala 


Asn 


Gly 


Arg 


Tyr 


Leu 


Gly 


Val 


Arg 


Tyr 


Pro 


Arg 


1185 






1190 








1195 










1200 


Asp Ala 


Met 


Glu 


Tyr 


His 


Leu 


Asn 


Gin 


Gin 


Gin 


Arg 


He 


Ala 


Leu 


Asn 




1205 










1210 










1215 




Tyr His 


Thr 


Gly 


Asn 


Ala 


Asp 


Pro 


Arg 


Met 


Phe 


Ala 


Gin 


His 


Tyr 


Pro 


1220 










1225 










1230 






Arg Asn 


Pro 


Ser 


Ala 


Asn 


Ser 


Ala 


Val 


Ala 


Ala 


He 


Thr 


Ser 


Pro 


Asp 


1235 










1240 










1245 








Gly Arg 


His 


Leu 


Ala 


Ser 


Leu 


Val 


Asp 


Pro 


Ala 


Val 


Thr 


Phe 


His 


Pro 


1250 










1255 










1260 










Trp Gin 


Trp 


Ala 


Tyr 


Val 


Pro 


Pro 


Glu 


Leu 


Ala 


Asp 


Met 


Thr 


Val 


Ser 


1265 








1270 










1275 










1280 


Pro Trp 


Ala 


Leu 


Ala 


Phe 


Gin 


Ser 


Leu 


Phe 


Leu 


Trp 


Cys 


He 


Arg 


Asn 



289 



1285 1290 

Arg Gin 



<210> 166 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR Primer 
<400> 166 

cctatgggct ccatgagc 



<210> 167 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR Primer 
<400> 167 

atcgtcaatc aggctgcg 



<210> 168 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR Primer 
<400> 168 

atattaaaca ctcgccgc 



<210> 169 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR Primer 

<400> 169 

atgaggggcc ttttcgtgtg c 



<210> 170 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR Primer 



290 



<400> 170 

ctgaatcccg ctgccaaggc c 



21 



<210> 171 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR Primer 



<400> 171 

atgttccctg tctggttcgt c 



21 



<210> 172 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR Primer 



<400> 172 

ttacatcata gctattgcgc g 



21 



<210> 173 
<211> 271 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 
<400> 173 

gcggcgcccg tgtttttggc ctgctgccag gtggcagccg tccaggcgcg tcactaccga 60 
aacccgtgct ctgcggccac tatcttcgcc agcgccacca ccggcctcct ctgcggcgcg 120 
cgtcaccgaa accccaaaca tgatgcttta ggttgcgttt ggggatatct ttcggtgccc 180 
tccgaacgac ccggcgcggc ggggattttt ccgggtcgtt cggagggcac cgaaagatgt 24 0 
tgccacgcgc gcgactggcg accgtgaaaa c 271 



<210> 174 
<211> 304 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 
<400> 174 

gcctagctcc taatgtttgc cttgccgcct agctcctaat gtttgccttg ccgcctagct 60 
cctaatgttt gccttgccgc ctagctccta atgtttgcct tgccgcctag ctcctaatgt 120 
ttgccttgcc gcctagctcc taatgtttgc cttgccgcct agctcctaat gtttgccttg 180 
ccgcctagct cctaatgttt gccttgccgc ctagctccta atgtttgcct tgccgcctag 240 
ctcctaatgt ttgccttgcc gcctagctcc taatgtttgc cttgccgcct agctcctaat 300 
gttt 304 



<210> 175 
<211> 1008 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 



<400> 175 



291 



gttccccgag 
ggtcccgggc 
tccccgttcc 
ccgagggtcc 
cgggctcccc 
gttccccgag 
ggtcccgggc 
tccccgttcc 
ccgagggtcc 
cgggctcccc 
gttccccgag 
ggtcccgggc 
tccccgttcc 
ccgagggtcc 
cgggctcccc 
gttccccgag 
ggtcccgggc 



ggtcccgggc 
tccccgttcc 
ccgagggtcc 
cgggctcccc 
gttccccgag 
ggtcccgggc 
tccccgttcc 
ccgagggtcc 
cgggctcccc 
gttccccgag 
ggtcccgggc 
tccccgttcc 
ccgagggtcc 
cgggctcccc 
gttccccgag 
ggtcccgggc 
tccccgttcc 



tccccgttcc 
ccgagggtcc 
cgggctcccc 
gttccccgag 
ggtcccgggc 
tccccgttcc 
ccgagggtcc 
cgggctcccc 
gttccccgag 
ggtcccgggc 
tccccgttcc 
ccgagggtcc 
cgggctcccc 
gttccccgag 
ggtcccgggc 
tccccgttcc 
ccgagggtcc 



ccgagggtcc 
cgggctcccc 
gttccccgag 
ggtcccgggc 
tccccgttcc 
ccgagggtcc 
cgggctcccc 
gttccccgag 
ggtcccgggc 
tccccgttcc 
ccgagggtcc 
cgggctcccc 
gttccccgag 
ggtcccgggc 
tccccgttcc 
ccgagggtcc 
cgggctcccc 



cgggctcccc 
gttccccgag 
ggtcccgggc 
tccccgttcc 
ccgagggtcc 
cgggctcccc 
gttccccgag 
ggtcccgggc 
tccccgttcc 
ccgagggtcc 
cgggctcccc 
gttccccgag 
ggtcccgggc 
tccccgttcc 
ccgagggtcc 
cgggctcccc 
gttccccg 



gttccccgag 
ggtcccgggc 
tccccgttcc 
ccgagggtcc 
cgggctcccc 
gttccccgag 
ggtcccgggc 
tccccgttcc 
ccgagggtcc 
cgggctcccc 
gttccccgag 
ggtcccgggc 
tccccgttcc 
ccgagggtcc 
cgggctcccc 
gttccccgag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1008 



<210> 176 
<211> 405 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 
<400> 176 

ggtggggtgc aagtcccccg gtggggtgca agtcccccgg tggggtgcag gtcccccggt 60 
ggggtgcagg tcccccggtg gggtgcaagt cccccggtgg ggtgcaagtc ccccggtggg 120 
gtgcaagtcc cccggtgggg tgcaagtccc ccggtggggt gcaagtcccc cggtggggtg 180 
caagtccccc ggtggggtgc aagtcccccg gtggggtgca agtcccccgg tggggtgcac 240 
gtcccccggt ggggtgcaag tcccccggtg gggtgcaagt cccccggtgg ggtgcaagtc 300 
ccccggtggg gtgcaagtcc cccggtgggg ttcaagtccc ccggtggggt gcaagtcccc 360 
cggtggggtg caagtccccc ggtggggtgc aggtcccccg gtggg 



405 



<210> 177 
<211> 1029 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 
<400> 177 

gcggctcggc tccggggtgg ctccgggtgg gggcggctcg gctccggggt ggctccgggt 60 
gggggcggct cggctccggg gtggctccgg gtgggggcgg ctcggctccg gggtggctcc 120 
gggtgggggc ggctcggctc cggggtggct ccgggtgggg gcggctcggc tccggggtgg 18 0 
ctccgggtgg gggcggctcg gctccggggt ggctccgggt gggggcggct cggctccggg 240 
gtggctccgg gtgggggcgg ctcggctccg gggtggctcc gggtgggggc ggctcggctc 300 
cggggtggct ccgggtgggg gcggctcggc tccggggtgg ctccgggtgg gggcggctcg 360 
gctccggggt ggctccgggt gggggcggct cggctccggg gtggctccgg gtgggggcgg 420 
ctcggctccg gggtggctcc gggtgggggc ggctcggctc cggggtggct ccgggtgggg 4 80 
gcggctcggc tccggggtgg ctccgggtgg gggcggctcg gctccggggt ggctccgggt 54 0 
gggggcggct cggctccggg gtggctccgg gtgggggcgg ctcggctccg gggtggctcc 600 
gggtgggggc ggctcggctc cggggtggct ccgggtgggg gcggctcggc tccggggtgg 660 
ctccgggtgg gggcggctcg gctccggggt ggctccgggt gggggcggct cggctccggg 720 
gtggctccgg gtgggggcgg ctcggctccg gggtggctcc gggtgggggc ggctcggctc 780 
cggggtggct ccgggtgggg gcggctcggc tccggggtgg ctccgggtgg gggcggctcg 84 0 
gctccggggt ggctccgggt gggggcggct cggctccggg gtggctccgg gtgggggcgg 900 
ctcggctccg gggtggctcc gggtgggggc ggctcggctc cggggtggct ccgggtgggg 960 
qcqqctcqqc tccggggtgg ctccgggtgg gggcggctcg gctccggggt ggctccgggt 1020 

' 1029 
gggggcggc 1 ■ 



<210> 178 



292 



<211> 196 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 



<400> 178 

ggtaagggct aggggtaagg gcaaggggta 
caaggggtaa gggcaagggg taagggcaag 
aagggcaagg ggtaagggca aggggtaagg 
ggggtaaggg ctaggg 



a ggg cta ggg gtaagggcaa ggggtaaggg 60 
gggtaagggc aaggggtaag ggcaaggggt 12 0 
gctaggggta agggctaggg gtaagggcta 180 

196 



<210> 179 
<211> 1387 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 



<400> 179 

cctagcggcc gggcccggag gcgcccggcc 
tgcgcccggg gcgagggtcc cccgcgcgcc 
ctcccccggc tcccggccct ccgccccgcc 
gccccgcgcc gcgcgccggg ccgctttcgg 
ggggccgccc ccgggtgccg ctccgccggg 
ggccgagccc ggcggcgccg cgccccgacg 
gcgcgcggct cccgatgccg ggcggccgcc 
cctccccca.c gccgccccga aaggtggtct 
gggcgcctcg gcggggcccg gcgcggggcg 
gggaaaacgc gaggggagcg ggggacaggg 
gtacggctgc ctgcctgctc gctggcctgc 
gctggcctgc ttgctgaggg gacagtaggc 
gctcgctggc ctgcttgctg aggggacagt 
gctggcctgc ttgcttgctc gctggcctgc 
gctggcctgc ttgcttgctc gctggcctgc 
gctggcctgc ttgcttgctc gctggcctgc 
gctaagggga cggtacgcct gcctgatggc 
ggctgctggg ctgctagtag ggctgctggg 
ggctcctggg ctgctagtag ggctgctggg 
ggctgctggg ctgctagtag ggctgctggg 
ggctgcctgc tggcttgctt gcttgcttgc 
gctgtgcagc tgggagaaca gagtagggct 
ggccagacga ggacacggga cccgggcctc 
cagatct 



ggcgccgccg ccgggccgcg gccgccatct 60 

ccccgggccc gcgcgccggg cgccgccggc 120 

ggcccgggcc cgcggccgcc gcccccggcc 180 

ttcgcggggc cgggggtccc gcggggggcc 240 

cccggccgac tcccgggagc gccccggtcc 300 

ccccccgggc ccggggcccc acaagccgcg 360 

gcccggcatg gcggtcctcc gccggcctcc 420 

ccgcgccgcc gggagggggg ccggggcccg 480 

cgaccgaggg ccccgggaga acgggggatc 54 0 

gacggcgtgt gcgtgcttgt gagacaccgg 600 

ttgctgaggg gacagtaggc ctgcttgctc 660 

ctgcttgctg aggggacagt aggcctgctt 720 

agggctgctg gcttgctagt agggctgctc 780 

ttgcttgctc gctggcctgc ttgcttgctc 840 

ttgcttgctc gctggcctgc ttgcttgctc 900 

ttgctgaggg gacagtaggg ctgcttgctt 960 

ttgatagtag ggctgctggg ctgctagtag 1020 

ctgctagtag ggctgctggg ctgctagtag 1080 

ctgctagtag ggctcctggg ctgctagtag 1140 

ctgctagtag ggctgctggg ctgctagtag 1200 

tagtggggcc gcttgcctgc tactagggct 1260 

gccggccagc tgcgtgcgag ggcgtccgag 1320 

tcccccgccc ggaccgccgg gcacccggcc 1380 

1387 
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